Refining Mitochondrial Intron Classification With ERPIN: Identification Based on Conservation of Sequence Plus Secondary Structure Motifs

Mitochondrial genomes—in particular those of fungi—often encode genes with a large number of Group I and Group II introns that are conserved at both the sequence and the RNA structure level. They provide a rich resource for the investigation of intron and gene structure, self- and protein-guided spl...

Full description

Bibliographic Details
Main Authors: Samuel Prince, Carl Munoz, Fannie Filion-Bienvenue, Pierre Rioux, Matt Sarrasin, B. Franz Lang
Format: Article
Language:English
Published: Frontiers Media S.A. 2022-03-01
Series:Frontiers in Microbiology
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fmicb.2022.866187/full
_version_ 1818392748434128896
author Samuel Prince
Carl Munoz
Fannie Filion-Bienvenue
Pierre Rioux
Matt Sarrasin
B. Franz Lang
author_facet Samuel Prince
Carl Munoz
Fannie Filion-Bienvenue
Pierre Rioux
Matt Sarrasin
B. Franz Lang
author_sort Samuel Prince
collection DOAJ
description Mitochondrial genomes—in particular those of fungi—often encode genes with a large number of Group I and Group II introns that are conserved at both the sequence and the RNA structure level. They provide a rich resource for the investigation of intron and gene structure, self- and protein-guided splicing mechanisms, and intron evolution. Yet, the degree of sequence conservation of introns is limited, and the primary sequence differs considerably among the distinct intron sub-groups. It makes intron identification, classification, structural modeling, and the inference of gene models a most challenging and error-prone task—frequently passed on to an “expert” for manual intervention. To reduce the need for manual curation of intron structures and mitochondrial gene models, computational methods using ERPIN sequence profiles were initially developed in 2007. Here we present a refinement of search models and alignments using the now abundant publicly available fungal mtDNA sequences. In addition, we have tested in how far members of the originally proposed sub-groups are clearly distinguished and validated by our computational approach. We confirm clearly distinct mitochondrial Group I sub-groups IA1, IA3, IB3, IC1, IC2, and ID. Yet, IB1, IB2, and IB4 ERPIN models are overlapping substantially in predictions, and are therefore combined and reported as IB. We have further explored the conversion of our ERPIN profiles into covariance models (CM). Current limitations and prospects of the CM approach will be discussed.
first_indexed 2024-12-14T05:34:21Z
format Article
id doaj.art-845b3653d8f44c05ba3cf79c41518ecc
institution Directory Open Access Journal
issn 1664-302X
language English
last_indexed 2024-12-14T05:34:21Z
publishDate 2022-03-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Microbiology
spelling doaj.art-845b3653d8f44c05ba3cf79c41518ecc2022-12-21T23:15:14ZengFrontiers Media S.A.Frontiers in Microbiology1664-302X2022-03-011310.3389/fmicb.2022.866187866187Refining Mitochondrial Intron Classification With ERPIN: Identification Based on Conservation of Sequence Plus Secondary Structure MotifsSamuel PrinceCarl MunozFannie Filion-BienvenuePierre RiouxMatt SarrasinB. Franz LangMitochondrial genomes—in particular those of fungi—often encode genes with a large number of Group I and Group II introns that are conserved at both the sequence and the RNA structure level. They provide a rich resource for the investigation of intron and gene structure, self- and protein-guided splicing mechanisms, and intron evolution. Yet, the degree of sequence conservation of introns is limited, and the primary sequence differs considerably among the distinct intron sub-groups. It makes intron identification, classification, structural modeling, and the inference of gene models a most challenging and error-prone task—frequently passed on to an “expert” for manual intervention. To reduce the need for manual curation of intron structures and mitochondrial gene models, computational methods using ERPIN sequence profiles were initially developed in 2007. Here we present a refinement of search models and alignments using the now abundant publicly available fungal mtDNA sequences. In addition, we have tested in how far members of the originally proposed sub-groups are clearly distinguished and validated by our computational approach. We confirm clearly distinct mitochondrial Group I sub-groups IA1, IA3, IB3, IC1, IC2, and ID. Yet, IB1, IB2, and IB4 ERPIN models are overlapping substantially in predictions, and are therefore combined and reported as IB. We have further explored the conversion of our ERPIN profiles into covariance models (CM). Current limitations and prospects of the CM approach will be discussed.https://www.frontiersin.org/articles/10.3389/fmicb.2022.866187/fullmitochondrial intronsgroup IERPINcovariance modelsinfernalRNA structure
spellingShingle Samuel Prince
Carl Munoz
Fannie Filion-Bienvenue
Pierre Rioux
Matt Sarrasin
B. Franz Lang
Refining Mitochondrial Intron Classification With ERPIN: Identification Based on Conservation of Sequence Plus Secondary Structure Motifs
Frontiers in Microbiology
mitochondrial introns
group I
ERPIN
covariance models
infernal
RNA structure
title Refining Mitochondrial Intron Classification With ERPIN: Identification Based on Conservation of Sequence Plus Secondary Structure Motifs
title_full Refining Mitochondrial Intron Classification With ERPIN: Identification Based on Conservation of Sequence Plus Secondary Structure Motifs
title_fullStr Refining Mitochondrial Intron Classification With ERPIN: Identification Based on Conservation of Sequence Plus Secondary Structure Motifs
title_full_unstemmed Refining Mitochondrial Intron Classification With ERPIN: Identification Based on Conservation of Sequence Plus Secondary Structure Motifs
title_short Refining Mitochondrial Intron Classification With ERPIN: Identification Based on Conservation of Sequence Plus Secondary Structure Motifs
title_sort refining mitochondrial intron classification with erpin identification based on conservation of sequence plus secondary structure motifs
topic mitochondrial introns
group I
ERPIN
covariance models
infernal
RNA structure
url https://www.frontiersin.org/articles/10.3389/fmicb.2022.866187/full
work_keys_str_mv AT samuelprince refiningmitochondrialintronclassificationwitherpinidentificationbasedonconservationofsequenceplussecondarystructuremotifs
AT carlmunoz refiningmitochondrialintronclassificationwitherpinidentificationbasedonconservationofsequenceplussecondarystructuremotifs
AT fanniefilionbienvenue refiningmitochondrialintronclassificationwitherpinidentificationbasedonconservationofsequenceplussecondarystructuremotifs
AT pierrerioux refiningmitochondrialintronclassificationwitherpinidentificationbasedonconservationofsequenceplussecondarystructuremotifs
AT mattsarrasin refiningmitochondrialintronclassificationwitherpinidentificationbasedonconservationofsequenceplussecondarystructuremotifs
AT bfranzlang refiningmitochondrialintronclassificationwitherpinidentificationbasedonconservationofsequenceplussecondarystructuremotifs