In silico identification of multiple conserved motifs within the control region of Culicidae mitogenomes
Abstract Mosquitoes are important vectors for human and animal diseases. Genetic markers, like the mitochondrial COI gene, can facilitate the taxonomic classification of disease vectors, vector-borne disease surveillance, and prevention. Within the control region (CR) of the mitochondrial genome, th...
Main Authors: | , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Nature Portfolio
2022-12-01
|
Series: | Scientific Reports |
Online Access: | https://doi.org/10.1038/s41598-022-26236-5 |
_version_ | 1797977484705136640 |
---|---|
author | Thomas M. R. Harrison Josip Rudar Nicholas Ogden Royce Steeves David R. Lapen Donald Baird Nellie Gagné Oliver Lung |
author_facet | Thomas M. R. Harrison Josip Rudar Nicholas Ogden Royce Steeves David R. Lapen Donald Baird Nellie Gagné Oliver Lung |
author_sort | Thomas M. R. Harrison |
collection | DOAJ |
description | Abstract Mosquitoes are important vectors for human and animal diseases. Genetic markers, like the mitochondrial COI gene, can facilitate the taxonomic classification of disease vectors, vector-borne disease surveillance, and prevention. Within the control region (CR) of the mitochondrial genome, there exists a highly variable and poorly studied non-coding AT-rich area that contains the origin of replication. Although the CR hypervariable region has been used for species differentiation of some animals, few studies have investigated the mosquito CR. In this study, we analyze the mosquito mitogenome CR sequences from 125 species and 17 genera. We discovered four conserved motifs located 80 to 230 bp upstream of the 12S rRNA gene. Two of these motifs were found within all 392 Anopheles (An.) CR sequences while the other two motifs were identified in all 37 Culex (Cx.) CR sequences. However, only 3 of the 304 non-Culicidae Dipteran mitogenome CR sequences contained these motifs. Interestingly, the short motif found in all 37 Culex sequences had poly-A and poly-T stretch of similar length that is predicted to form a stable hairpin. We show that supervised learning using the frequency chaos game representation of the CR can be used to differentiate mosquito genera from their dipteran relatives. |
first_indexed | 2024-04-11T05:07:39Z |
format | Article |
id | doaj.art-9436e9290347420f8907a06fb6d2db46 |
institution | Directory Open Access Journal |
issn | 2045-2322 |
language | English |
last_indexed | 2024-04-11T05:07:39Z |
publishDate | 2022-12-01 |
publisher | Nature Portfolio |
record_format | Article |
series | Scientific Reports |
spelling | doaj.art-9436e9290347420f8907a06fb6d2db462022-12-25T12:13:14ZengNature PortfolioScientific Reports2045-23222022-12-0112111610.1038/s41598-022-26236-5In silico identification of multiple conserved motifs within the control region of Culicidae mitogenomesThomas M. R. Harrison0Josip Rudar1Nicholas Ogden2Royce Steeves3David R. Lapen4Donald Baird5Nellie Gagné6Oliver Lung7Canadian Food Inspection Agency, National Centre for Foreign Animal DiseaseCanadian Food Inspection Agency, National Centre for Foreign Animal DiseasePublic Health Risk Sciences Division, National Microbiology Laboratory, Public Health Agency of CanadaGulf Fisheries Centre, Fisheries & Oceans CanadaOttawa Research Development Centre, Agriculture & Agri-Food CanadaEnvironment and Climate Change Canada, Canadian Rivers Institute, Department of Biology, University of New BrunswickGulf Fisheries Centre, Fisheries & Oceans CanadaCanadian Food Inspection Agency, National Centre for Foreign Animal DiseaseAbstract Mosquitoes are important vectors for human and animal diseases. Genetic markers, like the mitochondrial COI gene, can facilitate the taxonomic classification of disease vectors, vector-borne disease surveillance, and prevention. Within the control region (CR) of the mitochondrial genome, there exists a highly variable and poorly studied non-coding AT-rich area that contains the origin of replication. Although the CR hypervariable region has been used for species differentiation of some animals, few studies have investigated the mosquito CR. In this study, we analyze the mosquito mitogenome CR sequences from 125 species and 17 genera. We discovered four conserved motifs located 80 to 230 bp upstream of the 12S rRNA gene. Two of these motifs were found within all 392 Anopheles (An.) CR sequences while the other two motifs were identified in all 37 Culex (Cx.) CR sequences. However, only 3 of the 304 non-Culicidae Dipteran mitogenome CR sequences contained these motifs. Interestingly, the short motif found in all 37 Culex sequences had poly-A and poly-T stretch of similar length that is predicted to form a stable hairpin. We show that supervised learning using the frequency chaos game representation of the CR can be used to differentiate mosquito genera from their dipteran relatives.https://doi.org/10.1038/s41598-022-26236-5 |
spellingShingle | Thomas M. R. Harrison Josip Rudar Nicholas Ogden Royce Steeves David R. Lapen Donald Baird Nellie Gagné Oliver Lung In silico identification of multiple conserved motifs within the control region of Culicidae mitogenomes Scientific Reports |
title | In silico identification of multiple conserved motifs within the control region of Culicidae mitogenomes |
title_full | In silico identification of multiple conserved motifs within the control region of Culicidae mitogenomes |
title_fullStr | In silico identification of multiple conserved motifs within the control region of Culicidae mitogenomes |
title_full_unstemmed | In silico identification of multiple conserved motifs within the control region of Culicidae mitogenomes |
title_short | In silico identification of multiple conserved motifs within the control region of Culicidae mitogenomes |
title_sort | in silico identification of multiple conserved motifs within the control region of culicidae mitogenomes |
url | https://doi.org/10.1038/s41598-022-26236-5 |
work_keys_str_mv | AT thomasmrharrison insilicoidentificationofmultipleconservedmotifswithinthecontrolregionofculicidaemitogenomes AT josiprudar insilicoidentificationofmultipleconservedmotifswithinthecontrolregionofculicidaemitogenomes AT nicholasogden insilicoidentificationofmultipleconservedmotifswithinthecontrolregionofculicidaemitogenomes AT roycesteeves insilicoidentificationofmultipleconservedmotifswithinthecontrolregionofculicidaemitogenomes AT davidrlapen insilicoidentificationofmultipleconservedmotifswithinthecontrolregionofculicidaemitogenomes AT donaldbaird insilicoidentificationofmultipleconservedmotifswithinthecontrolregionofculicidaemitogenomes AT nelliegagne insilicoidentificationofmultipleconservedmotifswithinthecontrolregionofculicidaemitogenomes AT oliverlung insilicoidentificationofmultipleconservedmotifswithinthecontrolregionofculicidaemitogenomes |