In silico identification of multiple conserved motifs within the control region of Culicidae mitogenomes

Abstract Mosquitoes are important vectors for human and animal diseases. Genetic markers, like the mitochondrial COI gene, can facilitate the taxonomic classification of disease vectors, vector-borne disease surveillance, and prevention. Within the control region (CR) of the mitochondrial genome, th...

Full description

Bibliographic Details
Main Authors: Thomas M. R. Harrison, Josip Rudar, Nicholas Ogden, Royce Steeves, David R. Lapen, Donald Baird, Nellie Gagné, Oliver Lung
Format: Article
Language:English
Published: Nature Portfolio 2022-12-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-022-26236-5
_version_ 1797977484705136640
author Thomas M. R. Harrison
Josip Rudar
Nicholas Ogden
Royce Steeves
David R. Lapen
Donald Baird
Nellie Gagné
Oliver Lung
author_facet Thomas M. R. Harrison
Josip Rudar
Nicholas Ogden
Royce Steeves
David R. Lapen
Donald Baird
Nellie Gagné
Oliver Lung
author_sort Thomas M. R. Harrison
collection DOAJ
description Abstract Mosquitoes are important vectors for human and animal diseases. Genetic markers, like the mitochondrial COI gene, can facilitate the taxonomic classification of disease vectors, vector-borne disease surveillance, and prevention. Within the control region (CR) of the mitochondrial genome, there exists a highly variable and poorly studied non-coding AT-rich area that contains the origin of replication. Although the CR hypervariable region has been used for species differentiation of some animals, few studies have investigated the mosquito CR. In this study, we analyze the mosquito mitogenome CR sequences from 125 species and 17 genera. We discovered four conserved motifs located 80 to 230 bp upstream of the 12S rRNA gene. Two of these motifs were found within all 392 Anopheles (An.) CR sequences while the other two motifs were identified in all 37 Culex (Cx.) CR sequences. However, only 3 of the 304 non-Culicidae Dipteran mitogenome CR sequences contained these motifs. Interestingly, the short motif found in all 37 Culex sequences had poly-A and poly-T stretch of similar length that is predicted to form a stable hairpin. We show that supervised learning using the frequency chaos game representation of the CR can be used to differentiate mosquito genera from their dipteran relatives.
first_indexed 2024-04-11T05:07:39Z
format Article
id doaj.art-9436e9290347420f8907a06fb6d2db46
institution Directory Open Access Journal
issn 2045-2322
language English
last_indexed 2024-04-11T05:07:39Z
publishDate 2022-12-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj.art-9436e9290347420f8907a06fb6d2db462022-12-25T12:13:14ZengNature PortfolioScientific Reports2045-23222022-12-0112111610.1038/s41598-022-26236-5In silico identification of multiple conserved motifs within the control region of Culicidae mitogenomesThomas M. R. Harrison0Josip Rudar1Nicholas Ogden2Royce Steeves3David R. Lapen4Donald Baird5Nellie Gagné6Oliver Lung7Canadian Food Inspection Agency, National Centre for Foreign Animal DiseaseCanadian Food Inspection Agency, National Centre for Foreign Animal DiseasePublic Health Risk Sciences Division, National Microbiology Laboratory, Public Health Agency of CanadaGulf Fisheries Centre, Fisheries & Oceans CanadaOttawa Research Development Centre, Agriculture & Agri-Food CanadaEnvironment and Climate Change Canada, Canadian Rivers Institute, Department of Biology, University of New BrunswickGulf Fisheries Centre, Fisheries & Oceans CanadaCanadian Food Inspection Agency, National Centre for Foreign Animal DiseaseAbstract Mosquitoes are important vectors for human and animal diseases. Genetic markers, like the mitochondrial COI gene, can facilitate the taxonomic classification of disease vectors, vector-borne disease surveillance, and prevention. Within the control region (CR) of the mitochondrial genome, there exists a highly variable and poorly studied non-coding AT-rich area that contains the origin of replication. Although the CR hypervariable region has been used for species differentiation of some animals, few studies have investigated the mosquito CR. In this study, we analyze the mosquito mitogenome CR sequences from 125 species and 17 genera. We discovered four conserved motifs located 80 to 230 bp upstream of the 12S rRNA gene. Two of these motifs were found within all 392 Anopheles (An.) CR sequences while the other two motifs were identified in all 37 Culex (Cx.) CR sequences. However, only 3 of the 304 non-Culicidae Dipteran mitogenome CR sequences contained these motifs. Interestingly, the short motif found in all 37 Culex sequences had poly-A and poly-T stretch of similar length that is predicted to form a stable hairpin. We show that supervised learning using the frequency chaos game representation of the CR can be used to differentiate mosquito genera from their dipteran relatives.https://doi.org/10.1038/s41598-022-26236-5
spellingShingle Thomas M. R. Harrison
Josip Rudar
Nicholas Ogden
Royce Steeves
David R. Lapen
Donald Baird
Nellie Gagné
Oliver Lung
In silico identification of multiple conserved motifs within the control region of Culicidae mitogenomes
Scientific Reports
title In silico identification of multiple conserved motifs within the control region of Culicidae mitogenomes
title_full In silico identification of multiple conserved motifs within the control region of Culicidae mitogenomes
title_fullStr In silico identification of multiple conserved motifs within the control region of Culicidae mitogenomes
title_full_unstemmed In silico identification of multiple conserved motifs within the control region of Culicidae mitogenomes
title_short In silico identification of multiple conserved motifs within the control region of Culicidae mitogenomes
title_sort in silico identification of multiple conserved motifs within the control region of culicidae mitogenomes
url https://doi.org/10.1038/s41598-022-26236-5
work_keys_str_mv AT thomasmrharrison insilicoidentificationofmultipleconservedmotifswithinthecontrolregionofculicidaemitogenomes
AT josiprudar insilicoidentificationofmultipleconservedmotifswithinthecontrolregionofculicidaemitogenomes
AT nicholasogden insilicoidentificationofmultipleconservedmotifswithinthecontrolregionofculicidaemitogenomes
AT roycesteeves insilicoidentificationofmultipleconservedmotifswithinthecontrolregionofculicidaemitogenomes
AT davidrlapen insilicoidentificationofmultipleconservedmotifswithinthecontrolregionofculicidaemitogenomes
AT donaldbaird insilicoidentificationofmultipleconservedmotifswithinthecontrolregionofculicidaemitogenomes
AT nelliegagne insilicoidentificationofmultipleconservedmotifswithinthecontrolregionofculicidaemitogenomes
AT oliverlung insilicoidentificationofmultipleconservedmotifswithinthecontrolregionofculicidaemitogenomes