Insertion/deletion hotspots in the Nsp2, Nsp3, S1, and ORF8 genes of SARS-related coronaviruses

Abstract The genome of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) contains many insertions/deletions (indels) from the genomes of other SARS-related coronaviruses. Some of the identified indels have recently reported to involve relatively long segments of 10–300 consecutive bases a...

Full description

Bibliographic Details
Main Authors: Tetsuya Akaishi, Kei Fujiwara, Tadashi Ishii
Format: Article
Language:English
Published: BMC 2022-10-01
Series:BMC Ecology and Evolution
Subjects:
Online Access:https://doi.org/10.1186/s12862-022-02078-7
_version_ 1811197982225203200
author Tetsuya Akaishi
Kei Fujiwara
Tadashi Ishii
author_facet Tetsuya Akaishi
Kei Fujiwara
Tadashi Ishii
author_sort Tetsuya Akaishi
collection DOAJ
description Abstract The genome of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) contains many insertions/deletions (indels) from the genomes of other SARS-related coronaviruses. Some of the identified indels have recently reported to involve relatively long segments of 10–300 consecutive bases and with diverse RNA sequences around gaps between virus species, both of which are different characteristics from the classical shorter in-frame indels. These non-classical complex indels have been identified in non-structural protein 3 (Nsp3), the S1 domain of the spike (S), and open reading frame 8 (ORF8). To determine whether the occurrence of these non-classical indels in specific genomic regions is ubiquitous among broad species of SARS-related coronaviruses in different animal hosts, the present study compared SARS-related coronaviruses from humans (SARS-CoV and SARS-CoV-2), bats (RaTG13 and Rc-o319), and pangolins (GX-P4L), by performing multiple sequence alignment. As a result, indel hotspots with diverse RNA sequences of different lengths between the viruses were confirmed in the Nsp2 gene (approximately 2500–2600 base positions in the overall 29,900 bases), Nsp3 gene (approximately 3000–3300 and 3800–3900 base positions), N-terminal domain of the spike protein (21,500–22,500 base positions), and ORF8 gene (27,800–28,200 base positions). Abnormally high rate of point mutations and complex indels in these regions suggest that the occurrence of mutations in these hotspots may be selectively neutral or even benefit the survival of the viruses. The presence of such indel hotspots has not been reported in different human SARS-CoV-2 strains in the last 2 years, suggesting a lower rate of indels in human SARS-CoV-2. Future studies to elucidate the mechanisms enabling the frequent development of long and complex indels in specific genomic regions of SARS-related coronaviruses would offer deeper insights into the process of viral evolution.
first_indexed 2024-04-12T01:24:38Z
format Article
id doaj.art-d61e3b9f22ac467481ae2db1a35705dd
institution Directory Open Access Journal
issn 2730-7182
language English
last_indexed 2024-04-12T01:24:38Z
publishDate 2022-10-01
publisher BMC
record_format Article
series BMC Ecology and Evolution
spelling doaj.art-d61e3b9f22ac467481ae2db1a35705dd2022-12-22T03:53:42ZengBMCBMC Ecology and Evolution2730-71822022-10-012211810.1186/s12862-022-02078-7Insertion/deletion hotspots in the Nsp2, Nsp3, S1, and ORF8 genes of SARS-related coronavirusesTetsuya Akaishi0Kei Fujiwara1Tadashi Ishii2Department of Education and Support for Regional Medicine, Tohoku UniversityDepartment of Gastroenterology and Metabolism, Nagoya City UniversityDepartment of Education and Support for Regional Medicine, Tohoku UniversityAbstract The genome of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) contains many insertions/deletions (indels) from the genomes of other SARS-related coronaviruses. Some of the identified indels have recently reported to involve relatively long segments of 10–300 consecutive bases and with diverse RNA sequences around gaps between virus species, both of which are different characteristics from the classical shorter in-frame indels. These non-classical complex indels have been identified in non-structural protein 3 (Nsp3), the S1 domain of the spike (S), and open reading frame 8 (ORF8). To determine whether the occurrence of these non-classical indels in specific genomic regions is ubiquitous among broad species of SARS-related coronaviruses in different animal hosts, the present study compared SARS-related coronaviruses from humans (SARS-CoV and SARS-CoV-2), bats (RaTG13 and Rc-o319), and pangolins (GX-P4L), by performing multiple sequence alignment. As a result, indel hotspots with diverse RNA sequences of different lengths between the viruses were confirmed in the Nsp2 gene (approximately 2500–2600 base positions in the overall 29,900 bases), Nsp3 gene (approximately 3000–3300 and 3800–3900 base positions), N-terminal domain of the spike protein (21,500–22,500 base positions), and ORF8 gene (27,800–28,200 base positions). Abnormally high rate of point mutations and complex indels in these regions suggest that the occurrence of mutations in these hotspots may be selectively neutral or even benefit the survival of the viruses. The presence of such indel hotspots has not been reported in different human SARS-CoV-2 strains in the last 2 years, suggesting a lower rate of indels in human SARS-CoV-2. Future studies to elucidate the mechanisms enabling the frequent development of long and complex indels in specific genomic regions of SARS-related coronaviruses would offer deeper insights into the process of viral evolution.https://doi.org/10.1186/s12862-022-02078-7Bat coronavirusinsertion/deletion (indel)Indel hotspotPangolin coronavirus (PCoV)Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2)
spellingShingle Tetsuya Akaishi
Kei Fujiwara
Tadashi Ishii
Insertion/deletion hotspots in the Nsp2, Nsp3, S1, and ORF8 genes of SARS-related coronaviruses
BMC Ecology and Evolution
Bat coronavirus
insertion/deletion (indel)
Indel hotspot
Pangolin coronavirus (PCoV)
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2)
title Insertion/deletion hotspots in the Nsp2, Nsp3, S1, and ORF8 genes of SARS-related coronaviruses
title_full Insertion/deletion hotspots in the Nsp2, Nsp3, S1, and ORF8 genes of SARS-related coronaviruses
title_fullStr Insertion/deletion hotspots in the Nsp2, Nsp3, S1, and ORF8 genes of SARS-related coronaviruses
title_full_unstemmed Insertion/deletion hotspots in the Nsp2, Nsp3, S1, and ORF8 genes of SARS-related coronaviruses
title_short Insertion/deletion hotspots in the Nsp2, Nsp3, S1, and ORF8 genes of SARS-related coronaviruses
title_sort insertion deletion hotspots in the nsp2 nsp3 s1 and orf8 genes of sars related coronaviruses
topic Bat coronavirus
insertion/deletion (indel)
Indel hotspot
Pangolin coronavirus (PCoV)
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2)
url https://doi.org/10.1186/s12862-022-02078-7
work_keys_str_mv AT tetsuyaakaishi insertiondeletionhotspotsinthensp2nsp3s1andorf8genesofsarsrelatedcoronaviruses
AT keifujiwara insertiondeletionhotspotsinthensp2nsp3s1andorf8genesofsarsrelatedcoronaviruses
AT tadashiishii insertiondeletionhotspotsinthensp2nsp3s1andorf8genesofsarsrelatedcoronaviruses