Bioinformatics analysis of SARS coronavirus genome polymorphism

<p>Abstract</p> <p>Background</p> <p>We have compared 38 isolates of the SARS-CoV complete genome. The main goal was twofold: first, to analyze and compare nucleotide sequences and to identify positions of single nucleotide polymorphism (SNP), insertions and deletions,...

Full description

Bibliographic Details
Main Authors: Pavlović-Lažetić Gordana M, Mitić Nenad S, Beljanski Miloš V
Format: Article
Language:English
Published: BMC 2004-05-01
Series:BMC Bioinformatics
Subjects:
Online Access:http://www.biomedcentral.com/1471-2105/5/65
_version_ 1818823715682516992
author Pavlović-Lažetić Gordana M
Mitić Nenad S
Beljanski Miloš V
author_facet Pavlović-Lažetić Gordana M
Mitić Nenad S
Beljanski Miloš V
author_sort Pavlović-Lažetić Gordana M
collection DOAJ
description <p>Abstract</p> <p>Background</p> <p>We have compared 38 isolates of the SARS-CoV complete genome. The main goal was twofold: first, to analyze and compare nucleotide sequences and to identify positions of single nucleotide polymorphism (SNP), insertions and deletions, and second, to group them according to sequence similarity, eventually pointing to phylogeny of SARS-CoV isolates. The comparison is based on genome polymorphism such as insertions or deletions and the number and positions of SNPs.</p> <p>Results</p> <p>The nucleotide structure of all 38 isolates is presented. Based on insertions and deletions and dissimilarity due to SNPs, the dataset of all the isolates has been qualitatively classified into three groups each having their own subgroups. These are the A-group with "regular" isolates (no insertions / deletions except for 5' and 3' ends), the B-group of isolates with "long insertions", and the C-group of isolates with "many individual" insertions and deletions. The isolate with the smallest average number of SNPs, compared to other isolates, has been identified (TWH). The density distribution of SNPs, insertions and deletions for each group or subgroup, as well as cumulatively for all the isolates is also presented, along with the gene map for TWH.</p> <p>Since individual SNPs may have occurred at random, positions corresponding to multiple SNPs (occurring in two or more isolates) are identified and presented. This result revises some previous results of a similar type. Amino acid changes caused by multiple SNPs are also identified (for the annotated sequences, as well as presupposed amino acid changes for non-annotated ones). Exact SNP positions for the isolates in each group or subgroup are presented. Finally, a phylogenetic tree for the SARS-CoV isolates has been produced using the CLUSTALW program, showing high compatibility with former qualitative classification.</p> <p>Conclusions</p> <p>The comparative study of SARS-CoV isolates provides essential information for genome polymorphism, indication of strain differences and variants evolution. It may help with the development of effective treatment.</p>
first_indexed 2024-12-18T23:44:23Z
format Article
id doaj.art-635ff2ed2e744b90b3bcbd0e879f47ed
institution Directory Open Access Journal
issn 1471-2105
language English
last_indexed 2024-12-18T23:44:23Z
publishDate 2004-05-01
publisher BMC
record_format Article
series BMC Bioinformatics
spelling doaj.art-635ff2ed2e744b90b3bcbd0e879f47ed2022-12-21T20:47:17ZengBMCBMC Bioinformatics1471-21052004-05-01516510.1186/1471-2105-5-65Bioinformatics analysis of SARS coronavirus genome polymorphismPavlović-Lažetić Gordana MMitić Nenad SBeljanski Miloš V<p>Abstract</p> <p>Background</p> <p>We have compared 38 isolates of the SARS-CoV complete genome. The main goal was twofold: first, to analyze and compare nucleotide sequences and to identify positions of single nucleotide polymorphism (SNP), insertions and deletions, and second, to group them according to sequence similarity, eventually pointing to phylogeny of SARS-CoV isolates. The comparison is based on genome polymorphism such as insertions or deletions and the number and positions of SNPs.</p> <p>Results</p> <p>The nucleotide structure of all 38 isolates is presented. Based on insertions and deletions and dissimilarity due to SNPs, the dataset of all the isolates has been qualitatively classified into three groups each having their own subgroups. These are the A-group with "regular" isolates (no insertions / deletions except for 5' and 3' ends), the B-group of isolates with "long insertions", and the C-group of isolates with "many individual" insertions and deletions. The isolate with the smallest average number of SNPs, compared to other isolates, has been identified (TWH). The density distribution of SNPs, insertions and deletions for each group or subgroup, as well as cumulatively for all the isolates is also presented, along with the gene map for TWH.</p> <p>Since individual SNPs may have occurred at random, positions corresponding to multiple SNPs (occurring in two or more isolates) are identified and presented. This result revises some previous results of a similar type. Amino acid changes caused by multiple SNPs are also identified (for the annotated sequences, as well as presupposed amino acid changes for non-annotated ones). Exact SNP positions for the isolates in each group or subgroup are presented. Finally, a phylogenetic tree for the SARS-CoV isolates has been produced using the CLUSTALW program, showing high compatibility with former qualitative classification.</p> <p>Conclusions</p> <p>The comparative study of SARS-CoV isolates provides essential information for genome polymorphism, indication of strain differences and variants evolution. It may help with the development of effective treatment.</p>http://www.biomedcentral.com/1471-2105/5/65SARS-CoV complete genomeisolatesgenome sequence variationSNPclassificationphylogenetics
spellingShingle Pavlović-Lažetić Gordana M
Mitić Nenad S
Beljanski Miloš V
Bioinformatics analysis of SARS coronavirus genome polymorphism
BMC Bioinformatics
SARS-CoV complete genome
isolates
genome sequence variation
SNP
classification
phylogenetics
title Bioinformatics analysis of SARS coronavirus genome polymorphism
title_full Bioinformatics analysis of SARS coronavirus genome polymorphism
title_fullStr Bioinformatics analysis of SARS coronavirus genome polymorphism
title_full_unstemmed Bioinformatics analysis of SARS coronavirus genome polymorphism
title_short Bioinformatics analysis of SARS coronavirus genome polymorphism
title_sort bioinformatics analysis of sars coronavirus genome polymorphism
topic SARS-CoV complete genome
isolates
genome sequence variation
SNP
classification
phylogenetics
url http://www.biomedcentral.com/1471-2105/5/65
work_keys_str_mv AT pavloviclazeticgordanam bioinformaticsanalysisofsarscoronavirusgenomepolymorphism
AT miticnenads bioinformaticsanalysisofsarscoronavirusgenomepolymorphism
AT beljanskimilosv bioinformaticsanalysisofsarscoronavirusgenomepolymorphism