SDT: a virus classification tool based on pairwise sequence alignment and identity calculation.

The perpetually increasing rate at which viral full-genome sequences are being determined is creating a pressing demand for computational tools that will aid the objective classification of these genome sequences. Taxonomic classification approaches that are based on pairwise genetic identity measur...

Full description

Bibliographic Details
Main Authors: Brejnev Muhizi Muhire, Arvind Varsani, Darren Patrick Martin
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2014-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC4178126?pdf=render
_version_ 1819148942659551232
author Brejnev Muhizi Muhire
Arvind Varsani
Darren Patrick Martin
author_facet Brejnev Muhizi Muhire
Arvind Varsani
Darren Patrick Martin
author_sort Brejnev Muhizi Muhire
collection DOAJ
description The perpetually increasing rate at which viral full-genome sequences are being determined is creating a pressing demand for computational tools that will aid the objective classification of these genome sequences. Taxonomic classification approaches that are based on pairwise genetic identity measures are potentially highly automatable and are progressively gaining favour with the International Committee on Taxonomy of Viruses (ICTV). There are, however, various issues with the calculation of such measures that could potentially undermine the accuracy and consistency with which they can be applied to virus classification. Firstly, pairwise sequence identities computed based on multiple sequence alignments rather than on multiple independent pairwise alignments can lead to the deflation of identity scores with increasing dataset sizes. Also, when gap-characters need to be introduced during sequence alignments to account for insertions and deletions, methodological variations in the way that these characters are introduced and handled during pairwise genetic identity calculations can cause high degrees of inconsistency in the way that different methods classify the same sets of sequences. Here we present Sequence Demarcation Tool (SDT), a free user-friendly computer program that aims to provide a robust and highly reproducible means of objectively using pairwise genetic identity calculations to classify any set of nucleotide or amino acid sequences. SDT can produce publication quality pairwise identity plots and colour-coded distance matrices to further aid the classification of sequences according to ICTV approved taxonomic demarcation criteria. Besides a graphical interface version of the program for Windows computers, command-line versions of the program are available for a variety of different operating systems (including a parallel version for cluster computing platforms).
first_indexed 2024-12-22T13:53:44Z
format Article
id doaj.art-11cef9a93d1c4c7799223b4ebf358b4b
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-12-22T13:53:44Z
publishDate 2014-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-11cef9a93d1c4c7799223b4ebf358b4b2022-12-21T18:23:35ZengPublic Library of Science (PLoS)PLoS ONE1932-62032014-01-0199e10827710.1371/journal.pone.0108277SDT: a virus classification tool based on pairwise sequence alignment and identity calculation.Brejnev Muhizi MuhireArvind VarsaniDarren Patrick MartinThe perpetually increasing rate at which viral full-genome sequences are being determined is creating a pressing demand for computational tools that will aid the objective classification of these genome sequences. Taxonomic classification approaches that are based on pairwise genetic identity measures are potentially highly automatable and are progressively gaining favour with the International Committee on Taxonomy of Viruses (ICTV). There are, however, various issues with the calculation of such measures that could potentially undermine the accuracy and consistency with which they can be applied to virus classification. Firstly, pairwise sequence identities computed based on multiple sequence alignments rather than on multiple independent pairwise alignments can lead to the deflation of identity scores with increasing dataset sizes. Also, when gap-characters need to be introduced during sequence alignments to account for insertions and deletions, methodological variations in the way that these characters are introduced and handled during pairwise genetic identity calculations can cause high degrees of inconsistency in the way that different methods classify the same sets of sequences. Here we present Sequence Demarcation Tool (SDT), a free user-friendly computer program that aims to provide a robust and highly reproducible means of objectively using pairwise genetic identity calculations to classify any set of nucleotide or amino acid sequences. SDT can produce publication quality pairwise identity plots and colour-coded distance matrices to further aid the classification of sequences according to ICTV approved taxonomic demarcation criteria. Besides a graphical interface version of the program for Windows computers, command-line versions of the program are available for a variety of different operating systems (including a parallel version for cluster computing platforms).http://europepmc.org/articles/PMC4178126?pdf=render
spellingShingle Brejnev Muhizi Muhire
Arvind Varsani
Darren Patrick Martin
SDT: a virus classification tool based on pairwise sequence alignment and identity calculation.
PLoS ONE
title SDT: a virus classification tool based on pairwise sequence alignment and identity calculation.
title_full SDT: a virus classification tool based on pairwise sequence alignment and identity calculation.
title_fullStr SDT: a virus classification tool based on pairwise sequence alignment and identity calculation.
title_full_unstemmed SDT: a virus classification tool based on pairwise sequence alignment and identity calculation.
title_short SDT: a virus classification tool based on pairwise sequence alignment and identity calculation.
title_sort sdt a virus classification tool based on pairwise sequence alignment and identity calculation
url http://europepmc.org/articles/PMC4178126?pdf=render
work_keys_str_mv AT brejnevmuhizimuhire sdtavirusclassificationtoolbasedonpairwisesequencealignmentandidentitycalculation
AT arvindvarsani sdtavirusclassificationtoolbasedonpairwisesequencealignmentandidentitycalculation
AT darrenpatrickmartin sdtavirusclassificationtoolbasedonpairwisesequencealignmentandidentitycalculation