XSTREAM: A practical algorithm for identification and architecture modeling of tandem repeats in protein sequences

<p>Abstract</p> <p>Background</p> <p>Biological sequence repeats arranged in tandem patterns are widespread in DNA and proteins. While many software tools have been designed to detect DNA tandem repeats (TRs), useful algorithms for identifying protein TRs with varied le...

Full description

Bibliographic Details
Main Authors: Cooper James B, Newman Aaron M
Format: Article
Language:English
Published: BMC 2007-10-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/8/382
_version_ 1811295111087128576
author Cooper James B
Newman Aaron M
author_facet Cooper James B
Newman Aaron M
author_sort Cooper James B
collection DOAJ
description <p>Abstract</p> <p>Background</p> <p>Biological sequence repeats arranged in tandem patterns are widespread in DNA and proteins. While many software tools have been designed to detect DNA tandem repeats (TRs), useful algorithms for identifying protein TRs with varied levels of degeneracy are still needed.</p> <p>Results</p> <p>To address limitations of current repeat identification methods, and to provide an efficient and flexible algorithm for the detection and analysis of TRs in protein sequences, we designed and implemented a new computational method called XSTREAM. Running time tests confirm the practicality of XSTREAM for analyses of multi-genome datasets. Each of the key capabilities of XSTREAM (e.g., merging, nesting, long-period detection, and TR architecture modeling) are demonstrated using anecdotal examples, and the utility of XSTREAM for identifying TR proteins was validated using data from a recently published paper.</p> <p>Conclusion</p> <p>We show that XSTREAM is a practical and valuable tool for TR detection in protein and nucleotide sequences at the multi-genome scale, and an effective tool for modeling TR domains with diverse architectures and varied levels of degeneracy. Because of these useful features, XSTREAM has significant potential for the discovery of naturally-evolved modular proteins with applications for engineering novel biostructural and biomimetic materials, and identifying new vaccine and diagnostic targets.</p>
first_indexed 2024-04-13T05:27:43Z
format Article
id doaj.art-8d2d9b90bc6a4e76849423da33c16c86
institution Directory Open Access Journal
issn 1471-2105
language English
last_indexed 2024-04-13T05:27:43Z
publishDate 2007-10-01
publisher BMC
record_format Article
series BMC Bioinformatics
spelling doaj.art-8d2d9b90bc6a4e76849423da33c16c862022-12-22T03:00:32ZengBMCBMC Bioinformatics1471-21052007-10-018138210.1186/1471-2105-8-382XSTREAM: A practical algorithm for identification and architecture modeling of tandem repeats in protein sequencesCooper James BNewman Aaron M<p>Abstract</p> <p>Background</p> <p>Biological sequence repeats arranged in tandem patterns are widespread in DNA and proteins. While many software tools have been designed to detect DNA tandem repeats (TRs), useful algorithms for identifying protein TRs with varied levels of degeneracy are still needed.</p> <p>Results</p> <p>To address limitations of current repeat identification methods, and to provide an efficient and flexible algorithm for the detection and analysis of TRs in protein sequences, we designed and implemented a new computational method called XSTREAM. Running time tests confirm the practicality of XSTREAM for analyses of multi-genome datasets. Each of the key capabilities of XSTREAM (e.g., merging, nesting, long-period detection, and TR architecture modeling) are demonstrated using anecdotal examples, and the utility of XSTREAM for identifying TR proteins was validated using data from a recently published paper.</p> <p>Conclusion</p> <p>We show that XSTREAM is a practical and valuable tool for TR detection in protein and nucleotide sequences at the multi-genome scale, and an effective tool for modeling TR domains with diverse architectures and varied levels of degeneracy. Because of these useful features, XSTREAM has significant potential for the discovery of naturally-evolved modular proteins with applications for engineering novel biostructural and biomimetic materials, and identifying new vaccine and diagnostic targets.</p>http://www.biomedcentral.com/1471-2105/8/382
spellingShingle Cooper James B
Newman Aaron M
XSTREAM: A practical algorithm for identification and architecture modeling of tandem repeats in protein sequences
BMC Bioinformatics
title XSTREAM: A practical algorithm for identification and architecture modeling of tandem repeats in protein sequences
title_full XSTREAM: A practical algorithm for identification and architecture modeling of tandem repeats in protein sequences
title_fullStr XSTREAM: A practical algorithm for identification and architecture modeling of tandem repeats in protein sequences
title_full_unstemmed XSTREAM: A practical algorithm for identification and architecture modeling of tandem repeats in protein sequences
title_short XSTREAM: A practical algorithm for identification and architecture modeling of tandem repeats in protein sequences
title_sort xstream a practical algorithm for identification and architecture modeling of tandem repeats in protein sequences
url http://www.biomedcentral.com/1471-2105/8/382
work_keys_str_mv AT cooperjamesb xstreamapracticalalgorithmforidentificationandarchitecturemodelingoftandemrepeatsinproteinsequences
AT newmanaaronm xstreamapracticalalgorithmforidentificationandarchitecturemodelingoftandemrepeatsinproteinsequences