Simultaneous alignment and folding of protein sequences

Accurate comparative analysis tools for low-homology proteins remains a difficult challenge in computational biology, especially sequence alignment and consensus folding problems. We presentpartiFold-Align, the first algorithm for simultaneous alignment and consensus folding of unaligned protein seq...

Full description

Bibliographic Details
Main Authors: Waldispuhl, Jerome, O'Donnell, Charles William, Will, Sebastian, Devadas, Srinivas, Backofen, Rolf, Berger, Bonnie
Other Authors: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Format: Article
Language:en_US
Published: Springer Berlin Heidelberg 2010
Online Access:http://hdl.handle.net/1721.1/51685
https://orcid.org/0000-0001-8253-7714
https://orcid.org/0000-0002-2724-7228
_version_ 1826214331973369856
author Waldispuhl, Jerome
O'Donnell, Charles William
Will, Sebastian
Devadas, Srinivas
Backofen, Rolf
Berger, Bonnie
author2 Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
author_facet Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Waldispuhl, Jerome
O'Donnell, Charles William
Will, Sebastian
Devadas, Srinivas
Backofen, Rolf
Berger, Bonnie
author_sort Waldispuhl, Jerome
collection MIT
description Accurate comparative analysis tools for low-homology proteins remains a difficult challenge in computational biology, especially sequence alignment and consensus folding problems. We presentpartiFold-Align, the first algorithm for simultaneous alignment and consensus folding of unaligned protein sequences; the algorithm’s complexity is polynomial in time and space. Algorithmically,partiFold-Align exploits sparsity in the set of super-secondary structure pairings and alignment candidates to achieve an effectively cubic running time for simultaneous pairwise alignment and folding. We demonstrate the efficacy of these techniques on transmembrane β-barrel proteins, an important yet difficult class of proteins with few known three-dimensional structures. Testing against structurally derived sequence alignments,partiFold-Align significantly outperforms state-of-the-art pairwise sequence alignment tools in the most difficult low sequence homology case and improves secondary structure prediction where current approaches fail. Importantly, partiFold-Align requires no prior training. These general techniques are widely applicable to many more protein families. partiFold-Align is available at http://partiFold.csail.mit.edu.
first_indexed 2024-09-23T16:03:42Z
format Article
id mit-1721.1/51685
institution Massachusetts Institute of Technology
language en_US
last_indexed 2024-09-23T16:03:42Z
publishDate 2010
publisher Springer Berlin Heidelberg
record_format dspace
spelling mit-1721.1/516852022-09-29T17:57:17Z Simultaneous alignment and folding of protein sequences Waldispuhl, Jerome O'Donnell, Charles William Will, Sebastian Devadas, Srinivas Backofen, Rolf Berger, Bonnie Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Massachusetts Institute of Technology. Department of Mathematics Devadas, Srinivas Waldispuhl, Jerome O'Donnell, Charles William Devadas, Srinivas Berger, Bonnie Accurate comparative analysis tools for low-homology proteins remains a difficult challenge in computational biology, especially sequence alignment and consensus folding problems. We presentpartiFold-Align, the first algorithm for simultaneous alignment and consensus folding of unaligned protein sequences; the algorithm’s complexity is polynomial in time and space. Algorithmically,partiFold-Align exploits sparsity in the set of super-secondary structure pairings and alignment candidates to achieve an effectively cubic running time for simultaneous pairwise alignment and folding. We demonstrate the efficacy of these techniques on transmembrane β-barrel proteins, an important yet difficult class of proteins with few known three-dimensional structures. Testing against structurally derived sequence alignments,partiFold-Align significantly outperforms state-of-the-art pairwise sequence alignment tools in the most difficult low sequence homology case and improves secondary structure prediction where current approaches fail. Importantly, partiFold-Align requires no prior training. These general techniques are widely applicable to many more protein families. partiFold-Align is available at http://partiFold.csail.mit.edu. 2010-02-10T19:12:58Z 2010-02-10T19:12:58Z 2009-05 Article http://purl.org/eprint/type/JournalArticle 1611-3349 0302-9743 http://hdl.handle.net/1721.1/51685 Waldispühl, Jérôme et al. “Simultaneous Alignment and Folding of Protein Sequences.” Research in Computational Molecular Biology 2009. https://orcid.org/0000-0001-8253-7714 https://orcid.org/0000-0002-2724-7228 en_US http://dx.doi.org/10.1007/978-3-642-02008-7_25 Research in Computational Molecular Biology Attribution-Noncommercial-Share Alike 3.0 Unported http://creativecommons.org/licenses/by-nc-sa/3.0/ application/pdf Springer Berlin Heidelberg Srinivas Devadas
spellingShingle Waldispuhl, Jerome
O'Donnell, Charles William
Will, Sebastian
Devadas, Srinivas
Backofen, Rolf
Berger, Bonnie
Simultaneous alignment and folding of protein sequences
title Simultaneous alignment and folding of protein sequences
title_full Simultaneous alignment and folding of protein sequences
title_fullStr Simultaneous alignment and folding of protein sequences
title_full_unstemmed Simultaneous alignment and folding of protein sequences
title_short Simultaneous alignment and folding of protein sequences
title_sort simultaneous alignment and folding of protein sequences
url http://hdl.handle.net/1721.1/51685
https://orcid.org/0000-0001-8253-7714
https://orcid.org/0000-0002-2724-7228
work_keys_str_mv AT waldispuhljerome simultaneousalignmentandfoldingofproteinsequences
AT odonnellcharleswilliam simultaneousalignmentandfoldingofproteinsequences
AT willsebastian simultaneousalignmentandfoldingofproteinsequences
AT devadassrinivas simultaneousalignmentandfoldingofproteinsequences
AT backofenrolf simultaneousalignmentandfoldingofproteinsequences
AT bergerbonnie simultaneousalignmentandfoldingofproteinsequences