Multiple structure alignment and consensus identification for proteins

<p>Abstract</p> <p>Background</p> <p>An algorithm is presented to compute a multiple structure alignment for a set of proteins and to generate a consensus (pseudo) protein which captures common substructures present in the given proteins. The algorithm represents each p...

Full description

Bibliographic Details
Main Authors: Ye Jieping, Ilinkin Ivaylo, Janardan Ravi
Format: Article
Language:English
Published: BMC 2010-02-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/11/71
_version_ 1818755628173099008
author Ye Jieping
Ilinkin Ivaylo
Janardan Ravi
author_facet Ye Jieping
Ilinkin Ivaylo
Janardan Ravi
author_sort Ye Jieping
collection DOAJ
description <p>Abstract</p> <p>Background</p> <p>An algorithm is presented to compute a multiple structure alignment for a set of proteins and to generate a consensus (pseudo) protein which captures common substructures present in the given proteins. The algorithm represents each protein as a sequence of triples of coordinates of the alpha-carbon atoms along the backbone. It then computes iteratively a sequence of transformation matrices (i.e., translations and rotations) to align the proteins in space and generate the consensus. The algorithm is a heuristic in that it computes an approximation to the optimal alignment that minimizes the sum of the pairwise distances between the consensus and the transformed proteins.</p> <p>Results</p> <p>Experimental results show that the algorithm converges quite rapidly and generates consensus structures that are visually similar to the input proteins. A comparison with other coordinate-based alignment algorithms (MAMMOTH and MATT) shows that the proposed algorithm is competitive in terms of speed and the sizes of the conserved regions discovered in an extensive benchmark dataset derived from the HOMSTRAD and SABmark databases.</p> <p>The algorithm has been implemented in C++ and can be downloaded from the project's web page. Alternatively, the algorithm can be used via a web server which makes it possible to align protein structures by uploading files from local disk or by downloading protein data from the RCSB Protein Data Bank.</p> <p>Conclusions</p> <p>An algorithm is presented to compute a multiple structure alignment for a set of proteins, together with their consensus structure. Experimental results show its effectiveness in terms of the quality of the alignment and computational cost.</p>
first_indexed 2024-12-18T05:42:10Z
format Article
id doaj.art-594466b01a58451e9225353efc8a1e5d
institution Directory Open Access Journal
issn 1471-2105
language English
last_indexed 2024-12-18T05:42:10Z
publishDate 2010-02-01
publisher BMC
record_format Article
series BMC Bioinformatics
spelling doaj.art-594466b01a58451e9225353efc8a1e5d2022-12-21T21:19:09ZengBMCBMC Bioinformatics1471-21052010-02-011117110.1186/1471-2105-11-71Multiple structure alignment and consensus identification for proteinsYe JiepingIlinkin IvayloJanardan Ravi<p>Abstract</p> <p>Background</p> <p>An algorithm is presented to compute a multiple structure alignment for a set of proteins and to generate a consensus (pseudo) protein which captures common substructures present in the given proteins. The algorithm represents each protein as a sequence of triples of coordinates of the alpha-carbon atoms along the backbone. It then computes iteratively a sequence of transformation matrices (i.e., translations and rotations) to align the proteins in space and generate the consensus. The algorithm is a heuristic in that it computes an approximation to the optimal alignment that minimizes the sum of the pairwise distances between the consensus and the transformed proteins.</p> <p>Results</p> <p>Experimental results show that the algorithm converges quite rapidly and generates consensus structures that are visually similar to the input proteins. A comparison with other coordinate-based alignment algorithms (MAMMOTH and MATT) shows that the proposed algorithm is competitive in terms of speed and the sizes of the conserved regions discovered in an extensive benchmark dataset derived from the HOMSTRAD and SABmark databases.</p> <p>The algorithm has been implemented in C++ and can be downloaded from the project's web page. Alternatively, the algorithm can be used via a web server which makes it possible to align protein structures by uploading files from local disk or by downloading protein data from the RCSB Protein Data Bank.</p> <p>Conclusions</p> <p>An algorithm is presented to compute a multiple structure alignment for a set of proteins, together with their consensus structure. Experimental results show its effectiveness in terms of the quality of the alignment and computational cost.</p>http://www.biomedcentral.com/1471-2105/11/71
spellingShingle Ye Jieping
Ilinkin Ivaylo
Janardan Ravi
Multiple structure alignment and consensus identification for proteins
BMC Bioinformatics
title Multiple structure alignment and consensus identification for proteins
title_full Multiple structure alignment and consensus identification for proteins
title_fullStr Multiple structure alignment and consensus identification for proteins
title_full_unstemmed Multiple structure alignment and consensus identification for proteins
title_short Multiple structure alignment and consensus identification for proteins
title_sort multiple structure alignment and consensus identification for proteins
url http://www.biomedcentral.com/1471-2105/11/71
work_keys_str_mv AT yejieping multiplestructurealignmentandconsensusidentificationforproteins
AT ilinkinivaylo multiplestructurealignmentandconsensusidentificationforproteins
AT janardanravi multiplestructurealignmentandconsensusidentificationforproteins