Improved residue contact prediction using support vector machines and a large feature set

<p>Abstract</p> <p>Background</p> <p>Predicting protein residue-residue contacts is an important 2D prediction task. It is useful for <it>ab initio </it>structure prediction and understanding protein folding. In spite of steady progress over the past decade,...

Full description

Bibliographic Details
Main Authors: Baldi Pierre, Cheng Jianlin
Format: Article
Language:English
Published: BMC 2007-04-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/8/113
_version_ 1818806367759106048
author Baldi Pierre
Cheng Jianlin
author_facet Baldi Pierre
Cheng Jianlin
author_sort Baldi Pierre
collection DOAJ
description <p>Abstract</p> <p>Background</p> <p>Predicting protein residue-residue contacts is an important 2D prediction task. It is useful for <it>ab initio </it>structure prediction and understanding protein folding. In spite of steady progress over the past decade, contact prediction remains still largely unsolved.</p> <p>Results</p> <p>Here we develop a new contact map predictor (SVMcon) that uses support vector machines to predict medium- and long-range contacts. SVMcon integrates profiles, secondary structure, relative solvent accessibility, contact potentials, and other useful features. On the same test data set, SVMcon's accuracy is 4% higher than the latest version of the CMAPpro contact map predictor. SVMcon recently participated in the seventh edition of the Critical Assessment of Techniques for Protein Structure Prediction (CASP7) experiment and was evaluated along with seven other contact map predictors. SVMcon was ranked as one of the top predictors, yielding the second best coverage and accuracy for contacts with sequence separation >= 12 on 13 <it>de novo </it>domains.</p> <p>Conclusion</p> <p>We describe SVMcon, a new contact map predictor that uses SVMs and a large set of informative features. SVMcon yields good performance on medium- to long-range contact predictions and can be modularly incorporated into a structure prediction pipeline.</p>
first_indexed 2024-12-18T19:08:39Z
format Article
id doaj.art-fe00b35d953e445d8ddfcce7c4b07784
institution Directory Open Access Journal
issn 1471-2105
language English
last_indexed 2024-12-18T19:08:39Z
publishDate 2007-04-01
publisher BMC
record_format Article
series BMC Bioinformatics
spelling doaj.art-fe00b35d953e445d8ddfcce7c4b077842022-12-21T20:56:21ZengBMCBMC Bioinformatics1471-21052007-04-018111310.1186/1471-2105-8-113Improved residue contact prediction using support vector machines and a large feature setBaldi PierreCheng Jianlin<p>Abstract</p> <p>Background</p> <p>Predicting protein residue-residue contacts is an important 2D prediction task. It is useful for <it>ab initio </it>structure prediction and understanding protein folding. In spite of steady progress over the past decade, contact prediction remains still largely unsolved.</p> <p>Results</p> <p>Here we develop a new contact map predictor (SVMcon) that uses support vector machines to predict medium- and long-range contacts. SVMcon integrates profiles, secondary structure, relative solvent accessibility, contact potentials, and other useful features. On the same test data set, SVMcon's accuracy is 4% higher than the latest version of the CMAPpro contact map predictor. SVMcon recently participated in the seventh edition of the Critical Assessment of Techniques for Protein Structure Prediction (CASP7) experiment and was evaluated along with seven other contact map predictors. SVMcon was ranked as one of the top predictors, yielding the second best coverage and accuracy for contacts with sequence separation >= 12 on 13 <it>de novo </it>domains.</p> <p>Conclusion</p> <p>We describe SVMcon, a new contact map predictor that uses SVMs and a large set of informative features. SVMcon yields good performance on medium- to long-range contact predictions and can be modularly incorporated into a structure prediction pipeline.</p>http://www.biomedcentral.com/1471-2105/8/113
spellingShingle Baldi Pierre
Cheng Jianlin
Improved residue contact prediction using support vector machines and a large feature set
BMC Bioinformatics
title Improved residue contact prediction using support vector machines and a large feature set
title_full Improved residue contact prediction using support vector machines and a large feature set
title_fullStr Improved residue contact prediction using support vector machines and a large feature set
title_full_unstemmed Improved residue contact prediction using support vector machines and a large feature set
title_short Improved residue contact prediction using support vector machines and a large feature set
title_sort improved residue contact prediction using support vector machines and a large feature set
url http://www.biomedcentral.com/1471-2105/8/113
work_keys_str_mv AT baldipierre improvedresiduecontactpredictionusingsupportvectormachinesandalargefeatureset
AT chengjianlin improvedresiduecontactpredictionusingsupportvectormachinesandalargefeatureset