Prediction of protein binding sites in protein structures using hidden Markov support vector machine

<p>Abstract</p> <p>Background</p> <p>Predicting the binding sites between two interacting proteins provides important clues to the function of a protein. Recent research on protein binding site prediction has been mainly based on widely known machine learning techniques...

Full description

Bibliographic Details
Main Authors: Lin Lei, Wang Xiaolong, Liu Bin, Tang Buzhou, Dong Qiwen, Wang Xuan
Format: Article
Language:English
Published: BMC 2009-11-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/10/381
_version_ 1818833265915592704
author Lin Lei
Wang Xiaolong
Liu Bin
Tang Buzhou
Dong Qiwen
Wang Xuan
author_facet Lin Lei
Wang Xiaolong
Liu Bin
Tang Buzhou
Dong Qiwen
Wang Xuan
author_sort Lin Lei
collection DOAJ
description <p>Abstract</p> <p>Background</p> <p>Predicting the binding sites between two interacting proteins provides important clues to the function of a protein. Recent research on protein binding site prediction has been mainly based on widely known machine learning techniques, such as artificial neural networks, support vector machines, conditional random field, etc. However, the prediction performance is still too low to be used in practice. It is necessary to explore new algorithms, theories and features to further improve the performance.</p> <p>Results</p> <p>In this study, we introduce a novel machine learning model hidden Markov support vector machine for protein binding site prediction. The model treats the protein binding site prediction as a sequential labelling task based on the maximum margin criterion. Common features derived from protein sequences and structures, including protein sequence profile and residue accessible surface area, are used to train hidden Markov support vector machine. When tested on six data sets, the method based on hidden Markov support vector machine shows better performance than some state-of-the-art methods, including artificial neural networks, support vector machines and conditional random field. Furthermore, its running time is several orders of magnitude shorter than that of the compared methods.</p> <p>Conclusion</p> <p>The improved prediction performance and computational efficiency of the method based on hidden Markov support vector machine can be attributed to the following three factors. Firstly, the relation between labels of neighbouring residues is useful for protein binding site prediction. Secondly, the kernel trick is very advantageous to this field. Thirdly, the complexity of the training step for hidden Markov support vector machine is linear with the number of training samples by using the cutting-plane algorithm.</p>
first_indexed 2024-12-19T02:16:11Z
format Article
id doaj.art-6762d64aeb2d46fca46896812c80c700
institution Directory Open Access Journal
issn 1471-2105
language English
last_indexed 2024-12-19T02:16:11Z
publishDate 2009-11-01
publisher BMC
record_format Article
series BMC Bioinformatics
spelling doaj.art-6762d64aeb2d46fca46896812c80c7002022-12-21T20:40:24ZengBMCBMC Bioinformatics1471-21052009-11-0110138110.1186/1471-2105-10-381Prediction of protein binding sites in protein structures using hidden Markov support vector machineLin LeiWang XiaolongLiu BinTang BuzhouDong QiwenWang Xuan<p>Abstract</p> <p>Background</p> <p>Predicting the binding sites between two interacting proteins provides important clues to the function of a protein. Recent research on protein binding site prediction has been mainly based on widely known machine learning techniques, such as artificial neural networks, support vector machines, conditional random field, etc. However, the prediction performance is still too low to be used in practice. It is necessary to explore new algorithms, theories and features to further improve the performance.</p> <p>Results</p> <p>In this study, we introduce a novel machine learning model hidden Markov support vector machine for protein binding site prediction. The model treats the protein binding site prediction as a sequential labelling task based on the maximum margin criterion. Common features derived from protein sequences and structures, including protein sequence profile and residue accessible surface area, are used to train hidden Markov support vector machine. When tested on six data sets, the method based on hidden Markov support vector machine shows better performance than some state-of-the-art methods, including artificial neural networks, support vector machines and conditional random field. Furthermore, its running time is several orders of magnitude shorter than that of the compared methods.</p> <p>Conclusion</p> <p>The improved prediction performance and computational efficiency of the method based on hidden Markov support vector machine can be attributed to the following three factors. Firstly, the relation between labels of neighbouring residues is useful for protein binding site prediction. Secondly, the kernel trick is very advantageous to this field. Thirdly, the complexity of the training step for hidden Markov support vector machine is linear with the number of training samples by using the cutting-plane algorithm.</p>http://www.biomedcentral.com/1471-2105/10/381
spellingShingle Lin Lei
Wang Xiaolong
Liu Bin
Tang Buzhou
Dong Qiwen
Wang Xuan
Prediction of protein binding sites in protein structures using hidden Markov support vector machine
BMC Bioinformatics
title Prediction of protein binding sites in protein structures using hidden Markov support vector machine
title_full Prediction of protein binding sites in protein structures using hidden Markov support vector machine
title_fullStr Prediction of protein binding sites in protein structures using hidden Markov support vector machine
title_full_unstemmed Prediction of protein binding sites in protein structures using hidden Markov support vector machine
title_short Prediction of protein binding sites in protein structures using hidden Markov support vector machine
title_sort prediction of protein binding sites in protein structures using hidden markov support vector machine
url http://www.biomedcentral.com/1471-2105/10/381
work_keys_str_mv AT linlei predictionofproteinbindingsitesinproteinstructuresusinghiddenmarkovsupportvectormachine
AT wangxiaolong predictionofproteinbindingsitesinproteinstructuresusinghiddenmarkovsupportvectormachine
AT liubin predictionofproteinbindingsitesinproteinstructuresusinghiddenmarkovsupportvectormachine
AT tangbuzhou predictionofproteinbindingsitesinproteinstructuresusinghiddenmarkovsupportvectormachine
AT dongqiwen predictionofproteinbindingsitesinproteinstructuresusinghiddenmarkovsupportvectormachine
AT wangxuan predictionofproteinbindingsitesinproteinstructuresusinghiddenmarkovsupportvectormachine