Classification of G-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm

<p>Abstract</p> <p>Background</p> <p>Because a priori knowledge about function of G protein-coupled receptors (GPCRs) can provide useful information to pharmaceutical research, the determination of their function is a quite meaningful topic in protein science. However,...

Full description

Bibliographic Details
Main Authors: Dai Zong, Zhou Xuan, Li Zhanchao, Zou Xiaoyong
Format: Article
Language:English
Published: BMC 2010-06-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/11/325
_version_ 1818530113234403328
author Dai Zong
Zhou Xuan
Li Zhanchao
Zou Xiaoyong
author_facet Dai Zong
Zhou Xuan
Li Zhanchao
Zou Xiaoyong
author_sort Dai Zong
collection DOAJ
description <p>Abstract</p> <p>Background</p> <p>Because a priori knowledge about function of G protein-coupled receptors (GPCRs) can provide useful information to pharmaceutical research, the determination of their function is a quite meaningful topic in protein science. However, with the rapid increase of GPCRs sequences entering into databanks, the gap between the number of known sequence and the number of known function is widening rapidly, and it is both time-consuming and expensive to determine their function based only on experimental techniques. Therefore, it is vitally significant to develop a computational method for quick and accurate classification of GPCRs.</p> <p>Results</p> <p>In this study, a novel three-layer predictor based on support vector machine (SVM) and feature selection is developed for predicting and classifying GPCRs directly from amino acid sequence data. The maximum relevance minimum redundancy (mRMR) is applied to pre-evaluate features with discriminative information while genetic algorithm (GA) is utilized to find the optimized feature subsets. SVM is used for the construction of classification models. The overall accuracy with three-layer predictor at levels of superfamily, family and subfamily are obtained by cross-validation test on two non-redundant dataset. The results are about 0.5% to 16% higher than those of GPCR-CA and GPCRPred.</p> <p>Conclusion</p> <p>The results with high success rates indicate that the proposed predictor is a useful automated tool in predicting GPCRs. GPCR-SVMFS, a corresponding executable program for GPCRs prediction and classification, can be acquired freely on request from the authors.</p>
first_indexed 2024-12-11T17:15:38Z
format Article
id doaj.art-9f4b388ee10f448e8e8cabeccfd1ac1b
institution Directory Open Access Journal
issn 1471-2105
language English
last_indexed 2024-12-11T17:15:38Z
publishDate 2010-06-01
publisher BMC
record_format Article
series BMC Bioinformatics
spelling doaj.art-9f4b388ee10f448e8e8cabeccfd1ac1b2022-12-22T00:57:21ZengBMCBMC Bioinformatics1471-21052010-06-0111132510.1186/1471-2105-11-325Classification of G-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithmDai ZongZhou XuanLi ZhanchaoZou Xiaoyong<p>Abstract</p> <p>Background</p> <p>Because a priori knowledge about function of G protein-coupled receptors (GPCRs) can provide useful information to pharmaceutical research, the determination of their function is a quite meaningful topic in protein science. However, with the rapid increase of GPCRs sequences entering into databanks, the gap between the number of known sequence and the number of known function is widening rapidly, and it is both time-consuming and expensive to determine their function based only on experimental techniques. Therefore, it is vitally significant to develop a computational method for quick and accurate classification of GPCRs.</p> <p>Results</p> <p>In this study, a novel three-layer predictor based on support vector machine (SVM) and feature selection is developed for predicting and classifying GPCRs directly from amino acid sequence data. The maximum relevance minimum redundancy (mRMR) is applied to pre-evaluate features with discriminative information while genetic algorithm (GA) is utilized to find the optimized feature subsets. SVM is used for the construction of classification models. The overall accuracy with three-layer predictor at levels of superfamily, family and subfamily are obtained by cross-validation test on two non-redundant dataset. The results are about 0.5% to 16% higher than those of GPCR-CA and GPCRPred.</p> <p>Conclusion</p> <p>The results with high success rates indicate that the proposed predictor is a useful automated tool in predicting GPCRs. GPCR-SVMFS, a corresponding executable program for GPCRs prediction and classification, can be acquired freely on request from the authors.</p>http://www.biomedcentral.com/1471-2105/11/325
spellingShingle Dai Zong
Zhou Xuan
Li Zhanchao
Zou Xiaoyong
Classification of G-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm
BMC Bioinformatics
title Classification of G-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm
title_full Classification of G-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm
title_fullStr Classification of G-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm
title_full_unstemmed Classification of G-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm
title_short Classification of G-protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm
title_sort classification of g protein coupled receptors based on support vector machine with maximum relevance minimum redundancy and genetic algorithm
url http://www.biomedcentral.com/1471-2105/11/325
work_keys_str_mv AT daizong classificationofgproteincoupledreceptorsbasedonsupportvectormachinewithmaximumrelevanceminimumredundancyandgeneticalgorithm
AT zhouxuan classificationofgproteincoupledreceptorsbasedonsupportvectormachinewithmaximumrelevanceminimumredundancyandgeneticalgorithm
AT lizhanchao classificationofgproteincoupledreceptorsbasedonsupportvectormachinewithmaximumrelevanceminimumredundancyandgeneticalgorithm
AT zouxiaoyong classificationofgproteincoupledreceptorsbasedonsupportvectormachinewithmaximumrelevanceminimumredundancyandgeneticalgorithm