Ovarian cancer classification based on dimensionality reduction for SELDI-TOF data

<p>Abstract</p> <p>Background</p> <p>Recent advances in proteomics technologies such as SELDI-TOF mass spectrometry has shown promise in the detection of early stage cancers. However, dimensionality reduction and classification are considerable challenges in statistical...

Full description

Bibliographic Details
Main Authors: Li Tong-Hua, Tang Kai-Lin, Xiong Wen-Wei, Chen Kai
Format: Article
Language:English
Published: BMC 2010-02-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/11/109
_version_ 1818773265872584704
author Li Tong-Hua
Tang Kai-Lin
Xiong Wen-Wei
Chen Kai
author_facet Li Tong-Hua
Tang Kai-Lin
Xiong Wen-Wei
Chen Kai
author_sort Li Tong-Hua
collection DOAJ
description <p>Abstract</p> <p>Background</p> <p>Recent advances in proteomics technologies such as SELDI-TOF mass spectrometry has shown promise in the detection of early stage cancers. However, dimensionality reduction and classification are considerable challenges in statistical machine learning. We therefore propose a novel approach for dimensionality reduction and tested it using published high-resolution SELDI-TOF data for ovarian cancer.</p> <p>Results</p> <p>We propose a method based on statistical moments to reduce feature dimensions. After refining and <it>t</it>-testing, SELDI-TOF data are divided into several intervals. Four statistical moments (mean, variance, skewness and kurtosis) are calculated for each interval and are used as representative variables. The high dimensionality of the data can thus be rapidly reduced. To improve efficiency and classification performance, the data are further used in kernel PLS models. The method achieved average sensitivity of 0.9950, specificity of 0.9916, accuracy of 0.9935 and a correlation coefficient of 0.9869 for 100 five-fold cross validations. Furthermore, only one control was misclassified in leave-one-out cross validation.</p> <p>Conclusion</p> <p>The proposed method is suitable for analyzing high-throughput proteomics data.</p>
first_indexed 2024-12-18T10:22:31Z
format Article
id doaj.art-d128abdd5c2b4b9bb141ea6fcaeecf72
institution Directory Open Access Journal
issn 1471-2105
language English
last_indexed 2024-12-18T10:22:31Z
publishDate 2010-02-01
publisher BMC
record_format Article
series BMC Bioinformatics
spelling doaj.art-d128abdd5c2b4b9bb141ea6fcaeecf722022-12-21T21:11:05ZengBMCBMC Bioinformatics1471-21052010-02-0111110910.1186/1471-2105-11-109Ovarian cancer classification based on dimensionality reduction for SELDI-TOF dataLi Tong-HuaTang Kai-LinXiong Wen-WeiChen Kai<p>Abstract</p> <p>Background</p> <p>Recent advances in proteomics technologies such as SELDI-TOF mass spectrometry has shown promise in the detection of early stage cancers. However, dimensionality reduction and classification are considerable challenges in statistical machine learning. We therefore propose a novel approach for dimensionality reduction and tested it using published high-resolution SELDI-TOF data for ovarian cancer.</p> <p>Results</p> <p>We propose a method based on statistical moments to reduce feature dimensions. After refining and <it>t</it>-testing, SELDI-TOF data are divided into several intervals. Four statistical moments (mean, variance, skewness and kurtosis) are calculated for each interval and are used as representative variables. The high dimensionality of the data can thus be rapidly reduced. To improve efficiency and classification performance, the data are further used in kernel PLS models. The method achieved average sensitivity of 0.9950, specificity of 0.9916, accuracy of 0.9935 and a correlation coefficient of 0.9869 for 100 five-fold cross validations. Furthermore, only one control was misclassified in leave-one-out cross validation.</p> <p>Conclusion</p> <p>The proposed method is suitable for analyzing high-throughput proteomics data.</p>http://www.biomedcentral.com/1471-2105/11/109
spellingShingle Li Tong-Hua
Tang Kai-Lin
Xiong Wen-Wei
Chen Kai
Ovarian cancer classification based on dimensionality reduction for SELDI-TOF data
BMC Bioinformatics
title Ovarian cancer classification based on dimensionality reduction for SELDI-TOF data
title_full Ovarian cancer classification based on dimensionality reduction for SELDI-TOF data
title_fullStr Ovarian cancer classification based on dimensionality reduction for SELDI-TOF data
title_full_unstemmed Ovarian cancer classification based on dimensionality reduction for SELDI-TOF data
title_short Ovarian cancer classification based on dimensionality reduction for SELDI-TOF data
title_sort ovarian cancer classification based on dimensionality reduction for seldi tof data
url http://www.biomedcentral.com/1471-2105/11/109
work_keys_str_mv AT litonghua ovariancancerclassificationbasedondimensionalityreductionforselditofdata
AT tangkailin ovariancancerclassificationbasedondimensionalityreductionforselditofdata
AT xiongwenwei ovariancancerclassificationbasedondimensionalityreductionforselditofdata
AT chenkai ovariancancerclassificationbasedondimensionalityreductionforselditofdata