Gene shaving using a sensitivity analysis of kernel based machine learning approach, with applications to cancer data.

<h4>Background</h4>Gene shaving (GS) is an essential and challenging tools for biomedical researchers due to the large number of genes in human genome and the complex nature of biological networks. Most GS methods are not applicable to non-linear and multi-view data sets. While the kerne...

Full description

Bibliographic Details
Main Authors: Md Ashad Alam, Mohammd Shahjaman, Md Ferdush Rahman, Fokhrul Hossain, Hong-Wen Deng
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2019-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0217027
_version_ 1811332027663777792
author Md Ashad Alam
Mohammd Shahjaman
Md Ferdush Rahman
Fokhrul Hossain
Hong-Wen Deng
author_facet Md Ashad Alam
Mohammd Shahjaman
Md Ferdush Rahman
Fokhrul Hossain
Hong-Wen Deng
author_sort Md Ashad Alam
collection DOAJ
description <h4>Background</h4>Gene shaving (GS) is an essential and challenging tools for biomedical researchers due to the large number of genes in human genome and the complex nature of biological networks. Most GS methods are not applicable to non-linear and multi-view data sets. While the kernel based methods can overcome these problems, a well-founded positive definite kernel based GS method has yet to be proposed for biomedical data analysis.<h4>Methods and findings</h4>Since the kernel based methods on genomic information can improve the prediction of diseases, here we proposed a noble method, "kernel based gene shaving" which is based on the influence function of kernel canonical correlation analysis. To investigate the performance of the proposed method in comparison to state-of-the-art-method in gene saving, we analyzed extensive simulated and real microarray gene expression data set. The performance metrics including true positive rate, true negative rate, false positive rate, false negative rate, misclassification error rate, the false discovery rate and area under curves were computed for each methods. In colon cancer data analysis, the proposed method identified a significant subsets of 210 genes out of 2000 genes and suggestive superior performance compared with other methods. The proposed method can be applied to the study of other disease process where two view data is a common task.<h4>Conclusions</h4>We addressed the challenge of finding unique kernel based GS methods by using the influence function of kernel canonical correlation analysis. The proposed method has shown to have better performance than state-of-the-art-methods in gene saving and has identified many more significant gene interactions, suggesting that genes function in a concerted effort in colon cancer. In similar biomedical data analysis, kernel based methods could be applied to select a potential subset of genes. The positive definite kernel based methods can overcome the non-linearity problem and improve the prediction process.
first_indexed 2024-04-13T16:29:39Z
format Article
id doaj.art-70b2cf600bec478d9379e856c776a328
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-04-13T16:29:39Z
publishDate 2019-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-70b2cf600bec478d9379e856c776a3282022-12-22T02:39:37ZengPublic Library of Science (PLoS)PLoS ONE1932-62032019-01-01145e021702710.1371/journal.pone.0217027Gene shaving using a sensitivity analysis of kernel based machine learning approach, with applications to cancer data.Md Ashad AlamMohammd ShahjamanMd Ferdush RahmanFokhrul HossainHong-Wen Deng<h4>Background</h4>Gene shaving (GS) is an essential and challenging tools for biomedical researchers due to the large number of genes in human genome and the complex nature of biological networks. Most GS methods are not applicable to non-linear and multi-view data sets. While the kernel based methods can overcome these problems, a well-founded positive definite kernel based GS method has yet to be proposed for biomedical data analysis.<h4>Methods and findings</h4>Since the kernel based methods on genomic information can improve the prediction of diseases, here we proposed a noble method, "kernel based gene shaving" which is based on the influence function of kernel canonical correlation analysis. To investigate the performance of the proposed method in comparison to state-of-the-art-method in gene saving, we analyzed extensive simulated and real microarray gene expression data set. The performance metrics including true positive rate, true negative rate, false positive rate, false negative rate, misclassification error rate, the false discovery rate and area under curves were computed for each methods. In colon cancer data analysis, the proposed method identified a significant subsets of 210 genes out of 2000 genes and suggestive superior performance compared with other methods. The proposed method can be applied to the study of other disease process where two view data is a common task.<h4>Conclusions</h4>We addressed the challenge of finding unique kernel based GS methods by using the influence function of kernel canonical correlation analysis. The proposed method has shown to have better performance than state-of-the-art-methods in gene saving and has identified many more significant gene interactions, suggesting that genes function in a concerted effort in colon cancer. In similar biomedical data analysis, kernel based methods could be applied to select a potential subset of genes. The positive definite kernel based methods can overcome the non-linearity problem and improve the prediction process.https://doi.org/10.1371/journal.pone.0217027
spellingShingle Md Ashad Alam
Mohammd Shahjaman
Md Ferdush Rahman
Fokhrul Hossain
Hong-Wen Deng
Gene shaving using a sensitivity analysis of kernel based machine learning approach, with applications to cancer data.
PLoS ONE
title Gene shaving using a sensitivity analysis of kernel based machine learning approach, with applications to cancer data.
title_full Gene shaving using a sensitivity analysis of kernel based machine learning approach, with applications to cancer data.
title_fullStr Gene shaving using a sensitivity analysis of kernel based machine learning approach, with applications to cancer data.
title_full_unstemmed Gene shaving using a sensitivity analysis of kernel based machine learning approach, with applications to cancer data.
title_short Gene shaving using a sensitivity analysis of kernel based machine learning approach, with applications to cancer data.
title_sort gene shaving using a sensitivity analysis of kernel based machine learning approach with applications to cancer data
url https://doi.org/10.1371/journal.pone.0217027
work_keys_str_mv AT mdashadalam geneshavingusingasensitivityanalysisofkernelbasedmachinelearningapproachwithapplicationstocancerdata
AT mohammdshahjaman geneshavingusingasensitivityanalysisofkernelbasedmachinelearningapproachwithapplicationstocancerdata
AT mdferdushrahman geneshavingusingasensitivityanalysisofkernelbasedmachinelearningapproachwithapplicationstocancerdata
AT fokhrulhossain geneshavingusingasensitivityanalysisofkernelbasedmachinelearningapproachwithapplicationstocancerdata
AT hongwendeng geneshavingusingasensitivityanalysisofkernelbasedmachinelearningapproachwithapplicationstocancerdata