A pathway-based approach for analyzing microarray data using random forests

Although machine learning methods, such as random forests, have been developed to correlate survival outcomes with a set of genes, few studies have assessed the ability of these methods to incorporate pathway information when analyzing microarray data. In general, genes that are identified without...

Bibliographic Details
Main Authors: Tan Ah Chik @ Mohamad, Mohd. Saberi, Shi, Chin Hui, Deris, Safaai, Ibrahim, Zuwairie
Format: Conference or Workshop Item
Published: 2011
author Tan Ah Chik @ Mohamad, Mohd. Saberi
Shi, Chin Hui
Deris, Safaai
Ibrahim, Zuwairie
collection ePrints
description Although machine learning methods, such as random forests, have been developed to correlate survival outcomes with a set of genes, few studies have assessed the ability of these methods to incorporate pathway information when analyzing microarray data. In general, genes that are identified without incorporating biological knowledge are more difficult to interpret. Thus, pathway-based survival analysis using machine learning methods represents a promising approach for generating new biological hypotheses from microarray studies. The two popular variants of random forests used in this research for survival data are random survival forests and bivariate node-splitting random survival forests. Three types of datasets are used in this research, each with a three-level outcome. This research compared the four splitting rules available in random survival forests and identified the log-rank test as the most accurate in terms of prediction error. To evaluate the accuracy of the pathway-based survival approach, this research considered the area under the receiver operating characteristic curve for censored data. The use of random survival forests for survival outcomes in analyzing microarray data allows researchers to obtain results that are more closely tied to the biological mechanisms of diseases.
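The log-rank splitting rule named in the description can be illustrated with a minimal Python sketch. This is not the authors' code, and it is not the API of any random survival forest package. The function names `logrank_statistic` and `best_threshold` and the toy data are assumptions introduced here for illustration. The sketch computes the standardized two-sample log-rank statistic for one candidate split of a node and scans the thresholds of a single gene's expression values for the split that separates the two survival groups most strongly.

```python
from math import sqrt


def logrank_statistic(times, events, in_left):
    """Absolute standardized log-rank statistic for a two-way split.

    times   -- observed follow-up times
    events  -- 1/True if the event was observed, 0/False if censored
    in_left -- True for samples sent to the left daughter node
    """
    # Distinct times at which at least one event was observed.
    event_times = sorted({t for t, e in zip(times, events) if e})
    observed_minus_expected = 0.0
    variance = 0.0
    for t in event_times:
        # Samples still at risk just before time t (whole node, left node).
        n = sum(1 for x in times if x >= t)
        n_left = sum(1 for x, g in zip(times, in_left) if x >= t and g)
        # Events occurring exactly at time t (whole node, left node).
        d = sum(1 for x, e in zip(times, events) if x == t and e)
        d_left = sum(
            1 for x, e, g in zip(times, events, in_left) if x == t and e and g
        )
        observed_minus_expected += d_left - d * n_left / n
        if n > 1:
            frac = n_left / n
            variance += d * frac * (1 - frac) * (n - d) / (n - 1)
    if variance == 0:
        return 0.0  # the split carries no information
    return abs(observed_minus_expected) / sqrt(variance)


def best_threshold(values, times, events):
    """Return (threshold, statistic) maximizing the log-rank statistic.

    Candidate splits are of the form value <= threshold.
    """
    best = (None, 0.0)
    # The largest value is skipped because it would leave the right node empty.
    for c in sorted(set(values))[:-1]:
        stat = logrank_statistic(times, events, [v <= c for v in values])
        if stat > best[1]:
            best = (c, stat)
    return best


if __name__ == "__main__":
    # Hypothetical toy data: one gene's expression, follow-up times, event flags.
    expression = [0.1, 0.2, 0.9, 0.8]
    follow_up = [1, 2, 9, 10]
    observed = [1, 1, 1, 1]
    print(best_threshold(expression, follow_up, observed))
```

In a random survival forest, a search of this kind would be repeated at every node over a random subset of candidate variables on a bootstrap sample. The sketch covers only the split criterion itself.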
first_indexed 2024-03-05T19:17:14Z
format Conference or Workshop Item
id utm.eprints-45496
institution Universiti Teknologi Malaysia - ePrints
last_indexed 2024-03-05T19:17:14Z
publishDate 2011
record_format dspace
spelling utm.eprints-45496 2017-08-30T00:55:35Z http://eprints.utm.my/45496/ 2011 Conference or Workshop Item PeerReviewed Tan Ah Chik @ Mohamad, Mohd. Saberi and Shi, Chin Hui and Deris, Safaai and Ibrahim, Zuwairie (2011) A pathway-based approach for analyzing microarray data using random forests. In: Sixth International Conference On Innovative Computing, Information And Control (LCICIC 2011).
title A pathway-based approach for analyzing microarray data using random forests