Significant directed walk framework to increase the accuracy of cancer classification using gene expression data
Early diagnosis methods in cancer diagnosis studies are making great challenge as they require the involvement of different fields. Deoxyribonucleic acid (DNA) microarray analysis is one of the modern cancer diagnosis techniques used by scientists to measure the gene expression level changes in gene...
Main Authors: | , , , |
---|---|
Format: | Article |
Published: |
Springer Science and Business Media Deutschland GmbH
2021
|
Subjects: |
_version_ | 1796865611591057408 |
---|---|
author | Seah, C. S. Kasim, S. Md. Fudzee, M. F. Hassan, R. |
author_facet | Seah, C. S. Kasim, S. Md. Fudzee, M. F. Hassan, R. |
author_sort | Seah, C. S. |
collection | ePrints |
description | Early diagnosis methods in cancer diagnosis studies are making great challenge as they require the involvement of different fields. Deoxyribonucleic acid (DNA) microarray analysis is one of the modern cancer diagnosis techniques used by scientists to measure the gene expression level changes in gene expression data. From the perspective of computing, an algorithm can be developed to identify more difficult cases. Numerous cancer studies have combined different machine learning techniques for the cancer diagnosis. This study is conducted to improve the cancer diagnosis technique, directed random walk (DRW) from the direction of framework. Improved directed random walk framework is proposed with the new introduced sub-algorithms, a larger directed graph and a different classifier. It is named as significant directed walk (SDW). In this study, six gene expression datasets are applied to study the effectiveness of the sub-algorithm, directed graph and classifier in SDW in terms of cancer prediction and cancer classification. Sub-algorithms of SDW can be further divided into data pre-processing phase, specific tuning parameter selection, weight as additional variable, and exclusion of unwanted adjacency matrix. Besides that, SDW also incorporated four directed graphs to study the usability of the directed graph. The best directed graph among the four is chosen to be part of the structure in SDW. The experimental results showed that the combination of SDW with walker network and linear regression is the best among all. SDW is achieves accuracy of 95.03% in average which is higher by 8.97% compare to conventional DRW for all cancer datasets. This study provides a foundation for further studies and research on early diagnosis of cancer with machine learning technique. It is found that these findings would improve the early diagnosis methods of cancer classification. |
first_indexed | 2024-03-05T20:59:38Z |
format | Article |
id | utm.eprints-93387 |
institution | Universiti Teknologi Malaysia - ePrints |
last_indexed | 2024-03-05T20:59:38Z |
publishDate | 2021 |
publisher | Springer Science and Business Media Deutschland GmbH |
record_format | dspace |
spelling | utm.eprints-933872021-11-30T08:20:57Z http://eprints.utm.my/93387/ Significant directed walk framework to increase the accuracy of cancer classification using gene expression data Seah, C. S. Kasim, S. Md. Fudzee, M. F. Hassan, R. QA75 Electronic computers. Computer science Early diagnosis methods in cancer diagnosis studies are making great challenge as they require the involvement of different fields. Deoxyribonucleic acid (DNA) microarray analysis is one of the modern cancer diagnosis techniques used by scientists to measure the gene expression level changes in gene expression data. From the perspective of computing, an algorithm can be developed to identify more difficult cases. Numerous cancer studies have combined different machine learning techniques for the cancer diagnosis. This study is conducted to improve the cancer diagnosis technique, directed random walk (DRW) from the direction of framework. Improved directed random walk framework is proposed with the new introduced sub-algorithms, a larger directed graph and a different classifier. It is named as significant directed walk (SDW). In this study, six gene expression datasets are applied to study the effectiveness of the sub-algorithm, directed graph and classifier in SDW in terms of cancer prediction and cancer classification. Sub-algorithms of SDW can be further divided into data pre-processing phase, specific tuning parameter selection, weight as additional variable, and exclusion of unwanted adjacency matrix. Besides that, SDW also incorporated four directed graphs to study the usability of the directed graph. The best directed graph among the four is chosen to be part of the structure in SDW. The experimental results showed that the combination of SDW with walker network and linear regression is the best among all. SDW is achieves accuracy of 95.03% in average which is higher by 8.97% compare to conventional DRW for all cancer datasets. This study provides a foundation for further studies and research on early diagnosis of cancer with machine learning technique. It is found that these findings would improve the early diagnosis methods of cancer classification. Springer Science and Business Media Deutschland GmbH 2021-07 Article PeerReviewed Seah, C. S. and Kasim, S. and Md. Fudzee, M. F. and Hassan, R. (2021) Significant directed walk framework to increase the accuracy of cancer classification using gene expression data. Journal of Ambient Intelligence and Humanized Computing, 12 (7). pp. 7281-7298. ISSN 1868-5137 http://dx.doi.org/10.1007/s12652-020-02404-1 DOI: 10.1007/s12652-020-02404-1 |
spellingShingle | QA75 Electronic computers. Computer science Seah, C. S. Kasim, S. Md. Fudzee, M. F. Hassan, R. Significant directed walk framework to increase the accuracy of cancer classification using gene expression data |
title | Significant directed walk framework to increase the accuracy of cancer classification using gene expression data |
title_full | Significant directed walk framework to increase the accuracy of cancer classification using gene expression data |
title_fullStr | Significant directed walk framework to increase the accuracy of cancer classification using gene expression data |
title_full_unstemmed | Significant directed walk framework to increase the accuracy of cancer classification using gene expression data |
title_short | Significant directed walk framework to increase the accuracy of cancer classification using gene expression data |
title_sort | significant directed walk framework to increase the accuracy of cancer classification using gene expression data |
topic | QA75 Electronic computers. Computer science |
work_keys_str_mv | AT seahcs significantdirectedwalkframeworktoincreasetheaccuracyofcancerclassificationusinggeneexpressiondata AT kasims significantdirectedwalkframeworktoincreasetheaccuracyofcancerclassificationusinggeneexpressiondata AT mdfudzeemf significantdirectedwalkframeworktoincreasetheaccuracyofcancerclassificationusinggeneexpressiondata AT hassanr significantdirectedwalkframeworktoincreasetheaccuracyofcancerclassificationusinggeneexpressiondata |