Significant directed walk framework to increase the accuracy of cancer classification using gene expression data

Early diagnosis methods in cancer diagnosis studies are making great challenge as they require the involvement of different fields. Deoxyribonucleic acid (DNA) microarray analysis is one of the modern cancer diagnosis techniques used by scientists to measure the gene expression level changes in gene...

Full description

Bibliographic Details
Main Authors: Seah, C. S., Kasim, S., Md. Fudzee, M. F., Hassan, R.
Format: Article
Published: Springer Science and Business Media Deutschland GmbH 2021
Subjects:
_version_ 1796865611591057408
author Seah, C. S.
Kasim, S.
Md. Fudzee, M. F.
Hassan, R.
author_facet Seah, C. S.
Kasim, S.
Md. Fudzee, M. F.
Hassan, R.
author_sort Seah, C. S.
collection ePrints
description Early diagnosis methods in cancer diagnosis studies are making great challenge as they require the involvement of different fields. Deoxyribonucleic acid (DNA) microarray analysis is one of the modern cancer diagnosis techniques used by scientists to measure the gene expression level changes in gene expression data. From the perspective of computing, an algorithm can be developed to identify more difficult cases. Numerous cancer studies have combined different machine learning techniques for the cancer diagnosis. This study is conducted to improve the cancer diagnosis technique, directed random walk (DRW) from the direction of framework. Improved directed random walk framework is proposed with the new introduced sub-algorithms, a larger directed graph and a different classifier. It is named as significant directed walk (SDW). In this study, six gene expression datasets are applied to study the effectiveness of the sub-algorithm, directed graph and classifier in SDW in terms of cancer prediction and cancer classification. Sub-algorithms of SDW can be further divided into data pre-processing phase, specific tuning parameter selection, weight as additional variable, and exclusion of unwanted adjacency matrix. Besides that, SDW also incorporated four directed graphs to study the usability of the directed graph. The best directed graph among the four is chosen to be part of the structure in SDW. The experimental results showed that the combination of SDW with walker network and linear regression is the best among all. SDW is achieves accuracy of 95.03% in average which is higher by 8.97% compare to conventional DRW for all cancer datasets. This study provides a foundation for further studies and research on early diagnosis of cancer with machine learning technique. It is found that these findings would improve the early diagnosis methods of cancer classification.
first_indexed 2024-03-05T20:59:38Z
format Article
id utm.eprints-93387
institution Universiti Teknologi Malaysia - ePrints
last_indexed 2024-03-05T20:59:38Z
publishDate 2021
publisher Springer Science and Business Media Deutschland GmbH
record_format dspace
spelling utm.eprints-933872021-11-30T08:20:57Z http://eprints.utm.my/93387/ Significant directed walk framework to increase the accuracy of cancer classification using gene expression data Seah, C. S. Kasim, S. Md. Fudzee, M. F. Hassan, R. QA75 Electronic computers. Computer science Early diagnosis methods in cancer diagnosis studies are making great challenge as they require the involvement of different fields. Deoxyribonucleic acid (DNA) microarray analysis is one of the modern cancer diagnosis techniques used by scientists to measure the gene expression level changes in gene expression data. From the perspective of computing, an algorithm can be developed to identify more difficult cases. Numerous cancer studies have combined different machine learning techniques for the cancer diagnosis. This study is conducted to improve the cancer diagnosis technique, directed random walk (DRW) from the direction of framework. Improved directed random walk framework is proposed with the new introduced sub-algorithms, a larger directed graph and a different classifier. It is named as significant directed walk (SDW). In this study, six gene expression datasets are applied to study the effectiveness of the sub-algorithm, directed graph and classifier in SDW in terms of cancer prediction and cancer classification. Sub-algorithms of SDW can be further divided into data pre-processing phase, specific tuning parameter selection, weight as additional variable, and exclusion of unwanted adjacency matrix. Besides that, SDW also incorporated four directed graphs to study the usability of the directed graph. The best directed graph among the four is chosen to be part of the structure in SDW. The experimental results showed that the combination of SDW with walker network and linear regression is the best among all. SDW is achieves accuracy of 95.03% in average which is higher by 8.97% compare to conventional DRW for all cancer datasets. This study provides a foundation for further studies and research on early diagnosis of cancer with machine learning technique. It is found that these findings would improve the early diagnosis methods of cancer classification. Springer Science and Business Media Deutschland GmbH 2021-07 Article PeerReviewed Seah, C. S. and Kasim, S. and Md. Fudzee, M. F. and Hassan, R. (2021) Significant directed walk framework to increase the accuracy of cancer classification using gene expression data. Journal of Ambient Intelligence and Humanized Computing, 12 (7). pp. 7281-7298. ISSN 1868-5137 http://dx.doi.org/10.1007/s12652-020-02404-1 DOI: 10.1007/s12652-020-02404-1
spellingShingle QA75 Electronic computers. Computer science
Seah, C. S.
Kasim, S.
Md. Fudzee, M. F.
Hassan, R.
Significant directed walk framework to increase the accuracy of cancer classification using gene expression data
title Significant directed walk framework to increase the accuracy of cancer classification using gene expression data
title_full Significant directed walk framework to increase the accuracy of cancer classification using gene expression data
title_fullStr Significant directed walk framework to increase the accuracy of cancer classification using gene expression data
title_full_unstemmed Significant directed walk framework to increase the accuracy of cancer classification using gene expression data
title_short Significant directed walk framework to increase the accuracy of cancer classification using gene expression data
title_sort significant directed walk framework to increase the accuracy of cancer classification using gene expression data
topic QA75 Electronic computers. Computer science
work_keys_str_mv AT seahcs significantdirectedwalkframeworktoincreasetheaccuracyofcancerclassificationusinggeneexpressiondata
AT kasims significantdirectedwalkframeworktoincreasetheaccuracyofcancerclassificationusinggeneexpressiondata
AT mdfudzeemf significantdirectedwalkframeworktoincreasetheaccuracyofcancerclassificationusinggeneexpressiondata
AT hassanr significantdirectedwalkframeworktoincreasetheaccuracyofcancerclassificationusinggeneexpressiondata