Deep-Learning Algorithm and Concomitant Biomarker Identification for NSCLC Prediction Using Multi-Omics Data Integration

Early diagnosis of lung cancer to increase the survival rate, which is currently at a low range of mid-30%, remains a critical need. Despite this, multi-omics data have rarely been applied to non-small-cell lung cancer (NSCLC) diagnosis. We developed a multi-omics data-affinitive artificial intellig...

Full description

Bibliographic Details
Main Authors: Min-Koo Park, Jin-Muk Lim, Jinwoo Jeong, Yeongjae Jang, Ji-Won Lee, Jeong-Chan Lee, Hyungyu Kim, Euiyul Koh, Sung-Joo Hwang, Hong-Gee Kim, Keun-Cheol Kim
Format: Article
Language:English
Published: MDPI AG 2022-12-01
Series:Biomolecules
Subjects:
Online Access:https://www.mdpi.com/2218-273X/12/12/1839
_version_ 1797461193412050944
author Min-Koo Park
Jin-Muk Lim
Jinwoo Jeong
Yeongjae Jang
Ji-Won Lee
Jeong-Chan Lee
Hyungyu Kim
Euiyul Koh
Sung-Joo Hwang
Hong-Gee Kim
Keun-Cheol Kim
author_facet Min-Koo Park
Jin-Muk Lim
Jinwoo Jeong
Yeongjae Jang
Ji-Won Lee
Jeong-Chan Lee
Hyungyu Kim
Euiyul Koh
Sung-Joo Hwang
Hong-Gee Kim
Keun-Cheol Kim
author_sort Min-Koo Park
collection DOAJ
description Early diagnosis of lung cancer to increase the survival rate, which is currently at a low range of mid-30%, remains a critical need. Despite this, multi-omics data have rarely been applied to non-small-cell lung cancer (NSCLC) diagnosis. We developed a multi-omics data-affinitive artificial intelligence algorithm based on the graph convolutional network that integrates mRNA expression, DNA methylation, and DNA sequencing data. This NSCLC prediction model achieved a 93.7% macro F1-score, indicating that values for false positives and negatives were substantially low, which is desirable for accurate classification. Gene ontology enrichment and pathway analysis of features revealed that two major subtypes of NSCLC, lung adenocarcinoma and lung squamous cell carcinoma, have both specific and common GO biological processes. Numerous biomarkers (i.e., microRNA, long non-coding RNA, differentially methylated regions) were newly identified, whereas some biomarkers were consistent with previous findings in NSCLC (e.g., <i>SPRR1B</i>). Thus, using multi-omics data integration, we developed a promising cancer prediction algorithm.
first_indexed 2024-03-09T17:15:54Z
format Article
id doaj.art-0e081868891d403eb15cb39af5c40538
institution Directory Open Access Journal
issn 2218-273X
language English
last_indexed 2024-03-09T17:15:54Z
publishDate 2022-12-01
publisher MDPI AG
record_format Article
series Biomolecules
spelling doaj.art-0e081868891d403eb15cb39af5c405382023-11-24T13:34:14ZengMDPI AGBiomolecules2218-273X2022-12-011212183910.3390/biom12121839Deep-Learning Algorithm and Concomitant Biomarker Identification for NSCLC Prediction Using Multi-Omics Data IntegrationMin-Koo Park0Jin-Muk Lim1Jinwoo Jeong2Yeongjae Jang3Ji-Won Lee4Jeong-Chan Lee5Hyungyu Kim6Euiyul Koh7Sung-Joo Hwang8Hong-Gee Kim9Keun-Cheol Kim10Department of Biological Sciences, College of Natural Sciences, Kangwon National University, Chuncheon 24341, Republic of KoreaBiomedical Knowledge Engineering Laboratory, School of Dentistry and Dental Research Institute, Seoul National University, Seoul 08826, Republic of KoreaAI Institute, Alopax-Algo, Co., Ltd., Seoul 06978, Republic of KoreaMedical AI Team, Jonathan Wellcare Division, Acryl, Inc., Seoul 06069, Republic of KoreaHugenebio Institute, Bio-Innovation Park, Erom, Inc., Chuncheon 24427, Republic of KoreaHugenebio Institute, Bio-Innovation Park, Erom, Inc., Chuncheon 24427, Republic of KoreaMedical AI Team, Jonathan Wellcare Division, Acryl, Inc., Seoul 06069, Republic of KoreaMedical AI Team, Jonathan Wellcare Division, Acryl, Inc., Seoul 06069, Republic of KoreaIntegrated Medicine Institute, Loving Care Hospital, Seongnam 463400, Republic of KoreaBiomedical Knowledge Engineering Laboratory, School of Dentistry and Dental Research Institute, Seoul National University, Seoul 08826, Republic of KoreaDepartment of Biological Sciences, College of Natural Sciences, Kangwon National University, Chuncheon 24341, Republic of KoreaEarly diagnosis of lung cancer to increase the survival rate, which is currently at a low range of mid-30%, remains a critical need. Despite this, multi-omics data have rarely been applied to non-small-cell lung cancer (NSCLC) diagnosis. We developed a multi-omics data-affinitive artificial intelligence algorithm based on the graph convolutional network that integrates mRNA expression, DNA methylation, and DNA sequencing data. This NSCLC prediction model achieved a 93.7% macro F1-score, indicating that values for false positives and negatives were substantially low, which is desirable for accurate classification. Gene ontology enrichment and pathway analysis of features revealed that two major subtypes of NSCLC, lung adenocarcinoma and lung squamous cell carcinoma, have both specific and common GO biological processes. Numerous biomarkers (i.e., microRNA, long non-coding RNA, differentially methylated regions) were newly identified, whereas some biomarkers were consistent with previous findings in NSCLC (e.g., <i>SPRR1B</i>). Thus, using multi-omics data integration, we developed a promising cancer prediction algorithm.https://www.mdpi.com/2218-273X/12/12/1839non-small-cell lung cancerdeep learninggraph convolutional networkcancer predictionbiomarkergene ontology enrichment
spellingShingle Min-Koo Park
Jin-Muk Lim
Jinwoo Jeong
Yeongjae Jang
Ji-Won Lee
Jeong-Chan Lee
Hyungyu Kim
Euiyul Koh
Sung-Joo Hwang
Hong-Gee Kim
Keun-Cheol Kim
Deep-Learning Algorithm and Concomitant Biomarker Identification for NSCLC Prediction Using Multi-Omics Data Integration
Biomolecules
non-small-cell lung cancer
deep learning
graph convolutional network
cancer prediction
biomarker
gene ontology enrichment
title Deep-Learning Algorithm and Concomitant Biomarker Identification for NSCLC Prediction Using Multi-Omics Data Integration
title_full Deep-Learning Algorithm and Concomitant Biomarker Identification for NSCLC Prediction Using Multi-Omics Data Integration
title_fullStr Deep-Learning Algorithm and Concomitant Biomarker Identification for NSCLC Prediction Using Multi-Omics Data Integration
title_full_unstemmed Deep-Learning Algorithm and Concomitant Biomarker Identification for NSCLC Prediction Using Multi-Omics Data Integration
title_short Deep-Learning Algorithm and Concomitant Biomarker Identification for NSCLC Prediction Using Multi-Omics Data Integration
title_sort deep learning algorithm and concomitant biomarker identification for nsclc prediction using multi omics data integration
topic non-small-cell lung cancer
deep learning
graph convolutional network
cancer prediction
biomarker
gene ontology enrichment
url https://www.mdpi.com/2218-273X/12/12/1839
work_keys_str_mv AT minkoopark deeplearningalgorithmandconcomitantbiomarkeridentificationfornsclcpredictionusingmultiomicsdataintegration
AT jinmuklim deeplearningalgorithmandconcomitantbiomarkeridentificationfornsclcpredictionusingmultiomicsdataintegration
AT jinwoojeong deeplearningalgorithmandconcomitantbiomarkeridentificationfornsclcpredictionusingmultiomicsdataintegration
AT yeongjaejang deeplearningalgorithmandconcomitantbiomarkeridentificationfornsclcpredictionusingmultiomicsdataintegration
AT jiwonlee deeplearningalgorithmandconcomitantbiomarkeridentificationfornsclcpredictionusingmultiomicsdataintegration
AT jeongchanlee deeplearningalgorithmandconcomitantbiomarkeridentificationfornsclcpredictionusingmultiomicsdataintegration
AT hyungyukim deeplearningalgorithmandconcomitantbiomarkeridentificationfornsclcpredictionusingmultiomicsdataintegration
AT euiyulkoh deeplearningalgorithmandconcomitantbiomarkeridentificationfornsclcpredictionusingmultiomicsdataintegration
AT sungjoohwang deeplearningalgorithmandconcomitantbiomarkeridentificationfornsclcpredictionusingmultiomicsdataintegration
AT honggeekim deeplearningalgorithmandconcomitantbiomarkeridentificationfornsclcpredictionusingmultiomicsdataintegration
AT keuncheolkim deeplearningalgorithmandconcomitantbiomarkeridentificationfornsclcpredictionusingmultiomicsdataintegration