Graph Embedding Deep Learning Guides Microbial Biomarkers' Identification

The microbiome-wide association studies are to figure out the relationship between microorganisms and humans, with the goal of discovering relevant biomarkers to guide disease diagnosis. However, the microbiome data is complex, with high noise and dimensions. Traditional machine learning methods are...

Full description

Bibliographic Details
Main Authors: Qiang Zhu, Xingpeng Jiang, Qing Zhu, Min Pan, Tingting He
Format: Article
Language:English
Published: Frontiers Media S.A. 2019-11-01
Series:Frontiers in Genetics
Online Access:https://www.frontiersin.org/article/10.3389/fgene.2019.01182/full
_version_ 1818255716450828288
author Qiang Zhu
Qiang Zhu
Xingpeng Jiang
Xingpeng Jiang
Qing Zhu
Qing Zhu
Min Pan
Min Pan
Tingting He
Tingting He
author_facet Qiang Zhu
Qiang Zhu
Xingpeng Jiang
Xingpeng Jiang
Qing Zhu
Qing Zhu
Min Pan
Min Pan
Tingting He
Tingting He
author_sort Qiang Zhu
collection DOAJ
description The microbiome-wide association studies are to figure out the relationship between microorganisms and humans, with the goal of discovering relevant biomarkers to guide disease diagnosis. However, the microbiome data is complex, with high noise and dimensions. Traditional machine learning methods are limited by the models' representation ability and cannot learn complex patterns from the data. Recently, deep learning has been widely applied to fields ranging from text processing to image recognition due to its efficient flexibility and high capacity. But the deep learning models must be trained with enough data in order to achieve good performance, which is impractical in reality. In addition, deep learning is considered as black box and hard to interpret. These factors make deep learning not widely used in microbiome-wide association studies. In this work, we construct a sparse microbial interaction network and embed this graph into deep model to alleviate the risk of overfitting and improve the performance. Further, we explore a Graph Embedding Deep Feedforward Network (GEDFN) to conduct feature selection and guide meaningful microbial markers' identification. Based on the experimental results, we verify the feasibility of combining the microbial graph model with the deep learning model, and demonstrate the feasibility of applying deep learning and feature selection on microbial data. Our main contributions are: firstly, we utilize different methods to construct a variety of microbial interaction networks and combine the network via graph embedding deep learning. Secondly, we introduce a feature selection method based on graph embedding and validate the biological meaning of microbial markers. The code is available at https://github.com/MicroAVA/GEDFN.git.
first_indexed 2024-12-12T17:16:17Z
format Article
id doaj.art-bea00725716d4b4b97a9286d5e6ce2dd
institution Directory Open Access Journal
issn 1664-8021
language English
last_indexed 2024-12-12T17:16:17Z
publishDate 2019-11-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Genetics
spelling doaj.art-bea00725716d4b4b97a9286d5e6ce2dd2022-12-22T00:17:47ZengFrontiers Media S.A.Frontiers in Genetics1664-80212019-11-011010.3389/fgene.2019.01182491009Graph Embedding Deep Learning Guides Microbial Biomarkers' IdentificationQiang Zhu0Qiang Zhu1Xingpeng Jiang2Xingpeng Jiang3Qing Zhu4Qing Zhu5Min Pan6Min Pan7Tingting He8Tingting He9School of Information Management, Central China Normal University, Wuhan, ChinaSchool of Computer, Central China Normal University, Wuhan, ChinaSchool of Computer, Central China Normal University, Wuhan, ChinaHubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, Wuhan, ChinaSchool of Computer, Central China Normal University, Wuhan, ChinaHubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, Wuhan, ChinaSchool of Computer, Central China Normal University, Wuhan, ChinaHubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, Wuhan, ChinaSchool of Computer, Central China Normal University, Wuhan, ChinaHubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, Wuhan, ChinaThe microbiome-wide association studies are to figure out the relationship between microorganisms and humans, with the goal of discovering relevant biomarkers to guide disease diagnosis. However, the microbiome data is complex, with high noise and dimensions. Traditional machine learning methods are limited by the models' representation ability and cannot learn complex patterns from the data. Recently, deep learning has been widely applied to fields ranging from text processing to image recognition due to its efficient flexibility and high capacity. But the deep learning models must be trained with enough data in order to achieve good performance, which is impractical in reality. In addition, deep learning is considered as black box and hard to interpret. These factors make deep learning not widely used in microbiome-wide association studies. In this work, we construct a sparse microbial interaction network and embed this graph into deep model to alleviate the risk of overfitting and improve the performance. Further, we explore a Graph Embedding Deep Feedforward Network (GEDFN) to conduct feature selection and guide meaningful microbial markers' identification. Based on the experimental results, we verify the feasibility of combining the microbial graph model with the deep learning model, and demonstrate the feasibility of applying deep learning and feature selection on microbial data. Our main contributions are: firstly, we utilize different methods to construct a variety of microbial interaction networks and combine the network via graph embedding deep learning. Secondly, we introduce a feature selection method based on graph embedding and validate the biological meaning of microbial markers. The code is available at https://github.com/MicroAVA/GEDFN.git.https://www.frontiersin.org/article/10.3389/fgene.2019.01182/full
spellingShingle Qiang Zhu
Qiang Zhu
Xingpeng Jiang
Xingpeng Jiang
Qing Zhu
Qing Zhu
Min Pan
Min Pan
Tingting He
Tingting He
Graph Embedding Deep Learning Guides Microbial Biomarkers' Identification
Frontiers in Genetics
title Graph Embedding Deep Learning Guides Microbial Biomarkers' Identification
title_full Graph Embedding Deep Learning Guides Microbial Biomarkers' Identification
title_fullStr Graph Embedding Deep Learning Guides Microbial Biomarkers' Identification
title_full_unstemmed Graph Embedding Deep Learning Guides Microbial Biomarkers' Identification
title_short Graph Embedding Deep Learning Guides Microbial Biomarkers' Identification
title_sort graph embedding deep learning guides microbial biomarkers identification
url https://www.frontiersin.org/article/10.3389/fgene.2019.01182/full
work_keys_str_mv AT qiangzhu graphembeddingdeeplearningguidesmicrobialbiomarkersidentification
AT qiangzhu graphembeddingdeeplearningguidesmicrobialbiomarkersidentification
AT xingpengjiang graphembeddingdeeplearningguidesmicrobialbiomarkersidentification
AT xingpengjiang graphembeddingdeeplearningguidesmicrobialbiomarkersidentification
AT qingzhu graphembeddingdeeplearningguidesmicrobialbiomarkersidentification
AT qingzhu graphembeddingdeeplearningguidesmicrobialbiomarkersidentification
AT minpan graphembeddingdeeplearningguidesmicrobialbiomarkersidentification
AT minpan graphembeddingdeeplearningguidesmicrobialbiomarkersidentification
AT tingtinghe graphembeddingdeeplearningguidesmicrobialbiomarkersidentification
AT tingtinghe graphembeddingdeeplearningguidesmicrobialbiomarkersidentification