An effective drug-disease associations prediction model based on graphic representation learning over multi-biomolecular network

Abstract Background Drug-disease associations (DDAs) can provide important information for exploring the potential efficacy of drugs. However, up to now, there are still few DDAs verified by experiments. Previous evidence indicates that the combination of information would be conducive to the discov...

Full description

Bibliographic Details
Main Authors: Hanjing Jiang, Yabing Huang
Format: Article
Language:English
Published: BMC 2022-01-01
Series:BMC Bioinformatics
Subjects:
Online Access:https://doi.org/10.1186/s12859-021-04553-2
_version_ 1828145305011355648
author Hanjing Jiang
Yabing Huang
author_facet Hanjing Jiang
Yabing Huang
author_sort Hanjing Jiang
collection DOAJ
description Abstract Background Drug-disease associations (DDAs) can provide important information for exploring the potential efficacy of drugs. However, up to now, there are still few DDAs verified by experiments. Previous evidence indicates that the combination of information would be conducive to the discovery of new DDAs. How to integrate different biological data sources and identify the most effective drugs for a certain disease based on drug-disease coupled mechanisms is still a challenging problem. Results In this paper, we proposed a novel computation model for DDA predictions based on graph representation learning over multi-biomolecular network (GRLMN). More specifically, we firstly constructed a large-scale molecular association network (MAN) by integrating the associations among drugs, diseases, proteins, miRNAs, and lncRNAs. Then, a graph embedding model was used to learn vector representations for all drugs and diseases in MAN. Finally, the combined features were fed to a random forest (RF) model to predict new DDAs. The proposed model was evaluated on the SCMFDD-S data set using five-fold cross-validation. Experiment results showed that GRLMN model was very accurate with the area under the ROC curve (AUC) of 87.9%, which outperformed all previous works in terms of both accuracy and AUC in benchmark dataset. To further verify the high performance of GRLMN, we carried out two case studies for two common diseases. As a result, in the ranking of drugs that were predicted to be related to certain diseases (such as kidney disease and fever), 15 of the top 20 drugs have been experimentally confirmed. Conclusions The experimental results show that our model has good performance in the prediction of DDA. GRLMN is an effective prioritization tool for screening the reliable DDAs for follow-up studies concerning their participation in drug reposition.
first_indexed 2024-04-11T20:31:00Z
format Article
id doaj.art-faa6a6f7af624c37a947e43b9da5df50
institution Directory Open Access Journal
issn 1471-2105
language English
last_indexed 2024-04-11T20:31:00Z
publishDate 2022-01-01
publisher BMC
record_format Article
series BMC Bioinformatics
spelling doaj.art-faa6a6f7af624c37a947e43b9da5df502022-12-22T04:04:31ZengBMCBMC Bioinformatics1471-21052022-01-0123111710.1186/s12859-021-04553-2An effective drug-disease associations prediction model based on graphic representation learning over multi-biomolecular networkHanjing Jiang0Yabing Huang1Key Laboratory of Image Information Processing and Intelligent Control of Education Ministry of China, Institute of Artificial Intelligence, School of Artificial Intelligence and Automation, Huazhong University of Science and TechnologyDepartment of Pathology, Renmin Hospital of Wuhan UniversityAbstract Background Drug-disease associations (DDAs) can provide important information for exploring the potential efficacy of drugs. However, up to now, there are still few DDAs verified by experiments. Previous evidence indicates that the combination of information would be conducive to the discovery of new DDAs. How to integrate different biological data sources and identify the most effective drugs for a certain disease based on drug-disease coupled mechanisms is still a challenging problem. Results In this paper, we proposed a novel computation model for DDA predictions based on graph representation learning over multi-biomolecular network (GRLMN). More specifically, we firstly constructed a large-scale molecular association network (MAN) by integrating the associations among drugs, diseases, proteins, miRNAs, and lncRNAs. Then, a graph embedding model was used to learn vector representations for all drugs and diseases in MAN. Finally, the combined features were fed to a random forest (RF) model to predict new DDAs. The proposed model was evaluated on the SCMFDD-S data set using five-fold cross-validation. Experiment results showed that GRLMN model was very accurate with the area under the ROC curve (AUC) of 87.9%, which outperformed all previous works in terms of both accuracy and AUC in benchmark dataset. To further verify the high performance of GRLMN, we carried out two case studies for two common diseases. As a result, in the ranking of drugs that were predicted to be related to certain diseases (such as kidney disease and fever), 15 of the top 20 drugs have been experimentally confirmed. Conclusions The experimental results show that our model has good performance in the prediction of DDA. GRLMN is an effective prioritization tool for screening the reliable DDAs for follow-up studies concerning their participation in drug reposition.https://doi.org/10.1186/s12859-021-04553-2Drug-disease associationGraph representation learningMulti-biomolecular network
spellingShingle Hanjing Jiang
Yabing Huang
An effective drug-disease associations prediction model based on graphic representation learning over multi-biomolecular network
BMC Bioinformatics
Drug-disease association
Graph representation learning
Multi-biomolecular network
title An effective drug-disease associations prediction model based on graphic representation learning over multi-biomolecular network
title_full An effective drug-disease associations prediction model based on graphic representation learning over multi-biomolecular network
title_fullStr An effective drug-disease associations prediction model based on graphic representation learning over multi-biomolecular network
title_full_unstemmed An effective drug-disease associations prediction model based on graphic representation learning over multi-biomolecular network
title_short An effective drug-disease associations prediction model based on graphic representation learning over multi-biomolecular network
title_sort effective drug disease associations prediction model based on graphic representation learning over multi biomolecular network
topic Drug-disease association
Graph representation learning
Multi-biomolecular network
url https://doi.org/10.1186/s12859-021-04553-2
work_keys_str_mv AT hanjingjiang aneffectivedrugdiseaseassociationspredictionmodelbasedongraphicrepresentationlearningovermultibiomolecularnetwork
AT yabinghuang aneffectivedrugdiseaseassociationspredictionmodelbasedongraphicrepresentationlearningovermultibiomolecularnetwork
AT hanjingjiang effectivedrugdiseaseassociationspredictionmodelbasedongraphicrepresentationlearningovermultibiomolecularnetwork
AT yabinghuang effectivedrugdiseaseassociationspredictionmodelbasedongraphicrepresentationlearningovermultibiomolecularnetwork