Random walk with restart on multilayer networks: from node prioritisation to supervised link prediction and beyond
Abstract Background Biological networks have proven invaluable ability for representing biological knowledge. Multilayer networks, which gather different types of nodes and edges in multiplex, heterogeneous and bipartite networks, provide a natural way to integrate diverse and multi-scale data sourc...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2024-02-01
|
Series: | BMC Bioinformatics |
Subjects: | |
Online Access: | https://doi.org/10.1186/s12859-024-05683-z |
_version_ | 1797273050257817600 |
---|---|
author | Anthony Baptista Galadriel Brière Anaïs Baudot |
author_facet | Anthony Baptista Galadriel Brière Anaïs Baudot |
author_sort | Anthony Baptista |
collection | DOAJ |
description | Abstract Background Biological networks have proven invaluable ability for representing biological knowledge. Multilayer networks, which gather different types of nodes and edges in multiplex, heterogeneous and bipartite networks, provide a natural way to integrate diverse and multi-scale data sources into a common framework. Recently, we developed MultiXrank, a Random Walk with Restart algorithm able to explore such multilayer networks. MultiXrank outputs scores reflecting the proximity between an initial set of seed node(s) and all the other nodes in the multilayer network. We illustrate here the versatility of bioinformatics tasks that can be performed using MultiXrank. Results We first show that MultiXrank can be used to prioritise genes and drugs of interest by exploring multilayer networks containing interactions between genes, drugs, and diseases. In a second study, we illustrate how MultiXrank scores can also be used in a supervised strategy to train a binary classifier to predict gene-disease associations. The classifier performance are validated using outdated and novel gene-disease association for training and evaluation, respectively. Finally, we show that MultiXrank scores can be used to compute diffusion profiles and use them as disease signatures. We computed the diffusion profiles of more than 100 immune diseases using a multilayer network that includes cell-type specific genomic information. The clustering of the immune disease diffusion profiles reveals shared shared phenotypic characteristics. Conclusion Overall, we illustrate here diverse applications of MultiXrank to showcase its versatility. We expect that this can lead to further and broader bioinformatics applications. |
first_indexed | 2024-03-07T14:37:58Z |
format | Article |
id | doaj.art-9f1e34c21ed543a08beaa9d321659347 |
institution | Directory Open Access Journal |
issn | 1471-2105 |
language | English |
last_indexed | 2024-03-07T14:37:58Z |
publishDate | 2024-02-01 |
publisher | BMC |
record_format | Article |
series | BMC Bioinformatics |
spelling | doaj.art-9f1e34c21ed543a08beaa9d3216593472024-03-05T20:31:53ZengBMCBMC Bioinformatics1471-21052024-02-0125111910.1186/s12859-024-05683-zRandom walk with restart on multilayer networks: from node prioritisation to supervised link prediction and beyondAnthony Baptista0Galadriel Brière1Anaïs Baudot2School of Mathematical Sciences, Queen Mary University of LondonCNRS, I2M, Aix-Marseille UnivINSERM, MMG, Turing Center for Living Systems, Aix-Marseille UnivAbstract Background Biological networks have proven invaluable ability for representing biological knowledge. Multilayer networks, which gather different types of nodes and edges in multiplex, heterogeneous and bipartite networks, provide a natural way to integrate diverse and multi-scale data sources into a common framework. Recently, we developed MultiXrank, a Random Walk with Restart algorithm able to explore such multilayer networks. MultiXrank outputs scores reflecting the proximity between an initial set of seed node(s) and all the other nodes in the multilayer network. We illustrate here the versatility of bioinformatics tasks that can be performed using MultiXrank. Results We first show that MultiXrank can be used to prioritise genes and drugs of interest by exploring multilayer networks containing interactions between genes, drugs, and diseases. In a second study, we illustrate how MultiXrank scores can also be used in a supervised strategy to train a binary classifier to predict gene-disease associations. The classifier performance are validated using outdated and novel gene-disease association for training and evaluation, respectively. Finally, we show that MultiXrank scores can be used to compute diffusion profiles and use them as disease signatures. We computed the diffusion profiles of more than 100 immune diseases using a multilayer network that includes cell-type specific genomic information. The clustering of the immune disease diffusion profiles reveals shared shared phenotypic characteristics. Conclusion Overall, we illustrate here diverse applications of MultiXrank to showcase its versatility. We expect that this can lead to further and broader bioinformatics applications.https://doi.org/10.1186/s12859-024-05683-zMultilayer networkRandom walk with restartMulti-omics dataBiological network |
spellingShingle | Anthony Baptista Galadriel Brière Anaïs Baudot Random walk with restart on multilayer networks: from node prioritisation to supervised link prediction and beyond BMC Bioinformatics Multilayer network Random walk with restart Multi-omics data Biological network |
title | Random walk with restart on multilayer networks: from node prioritisation to supervised link prediction and beyond |
title_full | Random walk with restart on multilayer networks: from node prioritisation to supervised link prediction and beyond |
title_fullStr | Random walk with restart on multilayer networks: from node prioritisation to supervised link prediction and beyond |
title_full_unstemmed | Random walk with restart on multilayer networks: from node prioritisation to supervised link prediction and beyond |
title_short | Random walk with restart on multilayer networks: from node prioritisation to supervised link prediction and beyond |
title_sort | random walk with restart on multilayer networks from node prioritisation to supervised link prediction and beyond |
topic | Multilayer network Random walk with restart Multi-omics data Biological network |
url | https://doi.org/10.1186/s12859-024-05683-z |
work_keys_str_mv | AT anthonybaptista randomwalkwithrestartonmultilayernetworksfromnodeprioritisationtosupervisedlinkpredictionandbeyond AT galadrielbriere randomwalkwithrestartonmultilayernetworksfromnodeprioritisationtosupervisedlinkpredictionandbeyond AT anaisbaudot randomwalkwithrestartonmultilayernetworksfromnodeprioritisationtosupervisedlinkpredictionandbeyond |