Statistical integration of multi-omics and drug screening data from cell lines.

Data integration methods are used to obtain a unified summary of multiple datasets. For multi-modal data, we propose a computational workflow to jointly analyze datasets from cell lines. The workflow comprises a novel probabilistic data integration method, named POPLS-DA, for multi-omics data. The w...

Full description

Bibliographic Details
Main Authors:	Said El Bouhaddani, Matthias Höllerhage, Hae-Won Uh, Claudia Moebius, Marc Bickle, Günter Höglinger, Jeanine Houwing-Duistermaat
Format:	Article
Language:	English
Published:	Public Library of Science (PLoS) 2024-01-01
Series:	PLoS Computational Biology
Online Access:	https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1011809&type=printable

_version_	1797296236499304448
author	Said El Bouhaddani Matthias Höllerhage Hae-Won Uh Claudia Moebius Marc Bickle Günter Höglinger Jeanine Houwing-Duistermaat
author_facet	Said El Bouhaddani Matthias Höllerhage Hae-Won Uh Claudia Moebius Marc Bickle Günter Höglinger Jeanine Houwing-Duistermaat
author_sort	Said El Bouhaddani
collection	DOAJ
description	Data integration methods are used to obtain a unified summary of multiple datasets. For multi-modal data, we propose a computational workflow to jointly analyze datasets from cell lines. The workflow comprises a novel probabilistic data integration method, named POPLS-DA, for multi-omics data. The workflow is motivated by a study on synucleinopathies where transcriptomics, proteomics, and drug screening data are measured in affected LUHMES cell lines and controls. The aim is to highlight potentially druggable pathways and genes involved in synucleinopathies. First, POPLS-DA is used to prioritize genes and proteins that best distinguish cases and controls. For these genes, an integrated interaction network is constructed where the drug screen data is incorporated to highlight druggable genes and pathways in the network. Finally, functional enrichment analyses are performed to identify clusters of synaptic and lysosome-related genes and proteins targeted by the protective drugs. POPLS-DA is compared to other single- and multi-omics approaches. We found that HSPA5, a member of the heat shock protein 70 family, was one of the most targeted genes by the validated drugs, in particular by AT1-blockers. HSPA5 and AT1-blockers have been previously linked to α-synuclein pathology and Parkinson's disease, showing the relevance of our findings. Our computational workflow identified new directions for therapeutic targets for synucleinopathies. POPLS-DA provided a larger interpretable gene set than other single- and multi-omic approaches. An implementation based on R and markdown is freely available online.
first_indexed	2024-03-07T22:00:30Z
format	Article
id	doaj.art-d1b5e0f2662c41419ef3577fecd9581c
institution	Directory Open Access Journal
issn	1553-734X 1553-7358
language	English
last_indexed	2024-03-07T22:00:30Z
publishDate	2024-01-01
publisher	Public Library of Science (PLoS)
record_format	Article
series	PLoS Computational Biology
spelling	doaj.art-d1b5e0f2662c41419ef3577fecd9581c2024-02-24T05:31:22ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582024-01-01201e101180910.1371/journal.pcbi.1011809Statistical integration of multi-omics and drug screening data from cell lines.Said El BouhaddaniMatthias HöllerhageHae-Won UhClaudia MoebiusMarc BickleGünter HöglingerJeanine Houwing-DuistermaatData integration methods are used to obtain a unified summary of multiple datasets. For multi-modal data, we propose a computational workflow to jointly analyze datasets from cell lines. The workflow comprises a novel probabilistic data integration method, named POPLS-DA, for multi-omics data. The workflow is motivated by a study on synucleinopathies where transcriptomics, proteomics, and drug screening data are measured in affected LUHMES cell lines and controls. The aim is to highlight potentially druggable pathways and genes involved in synucleinopathies. First, POPLS-DA is used to prioritize genes and proteins that best distinguish cases and controls. For these genes, an integrated interaction network is constructed where the drug screen data is incorporated to highlight druggable genes and pathways in the network. Finally, functional enrichment analyses are performed to identify clusters of synaptic and lysosome-related genes and proteins targeted by the protective drugs. POPLS-DA is compared to other single- and multi-omics approaches. We found that HSPA5, a member of the heat shock protein 70 family, was one of the most targeted genes by the validated drugs, in particular by AT1-blockers. HSPA5 and AT1-blockers have been previously linked to α-synuclein pathology and Parkinson's disease, showing the relevance of our findings. Our computational workflow identified new directions for therapeutic targets for synucleinopathies. POPLS-DA provided a larger interpretable gene set than other single- and multi-omic approaches. An implementation based on R and markdown is freely available online.https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1011809&type=printable
spellingShingle	Said El Bouhaddani Matthias Höllerhage Hae-Won Uh Claudia Moebius Marc Bickle Günter Höglinger Jeanine Houwing-Duistermaat Statistical integration of multi-omics and drug screening data from cell lines. PLoS Computational Biology
title	Statistical integration of multi-omics and drug screening data from cell lines.
title_full	Statistical integration of multi-omics and drug screening data from cell lines.
title_fullStr	Statistical integration of multi-omics and drug screening data from cell lines.
title_full_unstemmed	Statistical integration of multi-omics and drug screening data from cell lines.
title_short	Statistical integration of multi-omics and drug screening data from cell lines.
title_sort	statistical integration of multi omics and drug screening data from cell lines
url	https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1011809&type=printable
work_keys_str_mv	AT saidelbouhaddani statisticalintegrationofmultiomicsanddrugscreeningdatafromcelllines AT matthiashollerhage statisticalintegrationofmultiomicsanddrugscreeningdatafromcelllines AT haewonuh statisticalintegrationofmultiomicsanddrugscreeningdatafromcelllines AT claudiamoebius statisticalintegrationofmultiomicsanddrugscreeningdatafromcelllines AT marcbickle statisticalintegrationofmultiomicsanddrugscreeningdatafromcelllines AT gunterhoglinger statisticalintegrationofmultiomicsanddrugscreeningdatafromcelllines AT jeaninehouwingduistermaat statisticalintegrationofmultiomicsanddrugscreeningdatafromcelllines

Statistical integration of multi-omics and drug screening data from cell lines.

Similar Items