Interpreting biologically informed neural networks for enhanced proteomic biomarker discovery and pathway analysis

Abstract The incorporation of machine learning methods into proteomics workflows improves the identification of disease-relevant biomarkers and biological pathways. However, machine learning models, such as deep neural networks, typically suffer from lack of interpretability. Here, we present a deep...

Full description

Bibliographic Details
Main Authors: Erik Hartman, Aaron M. Scott, Christofer Karlsson, Tirthankar Mohanty, Suvi T. Vaara, Adam Linder, Lars Malmström, Johan Malmström
Format: Article
Language:English
Published: Nature Portfolio 2023-09-01
Series:Nature Communications
Online Access:https://doi.org/10.1038/s41467-023-41146-4
_version_ 1797558380124962816
author Erik Hartman
Aaron M. Scott
Christofer Karlsson
Tirthankar Mohanty
Suvi T. Vaara
Adam Linder
Lars Malmström
Johan Malmström
author_facet Erik Hartman
Aaron M. Scott
Christofer Karlsson
Tirthankar Mohanty
Suvi T. Vaara
Adam Linder
Lars Malmström
Johan Malmström
author_sort Erik Hartman
collection DOAJ
description Abstract The incorporation of machine learning methods into proteomics workflows improves the identification of disease-relevant biomarkers and biological pathways. However, machine learning models, such as deep neural networks, typically suffer from lack of interpretability. Here, we present a deep learning approach to combine biological pathway analysis and biomarker identification to increase the interpretability of proteomics experiments. Our approach integrates a priori knowledge of the relationships between proteins and biological pathways and biological processes into sparse neural networks to create biologically informed neural networks. We employ these networks to differentiate between clinical subphenotypes of septic acute kidney injury and COVID-19, as well as acute respiratory distress syndrome of different aetiologies. To gain biological insight into the complex syndromes, we utilize feature attribution-methods to introspect the networks for the identification of proteins and pathways important for distinguishing between subtypes. The algorithms are implemented in a freely available open source Python-package ( https://github.com/InfectionMedicineProteomics/BINN ).
first_indexed 2024-03-10T17:30:41Z
format Article
id doaj.art-6ba2704448034c108462da0c1d21a25d
institution Directory Open Access Journal
issn 2041-1723
language English
last_indexed 2024-03-10T17:30:41Z
publishDate 2023-09-01
publisher Nature Portfolio
record_format Article
series Nature Communications
spelling doaj.art-6ba2704448034c108462da0c1d21a25d2023-11-20T10:02:51ZengNature PortfolioNature Communications2041-17232023-09-0114111310.1038/s41467-023-41146-4Interpreting biologically informed neural networks for enhanced proteomic biomarker discovery and pathway analysisErik Hartman0Aaron M. Scott1Christofer Karlsson2Tirthankar Mohanty3Suvi T. Vaara4Adam Linder5Lars Malmström6Johan Malmström7Division of Infection Medicine, Department of Clinical Sciences Lund, Faculty of Medicine, Lund UniversityDivision of Infection Medicine, Department of Clinical Sciences Lund, Faculty of Medicine, Lund UniversityDivision of Infection Medicine, Department of Clinical Sciences Lund, Faculty of Medicine, Lund UniversityDivision of Infection Medicine, Department of Clinical Sciences Lund, Faculty of Medicine, Lund UniversityDepartment of Perioperative and Intensive Care, University of Helsinki and Helsinki University HospitalDivision of Infection Medicine, Department of Clinical Sciences Lund, Faculty of Medicine, Lund UniversityDivision of Infection Medicine, Department of Clinical Sciences Lund, Faculty of Medicine, Lund UniversityDivision of Infection Medicine, Department of Clinical Sciences Lund, Faculty of Medicine, Lund UniversityAbstract The incorporation of machine learning methods into proteomics workflows improves the identification of disease-relevant biomarkers and biological pathways. However, machine learning models, such as deep neural networks, typically suffer from lack of interpretability. Here, we present a deep learning approach to combine biological pathway analysis and biomarker identification to increase the interpretability of proteomics experiments. Our approach integrates a priori knowledge of the relationships between proteins and biological pathways and biological processes into sparse neural networks to create biologically informed neural networks. We employ these networks to differentiate between clinical subphenotypes of septic acute kidney injury and COVID-19, as well as acute respiratory distress syndrome of different aetiologies. To gain biological insight into the complex syndromes, we utilize feature attribution-methods to introspect the networks for the identification of proteins and pathways important for distinguishing between subtypes. The algorithms are implemented in a freely available open source Python-package ( https://github.com/InfectionMedicineProteomics/BINN ).https://doi.org/10.1038/s41467-023-41146-4
spellingShingle Erik Hartman
Aaron M. Scott
Christofer Karlsson
Tirthankar Mohanty
Suvi T. Vaara
Adam Linder
Lars Malmström
Johan Malmström
Interpreting biologically informed neural networks for enhanced proteomic biomarker discovery and pathway analysis
Nature Communications
title Interpreting biologically informed neural networks for enhanced proteomic biomarker discovery and pathway analysis
title_full Interpreting biologically informed neural networks for enhanced proteomic biomarker discovery and pathway analysis
title_fullStr Interpreting biologically informed neural networks for enhanced proteomic biomarker discovery and pathway analysis
title_full_unstemmed Interpreting biologically informed neural networks for enhanced proteomic biomarker discovery and pathway analysis
title_short Interpreting biologically informed neural networks for enhanced proteomic biomarker discovery and pathway analysis
title_sort interpreting biologically informed neural networks for enhanced proteomic biomarker discovery and pathway analysis
url https://doi.org/10.1038/s41467-023-41146-4
work_keys_str_mv AT erikhartman interpretingbiologicallyinformedneuralnetworksforenhancedproteomicbiomarkerdiscoveryandpathwayanalysis
AT aaronmscott interpretingbiologicallyinformedneuralnetworksforenhancedproteomicbiomarkerdiscoveryandpathwayanalysis
AT christoferkarlsson interpretingbiologicallyinformedneuralnetworksforenhancedproteomicbiomarkerdiscoveryandpathwayanalysis
AT tirthankarmohanty interpretingbiologicallyinformedneuralnetworksforenhancedproteomicbiomarkerdiscoveryandpathwayanalysis
AT suvitvaara interpretingbiologicallyinformedneuralnetworksforenhancedproteomicbiomarkerdiscoveryandpathwayanalysis
AT adamlinder interpretingbiologicallyinformedneuralnetworksforenhancedproteomicbiomarkerdiscoveryandpathwayanalysis
AT larsmalmstrom interpretingbiologicallyinformedneuralnetworksforenhancedproteomicbiomarkerdiscoveryandpathwayanalysis
AT johanmalmstrom interpretingbiologicallyinformedneuralnetworksforenhancedproteomicbiomarkerdiscoveryandpathwayanalysis