A simple null model for inferences from network enrichment analysis.

A prevailing technique to infer function from lists of identifications, from molecular biological high-throughput experiments, is over-representation analysis, where the identifications are compared to predefined sets of related genes often referred to as pathways. As at least some pathways are know...

Full description

Bibliographic Details
Main Authors: Gustavo S Jeuken, Lukas Käll
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2018-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC6226187?pdf=render
_version_ 1817982857725870080
author Gustavo S Jeuken
Lukas Käll
author_facet Gustavo S Jeuken
Lukas Käll
author_sort Gustavo S Jeuken
collection DOAJ
description A prevailing technique to infer function from lists of identifications, from molecular biological high-throughput experiments, is over-representation analysis, where the identifications are compared to predefined sets of related genes often referred to as pathways. As at least some pathways are known to be incomplete in their annotation, algorithmic efforts have been made to complement them with information from functional association networks. While the terminology varies in the literature, we will here refer to such methods as Network Enrichment Analysis (NEA). Traditionally, the significance of inferences from NEA has been assigned using a null model constructed from randomizations of the network. Here we instead argue for a null model that more directly relates to the set of genes being studied, and have designed one dynamic programming algorithm that calculates the score distribution of NEA scores that makes it possible to assign unbiased mid p values to inferences. We also implemented a random sampling method, carrying out the same task. We demonstrate that our method obtains a superior statistical calibration as compared to the popular NEA inference engine, BinoX, while also providing statistics that are easier to interpret.
first_indexed 2024-04-13T23:26:34Z
format Article
id doaj.art-ecb04e6d5572469bb8a520130fb81573
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-04-13T23:26:34Z
publishDate 2018-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-ecb04e6d5572469bb8a520130fb815732022-12-22T02:25:03ZengPublic Library of Science (PLoS)PLoS ONE1932-62032018-01-011311e020686410.1371/journal.pone.0206864A simple null model for inferences from network enrichment analysis.Gustavo S JeukenLukas KällA prevailing technique to infer function from lists of identifications, from molecular biological high-throughput experiments, is over-representation analysis, where the identifications are compared to predefined sets of related genes often referred to as pathways. As at least some pathways are known to be incomplete in their annotation, algorithmic efforts have been made to complement them with information from functional association networks. While the terminology varies in the literature, we will here refer to such methods as Network Enrichment Analysis (NEA). Traditionally, the significance of inferences from NEA has been assigned using a null model constructed from randomizations of the network. Here we instead argue for a null model that more directly relates to the set of genes being studied, and have designed one dynamic programming algorithm that calculates the score distribution of NEA scores that makes it possible to assign unbiased mid p values to inferences. We also implemented a random sampling method, carrying out the same task. We demonstrate that our method obtains a superior statistical calibration as compared to the popular NEA inference engine, BinoX, while also providing statistics that are easier to interpret.http://europepmc.org/articles/PMC6226187?pdf=render
spellingShingle Gustavo S Jeuken
Lukas Käll
A simple null model for inferences from network enrichment analysis.
PLoS ONE
title A simple null model for inferences from network enrichment analysis.
title_full A simple null model for inferences from network enrichment analysis.
title_fullStr A simple null model for inferences from network enrichment analysis.
title_full_unstemmed A simple null model for inferences from network enrichment analysis.
title_short A simple null model for inferences from network enrichment analysis.
title_sort simple null model for inferences from network enrichment analysis
url http://europepmc.org/articles/PMC6226187?pdf=render
work_keys_str_mv AT gustavosjeuken asimplenullmodelforinferencesfromnetworkenrichmentanalysis
AT lukaskall asimplenullmodelforinferencesfromnetworkenrichmentanalysis
AT gustavosjeuken simplenullmodelforinferencesfromnetworkenrichmentanalysis
AT lukaskall simplenullmodelforinferencesfromnetworkenrichmentanalysis