Interaction networks from discrete event data by Poisson multivariate mutual information estimation and information flow with applications from gene expression data

Abstract In this work, we introduce a new methodology for inferring the interaction structure of discrete valued time series which are Poisson distributed. While most related methods are premised on continuous state stochastic processes, in fact, discrete and counting event oriented stochastic proce...

Full description

Bibliographic Details
Main Authors: Jeremie Fish, Jie Sun, Erik Bollt
Format: Article
Language:English
Published: SpringerOpen 2022-10-01
Series:Applied Network Science
Subjects:
Online Access:https://doi.org/10.1007/s41109-022-00510-x
_version_ 1811182303583404032
author Jeremie Fish
Jie Sun
Erik Bollt
author_facet Jeremie Fish
Jie Sun
Erik Bollt
author_sort Jeremie Fish
collection DOAJ
description Abstract In this work, we introduce a new methodology for inferring the interaction structure of discrete valued time series which are Poisson distributed. While most related methods are premised on continuous state stochastic processes, in fact, discrete and counting event oriented stochastic process are natural and common, so called time-point processes. An important application that we focus on here is gene expression, where it is often assumed that the data is generated from a multivariate Poisson distribution. Nonparameteric methods such as the popular k-nearest neighbors are slow converging for discrete processes, and thus data hungry. Now, with the new multi-variate Poisson estimator developed here as the core computational engine, the causation entropy (CSE) principle, together with the associated greedy search algorithm optimal CSE (oCSE) allows us to efficiently infer the true network structure for this class of stochastic processes that were previously not practical. We illustrate the power of our method, first in benchmarking with synthetic datum, and then by inferring the genetic factors network from a breast cancer micro-ribonucleic acid sequence count data set. We show the Poisson oCSE gives the best performance among the tested methods and discovers previously known interactions on the breast cancer data set.
first_indexed 2024-04-11T09:30:29Z
format Article
id doaj.art-ad553e2829bd4528a09a999b49134299
institution Directory Open Access Journal
issn 2364-8228
language English
last_indexed 2024-04-11T09:30:29Z
publishDate 2022-10-01
publisher SpringerOpen
record_format Article
series Applied Network Science
spelling doaj.art-ad553e2829bd4528a09a999b491342992022-12-22T04:31:54ZengSpringerOpenApplied Network Science2364-82282022-10-017112210.1007/s41109-022-00510-xInteraction networks from discrete event data by Poisson multivariate mutual information estimation and information flow with applications from gene expression dataJeremie Fish0Jie Sun1Erik Bollt2Department of Electrical and Computer Engineering, Clarkson UniversityDepartment of Electrical and Computer Engineering, Clarkson UniversityDepartment of Electrical and Computer Engineering, Clarkson UniversityAbstract In this work, we introduce a new methodology for inferring the interaction structure of discrete valued time series which are Poisson distributed. While most related methods are premised on continuous state stochastic processes, in fact, discrete and counting event oriented stochastic process are natural and common, so called time-point processes. An important application that we focus on here is gene expression, where it is often assumed that the data is generated from a multivariate Poisson distribution. Nonparameteric methods such as the popular k-nearest neighbors are slow converging for discrete processes, and thus data hungry. Now, with the new multi-variate Poisson estimator developed here as the core computational engine, the causation entropy (CSE) principle, together with the associated greedy search algorithm optimal CSE (oCSE) allows us to efficiently infer the true network structure for this class of stochastic processes that were previously not practical. We illustrate the power of our method, first in benchmarking with synthetic datum, and then by inferring the genetic factors network from a breast cancer micro-ribonucleic acid sequence count data set. We show the Poisson oCSE gives the best performance among the tested methods and discovers previously known interactions on the breast cancer data set.https://doi.org/10.1007/s41109-022-00510-xNetwork inferencePoisson distributionConditional mutual informationInformation theory
spellingShingle Jeremie Fish
Jie Sun
Erik Bollt
Interaction networks from discrete event data by Poisson multivariate mutual information estimation and information flow with applications from gene expression data
Applied Network Science
Network inference
Poisson distribution
Conditional mutual information
Information theory
title Interaction networks from discrete event data by Poisson multivariate mutual information estimation and information flow with applications from gene expression data
title_full Interaction networks from discrete event data by Poisson multivariate mutual information estimation and information flow with applications from gene expression data
title_fullStr Interaction networks from discrete event data by Poisson multivariate mutual information estimation and information flow with applications from gene expression data
title_full_unstemmed Interaction networks from discrete event data by Poisson multivariate mutual information estimation and information flow with applications from gene expression data
title_short Interaction networks from discrete event data by Poisson multivariate mutual information estimation and information flow with applications from gene expression data
title_sort interaction networks from discrete event data by poisson multivariate mutual information estimation and information flow with applications from gene expression data
topic Network inference
Poisson distribution
Conditional mutual information
Information theory
url https://doi.org/10.1007/s41109-022-00510-x
work_keys_str_mv AT jeremiefish interactionnetworksfromdiscreteeventdatabypoissonmultivariatemutualinformationestimationandinformationflowwithapplicationsfromgeneexpressiondata
AT jiesun interactionnetworksfromdiscreteeventdatabypoissonmultivariatemutualinformationestimationandinformationflowwithapplicationsfromgeneexpressiondata
AT erikbollt interactionnetworksfromdiscreteeventdatabypoissonmultivariatemutualinformationestimationandinformationflowwithapplicationsfromgeneexpressiondata