Compressing network populations with modal networks reveal structural diversity

Abstract Analyzing relational data consisting of multiple samples or layers involves critical challenges: How many networks are required to capture the variety of structures in the data? And what are the structures of these representative networks? We describe efficient nonparametric methods derived...

Full description

Bibliographic Details
Main Authors: Alec Kirkley, Alexis Rojas, Martin Rosvall, Jean-Gabriel Young
Format: Article
Language:English
Published: Nature Portfolio 2023-06-01
Series:Communications Physics
Online Access:https://doi.org/10.1038/s42005-023-01270-5
_version_ 1797795669564456960
author Alec Kirkley
Alexis Rojas
Martin Rosvall
Jean-Gabriel Young
author_facet Alec Kirkley
Alexis Rojas
Martin Rosvall
Jean-Gabriel Young
author_sort Alec Kirkley
collection DOAJ
description Abstract Analyzing relational data consisting of multiple samples or layers involves critical challenges: How many networks are required to capture the variety of structures in the data? And what are the structures of these representative networks? We describe efficient nonparametric methods derived from the minimum description length principle to construct the network representations automatically. The methods input a population of networks or a multilayer network measured on a fixed set of nodes and output a small set of representative networks together with an assignment of each network sample or layer to one of the representative networks. We identify the representative networks and assign network samples to them with an efficient Monte Carlo scheme that minimizes our description length objective. For temporally ordered networks, we use a polynomial time dynamic programming approach that restricts the clusters of network layers to be temporally contiguous. These methods recover planted heterogeneity in synthetic network populations and identify essential structural heterogeneities in global trade and fossil record networks. Our methods are principled, scalable, parameter-free, and accommodate a wide range of data, providing a unified lens for exploratory analyses and preprocessing large sets of network samples.
first_indexed 2024-03-13T03:21:30Z
format Article
id doaj.art-bb24eb7b820c400cb527508963dd9b2b
institution Directory Open Access Journal
issn 2399-3650
language English
last_indexed 2024-03-13T03:21:30Z
publishDate 2023-06-01
publisher Nature Portfolio
record_format Article
series Communications Physics
spelling doaj.art-bb24eb7b820c400cb527508963dd9b2b2023-06-25T11:19:28ZengNature PortfolioCommunications Physics2399-36502023-06-016111010.1038/s42005-023-01270-5Compressing network populations with modal networks reveal structural diversityAlec Kirkley0Alexis Rojas1Martin Rosvall2Jean-Gabriel Young3Institute of Data Science, University of Hong KongDepartment of Computer Science, University of HelsinkiIntegrated Science Lab, Department of Physics, Umea UniversityDepartment of Mathematics and Statistics, University of VermontAbstract Analyzing relational data consisting of multiple samples or layers involves critical challenges: How many networks are required to capture the variety of structures in the data? And what are the structures of these representative networks? We describe efficient nonparametric methods derived from the minimum description length principle to construct the network representations automatically. The methods input a population of networks or a multilayer network measured on a fixed set of nodes and output a small set of representative networks together with an assignment of each network sample or layer to one of the representative networks. We identify the representative networks and assign network samples to them with an efficient Monte Carlo scheme that minimizes our description length objective. For temporally ordered networks, we use a polynomial time dynamic programming approach that restricts the clusters of network layers to be temporally contiguous. These methods recover planted heterogeneity in synthetic network populations and identify essential structural heterogeneities in global trade and fossil record networks. Our methods are principled, scalable, parameter-free, and accommodate a wide range of data, providing a unified lens for exploratory analyses and preprocessing large sets of network samples.https://doi.org/10.1038/s42005-023-01270-5
spellingShingle Alec Kirkley
Alexis Rojas
Martin Rosvall
Jean-Gabriel Young
Compressing network populations with modal networks reveal structural diversity
Communications Physics
title Compressing network populations with modal networks reveal structural diversity
title_full Compressing network populations with modal networks reveal structural diversity
title_fullStr Compressing network populations with modal networks reveal structural diversity
title_full_unstemmed Compressing network populations with modal networks reveal structural diversity
title_short Compressing network populations with modal networks reveal structural diversity
title_sort compressing network populations with modal networks reveal structural diversity
url https://doi.org/10.1038/s42005-023-01270-5
work_keys_str_mv AT aleckirkley compressingnetworkpopulationswithmodalnetworksrevealstructuraldiversity
AT alexisrojas compressingnetworkpopulationswithmodalnetworksrevealstructuraldiversity
AT martinrosvall compressingnetworkpopulationswithmodalnetworksrevealstructuraldiversity
AT jeangabrielyoung compressingnetworkpopulationswithmodalnetworksrevealstructuraldiversity