On the importance of structural equivalence in temporal networks for epidemic forecasting

Abstract Understanding how a disease spreads in a population is a first step to preparing for future epidemics, and machine learning models are a useful tool to analyze the spreading process of infectious diseases. For effective predictions of these spreading processes, node embeddings are used to e...

Full description

Bibliographic Details
Main Authors: Pauline Kister, Leonardo Tonetto
Format: Article
Language:English
Published: Nature Portfolio 2023-01-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-023-28126-w
_version_ 1797945903121694720
author Pauline Kister
Leonardo Tonetto
author_facet Pauline Kister
Leonardo Tonetto
author_sort Pauline Kister
collection DOAJ
description Abstract Understanding how a disease spreads in a population is a first step to preparing for future epidemics, and machine learning models are a useful tool to analyze the spreading process of infectious diseases. For effective predictions of these spreading processes, node embeddings are used to encode networks based on the similarity between nodes into feature vectors, i.e., higher dimensional representations of human contacts. In this work, we evaluated the impact of homophily and structural equivalence on node2vec embedding for disease spread prediction by testing them on real world temporal human contact networks. Our results show that structural equivalence is a useful indicator for the infection status of a person. Embeddings that are balanced towards the preservation of structural equivalence performed better than those that focus on the preservation of homophily, with an average improvement of 0.1042 in the f1-score (95% CI 0.051 to 0.157). This indicates that structurally equivalent nodes behave similarly during an epidemic (e.g., expected time of a disease onset). This observation could greatly improve predictions of future epidemics where only partial information about contacts is known, thereby helping determine the risk of infection for different groups in the population.
first_indexed 2024-04-10T21:03:32Z
format Article
id doaj.art-c5cc54676c2045c89ecb4c59e32e48ae
institution Directory Open Access Journal
issn 2045-2322
language English
last_indexed 2024-04-10T21:03:32Z
publishDate 2023-01-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj.art-c5cc54676c2045c89ecb4c59e32e48ae2023-01-22T12:13:38ZengNature PortfolioScientific Reports2045-23222023-01-011311710.1038/s41598-023-28126-wOn the importance of structural equivalence in temporal networks for epidemic forecastingPauline Kister0Leonardo Tonetto1Technical University of MunichTechnical University of MunichAbstract Understanding how a disease spreads in a population is a first step to preparing for future epidemics, and machine learning models are a useful tool to analyze the spreading process of infectious diseases. For effective predictions of these spreading processes, node embeddings are used to encode networks based on the similarity between nodes into feature vectors, i.e., higher dimensional representations of human contacts. In this work, we evaluated the impact of homophily and structural equivalence on node2vec embedding for disease spread prediction by testing them on real world temporal human contact networks. Our results show that structural equivalence is a useful indicator for the infection status of a person. Embeddings that are balanced towards the preservation of structural equivalence performed better than those that focus on the preservation of homophily, with an average improvement of 0.1042 in the f1-score (95% CI 0.051 to 0.157). This indicates that structurally equivalent nodes behave similarly during an epidemic (e.g., expected time of a disease onset). This observation could greatly improve predictions of future epidemics where only partial information about contacts is known, thereby helping determine the risk of infection for different groups in the population.https://doi.org/10.1038/s41598-023-28126-w
spellingShingle Pauline Kister
Leonardo Tonetto
On the importance of structural equivalence in temporal networks for epidemic forecasting
Scientific Reports
title On the importance of structural equivalence in temporal networks for epidemic forecasting
title_full On the importance of structural equivalence in temporal networks for epidemic forecasting
title_fullStr On the importance of structural equivalence in temporal networks for epidemic forecasting
title_full_unstemmed On the importance of structural equivalence in temporal networks for epidemic forecasting
title_short On the importance of structural equivalence in temporal networks for epidemic forecasting
title_sort on the importance of structural equivalence in temporal networks for epidemic forecasting
url https://doi.org/10.1038/s41598-023-28126-w
work_keys_str_mv AT paulinekister ontheimportanceofstructuralequivalenceintemporalnetworksforepidemicforecasting
AT leonardotonetto ontheimportanceofstructuralequivalenceintemporalnetworksforepidemicforecasting