LSTM-based recurrent neural network provides effective short term flu forecasting

Abstract Background Influenza virus is responsible for a yearly epidemic in much of the world. To better predict short-term, seasonal variations in flu infection rates and possible mechanisms of yearly infection variation, we trained a Long Short-Term Memory (LSTM)-based deep neural network on histo...

Full description

Bibliographic Details
Main Authors: Alfred B. Amendolara, David Sant, Horacio G. Rotstein, Eric Fortune
Format: Article
Language:English
Published: BMC 2023-09-01
Series:BMC Public Health
Subjects:
Online Access:https://doi.org/10.1186/s12889-023-16720-6
_version_ 1797451260285157376
author Alfred B. Amendolara
David Sant
Horacio G. Rotstein
Eric Fortune
author_facet Alfred B. Amendolara
David Sant
Horacio G. Rotstein
Eric Fortune
author_sort Alfred B. Amendolara
collection DOAJ
description Abstract Background Influenza virus is responsible for a yearly epidemic in much of the world. To better predict short-term, seasonal variations in flu infection rates and possible mechanisms of yearly infection variation, we trained a Long Short-Term Memory (LSTM)-based deep neural network on historical Influenza-Like-Illness (ILI), climate, and population data. Methods Data were collected from the Centers for Disease Control and Prevention (CDC), the National Center for Environmental Information (NCEI), and the United States Census Bureau. The model was initially built in Python using the Keras API and tuned manually. We explored the roles of temperature, precipitation, local wind speed, population size, vaccination rate, and vaccination efficacy. The model was validated using K-fold cross validation as well as forward chaining cross validation and compared to several standard algorithms. Finally, simulation data was generated in R and used for further exploration of the model. Results We found that temperature is the strongest predictor of ILI rates, but also found that precipitation increased the predictive power of the network. Additionally, the proposed model achieved a +1 week prediction mean absolute error (MAE) of 0.1973. This is less than half of the MAE achieved by the next best performing algorithm. Additionally, the model accurately predicted simulation data. To test the role of temperature in the network, we phase-shifted temperature in time and found a predictable reduction in prediction accuracy. Conclusions The results of this study suggest that short term flu forecasting may be effectively accomplished using architectures traditionally reserved for time series analysis. The proposed LSTM-based model was able to outperform comparison models at the +1 week time point. Additionally, this model provided insight into the week-to-week effects of climatic and biotic factors and revealed potential patterns in data series. Specifically, we found that temperature is the strongest predictor of seasonal flu infection rates. This information may prove to be especially important for flu forecasting given the uncertain long-term impact of the SARS-CoV-2 pandemic on seasonal influenza.
first_indexed 2024-03-09T14:51:08Z
format Article
id doaj.art-3b8de653d8d44391abcfa0f960bb1d30
institution Directory Open Access Journal
issn 1471-2458
language English
last_indexed 2024-03-09T14:51:08Z
publishDate 2023-09-01
publisher BMC
record_format Article
series BMC Public Health
spelling doaj.art-3b8de653d8d44391abcfa0f960bb1d302023-11-26T14:26:01ZengBMCBMC Public Health1471-24582023-09-0123111110.1186/s12889-023-16720-6LSTM-based recurrent neural network provides effective short term flu forecastingAlfred B. Amendolara0David Sant1Horacio G. Rotstein2Eric Fortune3Department of Biomedical Science, Noorda College of Osteopathic MedicineDepartment of Biomedical Science, Noorda College of Osteopathic MedicineFederated Department of Biology, New Jersey Institute of TechnologyFederated Department of Biology, New Jersey Institute of TechnologyAbstract Background Influenza virus is responsible for a yearly epidemic in much of the world. To better predict short-term, seasonal variations in flu infection rates and possible mechanisms of yearly infection variation, we trained a Long Short-Term Memory (LSTM)-based deep neural network on historical Influenza-Like-Illness (ILI), climate, and population data. Methods Data were collected from the Centers for Disease Control and Prevention (CDC), the National Center for Environmental Information (NCEI), and the United States Census Bureau. The model was initially built in Python using the Keras API and tuned manually. We explored the roles of temperature, precipitation, local wind speed, population size, vaccination rate, and vaccination efficacy. The model was validated using K-fold cross validation as well as forward chaining cross validation and compared to several standard algorithms. Finally, simulation data was generated in R and used for further exploration of the model. Results We found that temperature is the strongest predictor of ILI rates, but also found that precipitation increased the predictive power of the network. Additionally, the proposed model achieved a +1 week prediction mean absolute error (MAE) of 0.1973. This is less than half of the MAE achieved by the next best performing algorithm. Additionally, the model accurately predicted simulation data. To test the role of temperature in the network, we phase-shifted temperature in time and found a predictable reduction in prediction accuracy. Conclusions The results of this study suggest that short term flu forecasting may be effectively accomplished using architectures traditionally reserved for time series analysis. The proposed LSTM-based model was able to outperform comparison models at the +1 week time point. Additionally, this model provided insight into the week-to-week effects of climatic and biotic factors and revealed potential patterns in data series. Specifically, we found that temperature is the strongest predictor of seasonal flu infection rates. This information may prove to be especially important for flu forecasting given the uncertain long-term impact of the SARS-CoV-2 pandemic on seasonal influenza.https://doi.org/10.1186/s12889-023-16720-6InfluenzaModelingMachine learningLSTMEpidemiology
spellingShingle Alfred B. Amendolara
David Sant
Horacio G. Rotstein
Eric Fortune
LSTM-based recurrent neural network provides effective short term flu forecasting
BMC Public Health
Influenza
Modeling
Machine learning
LSTM
Epidemiology
title LSTM-based recurrent neural network provides effective short term flu forecasting
title_full LSTM-based recurrent neural network provides effective short term flu forecasting
title_fullStr LSTM-based recurrent neural network provides effective short term flu forecasting
title_full_unstemmed LSTM-based recurrent neural network provides effective short term flu forecasting
title_short LSTM-based recurrent neural network provides effective short term flu forecasting
title_sort lstm based recurrent neural network provides effective short term flu forecasting
topic Influenza
Modeling
Machine learning
LSTM
Epidemiology
url https://doi.org/10.1186/s12889-023-16720-6
work_keys_str_mv AT alfredbamendolara lstmbasedrecurrentneuralnetworkprovideseffectiveshorttermfluforecasting
AT davidsant lstmbasedrecurrentneuralnetworkprovideseffectiveshorttermfluforecasting
AT horaciogrotstein lstmbasedrecurrentneuralnetworkprovideseffectiveshorttermfluforecasting
AT ericfortune lstmbasedrecurrentneuralnetworkprovideseffectiveshorttermfluforecasting