The prediction of influenza-like illness using national influenza surveillance data and Baidu query data
Abstract Background Seasonal influenza and other respiratory tract infections are serious public health problems that need to be further addressed and investigated. Internet search data are recognized as a valuable source for forecasting influenza or other respiratory tract infection epidemics. Howe...
Main Authors: | , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2024-02-01
|
Series: | BMC Public Health |
Subjects: | |
Online Access: | https://doi.org/10.1186/s12889-024-17978-0 |
_version_ | 1797272869298765824 |
---|---|
author | Su wei Sun Lin Zhao wenjing Song Shaoxia Yang Yuejie He Yujie Zhang Shu Li Zhong Liu Ti |
author_facet | Su wei Sun Lin Zhao wenjing Song Shaoxia Yang Yuejie He Yujie Zhang Shu Li Zhong Liu Ti |
author_sort | Su wei |
collection | DOAJ |
description | Abstract Background Seasonal influenza and other respiratory tract infections are serious public health problems that need to be further addressed and investigated. Internet search data are recognized as a valuable source for forecasting influenza or other respiratory tract infection epidemics. However, the selection of internet search data and the application of forecasting methods are important for improving forecasting accuracy. The aim of the present study was to forecast influenza epidemics based on the long short-term memory neural network (LSTM) method, Baidu search index data, and the influenza-like-illness (ILI) rate. Methods The official weekly ILI% data for northern and southern mainland China were obtained from the Chinese Influenza Center from 2018 to 2021. Based on the Baidu Index, search indices related to influenza infection over the corresponding time period were obtained. Pearson correlation analysis was performed to explore the association between influenza-related search queries and the ILI% of southern and northern mainland China. The LSTM model was used to forecast the influenza epidemic within the same week and at lags of 1–4 weeks. The model performance was assessed by evaluation metrics, including the mean square error (MSE), root mean square error (RMSE) and mean absolute error (MAE). Results In total, 24 search queries in northern mainland China and 7 search queries in southern mainland China were found to be correlated and were used to construct the LSTM model, which included the same week and a lag of 1–4 weeks. The LSTM model showed that ILI% + mask with one lag week and ILI% + influenza name were good prediction modules, with reduced RMSE predictions of 16.75% and 4.20%, respectively, compared with the estimated ILI% for northern and southern mainland China. Conclusions The results illuminate the feasibility of using an internet search index as a complementary data source for influenza forecasting and the efficiency of using the LSTM model to forecast influenza epidemics. |
first_indexed | 2024-03-07T14:35:29Z |
format | Article |
id | doaj.art-01c2f51464ae4ca297ccce36fa64d868 |
institution | Directory Open Access Journal |
issn | 1471-2458 |
language | English |
last_indexed | 2024-03-07T14:35:29Z |
publishDate | 2024-02-01 |
publisher | BMC |
record_format | Article |
series | BMC Public Health |
spelling | doaj.art-01c2f51464ae4ca297ccce36fa64d8682024-03-05T20:38:44ZengBMCBMC Public Health1471-24582024-02-0124111210.1186/s12889-024-17978-0The prediction of influenza-like illness using national influenza surveillance data and Baidu query dataSu wei0Sun Lin1Zhao wenjing2Song Shaoxia3Yang Yuejie4He Yujie5Zhang Shu6Li Zhong7Liu Ti8School of Management Science and Engineering, Shandong University of Finance and EconomicsShandong Center for Disease Control and Prevention, Shandong Provincial Key Laboratory of Infectious Disease Control and Prevention, Shandong University Institution for Prevention MedicineDezhou Center for Disease Control and PreventionShandong Center for Disease Control and Prevention, Shandong Provincial Key Laboratory of Infectious Disease Control and Prevention, Shandong University Institution for Prevention MedicineChina Institute of Water Resources and Hydropower ResearchShandong Center for Disease Control and Prevention, Shandong Provincial Key Laboratory of Infectious Disease Control and Prevention, Shandong University Institution for Prevention MedicineShandong Center for Disease Control and Prevention, Shandong Provincial Key Laboratory of Infectious Disease Control and Prevention, Shandong University Institution for Prevention MedicineShandong Center for Disease Control and Prevention, Shandong Provincial Key Laboratory of Infectious Disease Control and Prevention, Shandong University Institution for Prevention MedicineShandong Center for Disease Control and Prevention, Shandong Provincial Key Laboratory of Infectious Disease Control and Prevention, Shandong University Institution for Prevention MedicineAbstract Background Seasonal influenza and other respiratory tract infections are serious public health problems that need to be further addressed and investigated. Internet search data are recognized as a valuable source for forecasting influenza or other respiratory tract infection epidemics. However, the selection of internet search data and the application of forecasting methods are important for improving forecasting accuracy. The aim of the present study was to forecast influenza epidemics based on the long short-term memory neural network (LSTM) method, Baidu search index data, and the influenza-like-illness (ILI) rate. Methods The official weekly ILI% data for northern and southern mainland China were obtained from the Chinese Influenza Center from 2018 to 2021. Based on the Baidu Index, search indices related to influenza infection over the corresponding time period were obtained. Pearson correlation analysis was performed to explore the association between influenza-related search queries and the ILI% of southern and northern mainland China. The LSTM model was used to forecast the influenza epidemic within the same week and at lags of 1–4 weeks. The model performance was assessed by evaluation metrics, including the mean square error (MSE), root mean square error (RMSE) and mean absolute error (MAE). Results In total, 24 search queries in northern mainland China and 7 search queries in southern mainland China were found to be correlated and were used to construct the LSTM model, which included the same week and a lag of 1–4 weeks. The LSTM model showed that ILI% + mask with one lag week and ILI% + influenza name were good prediction modules, with reduced RMSE predictions of 16.75% and 4.20%, respectively, compared with the estimated ILI% for northern and southern mainland China. Conclusions The results illuminate the feasibility of using an internet search index as a complementary data source for influenza forecasting and the efficiency of using the LSTM model to forecast influenza epidemics.https://doi.org/10.1186/s12889-024-17978-0InfluenzaForecastLSTMBaidu search index |
spellingShingle | Su wei Sun Lin Zhao wenjing Song Shaoxia Yang Yuejie He Yujie Zhang Shu Li Zhong Liu Ti The prediction of influenza-like illness using national influenza surveillance data and Baidu query data BMC Public Health Influenza Forecast LSTM Baidu search index |
title | The prediction of influenza-like illness using national influenza surveillance data and Baidu query data |
title_full | The prediction of influenza-like illness using national influenza surveillance data and Baidu query data |
title_fullStr | The prediction of influenza-like illness using national influenza surveillance data and Baidu query data |
title_full_unstemmed | The prediction of influenza-like illness using national influenza surveillance data and Baidu query data |
title_short | The prediction of influenza-like illness using national influenza surveillance data and Baidu query data |
title_sort | prediction of influenza like illness using national influenza surveillance data and baidu query data |
topic | Influenza Forecast LSTM Baidu search index |
url | https://doi.org/10.1186/s12889-024-17978-0 |
work_keys_str_mv | AT suwei thepredictionofinfluenzalikeillnessusingnationalinfluenzasurveillancedataandbaiduquerydata AT sunlin thepredictionofinfluenzalikeillnessusingnationalinfluenzasurveillancedataandbaiduquerydata AT zhaowenjing thepredictionofinfluenzalikeillnessusingnationalinfluenzasurveillancedataandbaiduquerydata AT songshaoxia thepredictionofinfluenzalikeillnessusingnationalinfluenzasurveillancedataandbaiduquerydata AT yangyuejie thepredictionofinfluenzalikeillnessusingnationalinfluenzasurveillancedataandbaiduquerydata AT heyujie thepredictionofinfluenzalikeillnessusingnationalinfluenzasurveillancedataandbaiduquerydata AT zhangshu thepredictionofinfluenzalikeillnessusingnationalinfluenzasurveillancedataandbaiduquerydata AT lizhong thepredictionofinfluenzalikeillnessusingnationalinfluenzasurveillancedataandbaiduquerydata AT liuti thepredictionofinfluenzalikeillnessusingnationalinfluenzasurveillancedataandbaiduquerydata AT suwei predictionofinfluenzalikeillnessusingnationalinfluenzasurveillancedataandbaiduquerydata AT sunlin predictionofinfluenzalikeillnessusingnationalinfluenzasurveillancedataandbaiduquerydata AT zhaowenjing predictionofinfluenzalikeillnessusingnationalinfluenzasurveillancedataandbaiduquerydata AT songshaoxia predictionofinfluenzalikeillnessusingnationalinfluenzasurveillancedataandbaiduquerydata AT yangyuejie predictionofinfluenzalikeillnessusingnationalinfluenzasurveillancedataandbaiduquerydata AT heyujie predictionofinfluenzalikeillnessusingnationalinfluenzasurveillancedataandbaiduquerydata AT zhangshu predictionofinfluenzalikeillnessusingnationalinfluenzasurveillancedataandbaiduquerydata AT lizhong predictionofinfluenzalikeillnessusingnationalinfluenzasurveillancedataandbaiduquerydata AT liuti predictionofinfluenzalikeillnessusingnationalinfluenzasurveillancedataandbaiduquerydata |