Influenza trend prediction method combining Baidu index and support vector regression based on an improved particle swarm optimization algorithm

Web-based search query data have been recognized as valuable data sources for discovering new influenza epidemics. However, selecting search and query keywords and adopting prediction methods pose key challenges to improving the effectiveness of influenza prediction. In this study, web search data w...

Full description

Bibliographic Details
Main Authors: Hongxin Xue, Lingling Zhang, Haijian Liang, Liqun Kuang, Huiyan Han, Xiaowen Yang, Lei Guo
Format: Article
Language:English
Published: AIMS Press 2023-09-01
Series:AIMS Mathematics
Subjects:
Online Access:https://www.aimspress.com/article/doi/10.3934/math.20231303?viewType=HTML
_version_ 1797674895096676352
author Hongxin Xue
Lingling Zhang
Haijian Liang
Liqun Kuang
Huiyan Han
Xiaowen Yang
Lei Guo
author_facet Hongxin Xue
Lingling Zhang
Haijian Liang
Liqun Kuang
Huiyan Han
Xiaowen Yang
Lei Guo
author_sort Hongxin Xue
collection DOAJ
description Web-based search query data have been recognized as valuable data sources for discovering new influenza epidemics. However, selecting search and query keywords and adopting prediction methods pose key challenges to improving the effectiveness of influenza prediction. In this study, web search data were analyzed and excavated using big data and machine learning methods. The flu prediction model for the southern region of China, considering the impact of influenza transmission across regions and based on various keywords and historical influenza-like illness percentage (ILI%) data, was built (models 1–4) to verify the factors affecting the spread of the flu. To improve the accuracy of the influenza trend prediction, a support vector regression method based on an improved particle swarm optimization algorithm was proposed (IPSO-SVR), which was applied to the influenza prediction model to forecast ILI% in southern China. By comparing and analyzing the prediction results of each model, model 4, using the IPSO-SVR algorithm, exhibited higher prediction precision and more effective results, with its prediction indexes including the mean square error (MSE), root mean square error (RMSE) and mean absolute error (MAE) being 0.0596, 0.2441 and 0.1884, respectively. The experimental results show that the prediction precision significantly increased when the IPSO-SVR method was applied to the constructed ILI% model. A new theoretical basis and implementation strategy were provided for achieving more accurate influenza prevention and control in southern China.
first_indexed 2024-03-11T22:06:48Z
format Article
id doaj.art-4e2ce261d18b4c7588c4cedd9c4bf254
institution Directory Open Access Journal
issn 2473-6988
language English
last_indexed 2024-03-11T22:06:48Z
publishDate 2023-09-01
publisher AIMS Press
record_format Article
series AIMS Mathematics
spelling doaj.art-4e2ce261d18b4c7588c4cedd9c4bf2542023-09-25T01:37:29ZengAIMS PressAIMS Mathematics2473-69882023-09-0111255282554910.3934/math.20231303Influenza trend prediction method combining Baidu index and support vector regression based on an improved particle swarm optimization algorithmHongxin Xue 0Lingling Zhang1Haijian Liang 2Liqun Kuang 3Huiyan Han4Xiaowen Yang 5Lei Guo61. School of Computer Science and Technology, North University of China, Taiyuan, Shanxi 030051, China 2. Shanxi Province's Vision Information Processing and Intelligent Robot Engineering Research Center, Taiyuan, Shanxi 030051, China 3. Shanxi Key Laboratory of Machine Vision and Virtual Reality, Taiyuan, Shanxi 030051, China1. School of Computer Science and Technology, North University of China, Taiyuan, Shanxi 030051, China 2. Shanxi Province's Vision Information Processing and Intelligent Robot Engineering Research Center, Taiyuan, Shanxi 030051, China 3. Shanxi Key Laboratory of Machine Vision and Virtual Reality, Taiyuan, Shanxi 030051, China4. School of Software, North University of China, Taiyuan, Shanxi 030051, China1. School of Computer Science and Technology, North University of China, Taiyuan, Shanxi 030051, China 2. Shanxi Province's Vision Information Processing and Intelligent Robot Engineering Research Center, Taiyuan, Shanxi 030051, China 3. Shanxi Key Laboratory of Machine Vision and Virtual Reality, Taiyuan, Shanxi 030051, China1. School of Computer Science and Technology, North University of China, Taiyuan, Shanxi 030051, China 2. Shanxi Province's Vision Information Processing and Intelligent Robot Engineering Research Center, Taiyuan, Shanxi 030051, China 3. Shanxi Key Laboratory of Machine Vision and Virtual Reality, Taiyuan, Shanxi 030051, China1. School of Computer Science and Technology, North University of China, Taiyuan, Shanxi 030051, China 2. Shanxi Province's Vision Information Processing and Intelligent Robot Engineering Research Center, Taiyuan, Shanxi 030051, China 3. Shanxi Key Laboratory of Machine Vision and Virtual Reality, Taiyuan, Shanxi 030051, China1. School of Computer Science and Technology, North University of China, Taiyuan, Shanxi 030051, China 2. Shanxi Province's Vision Information Processing and Intelligent Robot Engineering Research Center, Taiyuan, Shanxi 030051, China 3. Shanxi Key Laboratory of Machine Vision and Virtual Reality, Taiyuan, Shanxi 030051, ChinaWeb-based search query data have been recognized as valuable data sources for discovering new influenza epidemics. However, selecting search and query keywords and adopting prediction methods pose key challenges to improving the effectiveness of influenza prediction. In this study, web search data were analyzed and excavated using big data and machine learning methods. The flu prediction model for the southern region of China, considering the impact of influenza transmission across regions and based on various keywords and historical influenza-like illness percentage (ILI%) data, was built (models 1–4) to verify the factors affecting the spread of the flu. To improve the accuracy of the influenza trend prediction, a support vector regression method based on an improved particle swarm optimization algorithm was proposed (IPSO-SVR), which was applied to the influenza prediction model to forecast ILI% in southern China. By comparing and analyzing the prediction results of each model, model 4, using the IPSO-SVR algorithm, exhibited higher prediction precision and more effective results, with its prediction indexes including the mean square error (MSE), root mean square error (RMSE) and mean absolute error (MAE) being 0.0596, 0.2441 and 0.1884, respectively. The experimental results show that the prediction precision significantly increased when the IPSO-SVR method was applied to the constructed ILI% model. A new theoretical basis and implementation strategy were provided for achieving more accurate influenza prevention and control in southern China.https://www.aimspress.com/article/doi/10.3934/math.20231303?viewType=HTMLinfluenzaweb search dataprediction modelprincipal component analysissupport vector regressionimproved pso algorithm
spellingShingle Hongxin Xue
Lingling Zhang
Haijian Liang
Liqun Kuang
Huiyan Han
Xiaowen Yang
Lei Guo
Influenza trend prediction method combining Baidu index and support vector regression based on an improved particle swarm optimization algorithm
AIMS Mathematics
influenza
web search data
prediction model
principal component analysis
support vector regression
improved pso algorithm
title Influenza trend prediction method combining Baidu index and support vector regression based on an improved particle swarm optimization algorithm
title_full Influenza trend prediction method combining Baidu index and support vector regression based on an improved particle swarm optimization algorithm
title_fullStr Influenza trend prediction method combining Baidu index and support vector regression based on an improved particle swarm optimization algorithm
title_full_unstemmed Influenza trend prediction method combining Baidu index and support vector regression based on an improved particle swarm optimization algorithm
title_short Influenza trend prediction method combining Baidu index and support vector regression based on an improved particle swarm optimization algorithm
title_sort influenza trend prediction method combining baidu index and support vector regression based on an improved particle swarm optimization algorithm
topic influenza
web search data
prediction model
principal component analysis
support vector regression
improved pso algorithm
url https://www.aimspress.com/article/doi/10.3934/math.20231303?viewType=HTML
work_keys_str_mv AT hongxinxue influenzatrendpredictionmethodcombiningbaiduindexandsupportvectorregressionbasedonanimprovedparticleswarmoptimizationalgorithm
AT linglingzhang influenzatrendpredictionmethodcombiningbaiduindexandsupportvectorregressionbasedonanimprovedparticleswarmoptimizationalgorithm
AT haijianliang influenzatrendpredictionmethodcombiningbaiduindexandsupportvectorregressionbasedonanimprovedparticleswarmoptimizationalgorithm
AT liqunkuang influenzatrendpredictionmethodcombiningbaiduindexandsupportvectorregressionbasedonanimprovedparticleswarmoptimizationalgorithm
AT huiyanhan influenzatrendpredictionmethodcombiningbaiduindexandsupportvectorregressionbasedonanimprovedparticleswarmoptimizationalgorithm
AT xiaowenyang influenzatrendpredictionmethodcombiningbaiduindexandsupportvectorregressionbasedonanimprovedparticleswarmoptimizationalgorithm
AT leiguo influenzatrendpredictionmethodcombiningbaiduindexandsupportvectorregressionbasedonanimprovedparticleswarmoptimizationalgorithm