Models for COVID-19 Data Prediction Based on Improved LSTM-ARIMA Algorithms

The global repercussions of the COVID-19 pandemic on economies and public health worldwide have been profound. This study aims to examine the developmental trends of the COVID-19 pandemic, establish predictive models, and provide insights for effective control measures against potential future disea...


Bibliographic Details
Main Authors: Yong-Chao Jin, Qian Cao, Qian Sun, Ye Lin, Dong-Mei Liu, Shan-Yu, Chen-Xi Wang, Xiao-Ling Wang, Xi-Yin Wang
Format: Article
Language: English
Published: IEEE 2024-01-01
Series: IEEE Access
Subjects: COVID-19, LSTM, ARIMA, CNN-LSTM-ARIMA, TCN-LSTM-ARIMA, SSA-LSTM-ARIMA
Online Access: https://ieeexplore.ieee.org/document/10374119/
author Yong-Chao Jin
Qian Cao
Qian Sun
Ye Lin
Dong-Mei Liu
Shan-Yu
Chen-Xi Wang
Xiao-Ling Wang
Xi-Yin Wang
author_sort Yong-Chao Jin
collection DOAJ
description The COVID-19 pandemic has had profound repercussions on economies and public health worldwide. This study aims to examine the developmental trends of the COVID-19 pandemic, establish predictive models, and provide insights for effective control measures against potential future disease outbreaks. Because COVID-19 data contain both linear and nonlinear factors, conventional single machine learning models and traditional forecasting models struggle to predict pandemic trends accurately. To improve the precision of COVID-19 pandemic predictions by integrating linear and nonlinear factors, this study proposes three combined forecasting models: CNN-LSTM-ARIMA, TCN-LSTM-ARIMA, and SSA-LSTM-ARIMA. These models leverage the strengths of deep learning in capturing nonlinear factors and the capabilities of the traditional ARIMA model in handling linear factors. Initially, LSTM and ARIMA models are used to model and predict the COVID-19 pandemic in Quebec, Canada. Subsequently, CNN models, TCN models, and the Sparrow Search Algorithm are employed to integrate the predictions from the LSTM and ARIMA models. A comparative analysis of the three combined models shows that the CNN-LSTM-ARIMA model exhibits the highest predictive accuracy, with an MSE of 7048.26, RMSE of 83.95, MAE of 61.18, MAPE of 0.16, and $R^{2}$ of 0.95. To validate the applicability and stability of the CNN-LSTM-ARIMA model in predicting COVID-19 pandemics, Italian COVID-19 pandemic data are employed: the three combined forecasting models are established on these data and evaluated using model evaluation metrics. The results affirm that the CNN-LSTM-ARIMA model remains the optimal choice, underscoring its high stability and suitability for COVID-19 pandemic forecasting.
first_indexed 2024-03-08T15:35:40Z
format Article
id doaj.art-730251d920714375a50a4f5fc10e94d1
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-03-08T15:35:40Z
publishDate 2024-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-730251d920714375a50a4f5fc10e94d1
2024-01-10T00:05:54Z
eng
IEEE
IEEE Access
2169-3536
2024-01-01
12
3981
3991
10.1109/ACCESS.2023.3347403
10374119
Models for COVID-19 Data Prediction Based on Improved LSTM-ARIMA Algorithms
Yong-Chao Jin 0 https://orcid.org/0000-0001-9528-5720
Qian Cao 1
Qian Sun 2
Ye Lin 3
Dong-Mei Liu 4
Shan-Yu 5
Chen-Xi Wang 6 https://orcid.org/0000-0003-1357-4823
Xiao-Ling Wang 7 https://orcid.org/0009-0000-8718-8023
Xi-Yin Wang 8 https://orcid.org/0000-0003-3454-0374
College of Science, North China University of Science and Technology, Tangshan, China (listed once for each of the nine authors)
The COVID-19 pandemic has had profound repercussions on economies and public health worldwide. This study aims to examine the developmental trends of the COVID-19 pandemic, establish predictive models, and provide insights for effective control measures against potential future disease outbreaks. Because COVID-19 data contain both linear and nonlinear factors, conventional single machine learning models and traditional forecasting models struggle to predict pandemic trends accurately. To improve the precision of COVID-19 pandemic predictions by integrating linear and nonlinear factors, this study proposes three combined forecasting models: CNN-LSTM-ARIMA, TCN-LSTM-ARIMA, and SSA-LSTM-ARIMA. These models leverage the strengths of deep learning in capturing nonlinear factors and the capabilities of the traditional ARIMA model in handling linear factors. Initially, LSTM and ARIMA models are used to model and predict the COVID-19 pandemic in Quebec, Canada. Subsequently, CNN models, TCN models, and the Sparrow Search Algorithm are employed to integrate the predictions from the LSTM and ARIMA models. A comparative analysis of the three combined models shows that the CNN-LSTM-ARIMA model exhibits the highest predictive accuracy, with an MSE of 7048.26, RMSE of 83.95, MAE of 61.18, MAPE of 0.16, and $R^{2}$ of 0.95. To validate the applicability and stability of the CNN-LSTM-ARIMA model in predicting COVID-19 pandemics, Italian COVID-19 pandemic data are employed: the three combined forecasting models are established on these data and evaluated using model evaluation metrics. The results affirm that the CNN-LSTM-ARIMA model remains the optimal choice, underscoring its high stability and suitability for COVID-19 pandemic forecasting.
https://ieeexplore.ieee.org/document/10374119/
COVID-19
LSTM
ARIMA
CNN-LSTM-ARIMA
TCN-LSTM-ARIMA
SSA-LSTM-ARIMA
title Models for COVID-19 Data Prediction Based on Improved LSTM-ARIMA Algorithms
topic COVID-19
LSTM
ARIMA
CNN-LSTM-ARIMA
TCN-LSTM-ARIMA
SSA-LSTM-ARIMA
url https://ieeexplore.ieee.org/document/10374119/