Models for COVID-19 Data Prediction Based on Improved LSTM-ARIMA Algorithms
The global repercussions of the COVID-19 pandemic on economies and public health worldwide have been profound. This study aims to examine the developmental trends of the COVID-19 pandemic, establish predictive models, and provide insights for effective control measures against potential future disea...
Main Authors: | , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2024-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/10374119/ |
_version_ | 1797360185980747776 |
---|---|
author | Yong-Chao Jin Qian Cao Qian Sun Ye Lin Dong-Mei Liu Shan-Yu Chen-Xi Wang Xiao-Ling Wang Xi-Yin Wang |
author_facet | Yong-Chao Jin Qian Cao Qian Sun Ye Lin Dong-Mei Liu Shan-Yu Chen-Xi Wang Xiao-Ling Wang Xi-Yin Wang |
author_sort | Yong-Chao Jin |
collection | DOAJ |
description | The global repercussions of the COVID-19 pandemic on economies and public health worldwide have been profound. This study aims to examine the developmental trends of the COVID-19 pandemic, establish predictive models, and provide insights for effective control measures against potential future disease outbreaks. Considering the coexistence of both linear and nonlinear factors in COVID-19 data, conventional single-machine learning and traditional forecasting models encounter challenges in accurately predicting pandemic trends. To enhance the precision of COVID-19 pandemic predictions by integrating linear and nonlinear factors, this study proposes three combined forecasting models: CNN-LSTM-ARIMA, TCN-LSTM-ARIMA, and SSA-LSTM-ARIMA. These models leverage the strengths of deep learning in capturing nonlinear factors and the capabilities of the traditional ARIMA model in handling linear factors. Initially, LSTM and ARIMA models are used to model and predict the COVID-19 pandemic in Quebec, Canada. Subsequently, CNN models, TCN models, and the Sparrow Search Algorithm are employed to integrate predictions from the LSTM and ARIMA models. Comparative analyses of the three combined models, it was found that the CNN-LSTM-ARIMA model exhibits the highest predictive accuracy, with an MSE of 7048.26, RMSE of 83.95, MAE of 61.18, MAPE of 0.16, and <inline-formula> <tex-math notation="LaTeX">$R^{2}$ </tex-math></inline-formula> of 0.95. To validate the applicability and stability of the CNN-LSTM-ARIMA model in predicting COVID-19 pandemics, Italian COVID-19 pandemic data was employed. The three combined forecasting models are established and evaluated using model evaluation metrics. The results affirm that the CNN-LSTM-ARIMA model remains the optimal choice, underscoring its high stability and suitability for COVID-19 pandemic forecasting endeavors. |
first_indexed | 2024-03-08T15:35:40Z |
format | Article |
id | doaj.art-730251d920714375a50a4f5fc10e94d1 |
institution | Directory Open Access Journal |
issn | 2169-3536 |
language | English |
last_indexed | 2024-03-08T15:35:40Z |
publishDate | 2024-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Access |
spelling | doaj.art-730251d920714375a50a4f5fc10e94d12024-01-10T00:05:54ZengIEEEIEEE Access2169-35362024-01-01123981399110.1109/ACCESS.2023.334740310374119Models for COVID-19 Data Prediction Based on Improved LSTM-ARIMA AlgorithmsYong-Chao Jin0https://orcid.org/0000-0001-9528-5720Qian Cao1Qian Sun2Ye Lin3Dong-Mei Liu4 Shan-Yu5Chen-Xi Wang6https://orcid.org/0000-0003-1357-4823Xiao-Ling Wang7https://orcid.org/0009-0000-8718-8023Xi-Yin Wang8https://orcid.org/0000-0003-3454-0374College of Science, North China University of Science and Technology, Tangshan, ChinaCollege of Science, North China University of Science and Technology, Tangshan, ChinaCollege of Science, North China University of Science and Technology, Tangshan, ChinaCollege of Science, North China University of Science and Technology, Tangshan, ChinaCollege of Science, North China University of Science and Technology, Tangshan, ChinaCollege of Science, North China University of Science and Technology, Tangshan, ChinaCollege of Science, North China University of Science and Technology, Tangshan, ChinaCollege of Science, North China University of Science and Technology, Tangshan, ChinaCollege of Science, North China University of Science and Technology, Tangshan, ChinaThe global repercussions of the COVID-19 pandemic on economies and public health worldwide have been profound. This study aims to examine the developmental trends of the COVID-19 pandemic, establish predictive models, and provide insights for effective control measures against potential future disease outbreaks. Considering the coexistence of both linear and nonlinear factors in COVID-19 data, conventional single-machine learning and traditional forecasting models encounter challenges in accurately predicting pandemic trends. To enhance the precision of COVID-19 pandemic predictions by integrating linear and nonlinear factors, this study proposes three combined forecasting models: CNN-LSTM-ARIMA, TCN-LSTM-ARIMA, and SSA-LSTM-ARIMA. These models leverage the strengths of deep learning in capturing nonlinear factors and the capabilities of the traditional ARIMA model in handling linear factors. Initially, LSTM and ARIMA models are used to model and predict the COVID-19 pandemic in Quebec, Canada. Subsequently, CNN models, TCN models, and the Sparrow Search Algorithm are employed to integrate predictions from the LSTM and ARIMA models. Comparative analyses of the three combined models, it was found that the CNN-LSTM-ARIMA model exhibits the highest predictive accuracy, with an MSE of 7048.26, RMSE of 83.95, MAE of 61.18, MAPE of 0.16, and <inline-formula> <tex-math notation="LaTeX">$R^{2}$ </tex-math></inline-formula> of 0.95. To validate the applicability and stability of the CNN-LSTM-ARIMA model in predicting COVID-19 pandemics, Italian COVID-19 pandemic data was employed. The three combined forecasting models are established and evaluated using model evaluation metrics. The results affirm that the CNN-LSTM-ARIMA model remains the optimal choice, underscoring its high stability and suitability for COVID-19 pandemic forecasting endeavors.https://ieeexplore.ieee.org/document/10374119/COVID-19LSTMARIMACNN-LSTM-ARIMATCN-LSTM-ARIMASSA-LSTM-ARIMA |
spellingShingle | Yong-Chao Jin Qian Cao Qian Sun Ye Lin Dong-Mei Liu Shan-Yu Chen-Xi Wang Xiao-Ling Wang Xi-Yin Wang Models for COVID-19 Data Prediction Based on Improved LSTM-ARIMA Algorithms IEEE Access COVID-19 LSTM ARIMA CNN-LSTM-ARIMA TCN-LSTM-ARIMA SSA-LSTM-ARIMA |
title | Models for COVID-19 Data Prediction Based on Improved LSTM-ARIMA Algorithms |
title_full | Models for COVID-19 Data Prediction Based on Improved LSTM-ARIMA Algorithms |
title_fullStr | Models for COVID-19 Data Prediction Based on Improved LSTM-ARIMA Algorithms |
title_full_unstemmed | Models for COVID-19 Data Prediction Based on Improved LSTM-ARIMA Algorithms |
title_short | Models for COVID-19 Data Prediction Based on Improved LSTM-ARIMA Algorithms |
title_sort | models for covid 19 data prediction based on improved lstm arima algorithms |
topic | COVID-19 LSTM ARIMA CNN-LSTM-ARIMA TCN-LSTM-ARIMA SSA-LSTM-ARIMA |
url | https://ieeexplore.ieee.org/document/10374119/ |
work_keys_str_mv | AT yongchaojin modelsforcovid19datapredictionbasedonimprovedlstmarimaalgorithms AT qiancao modelsforcovid19datapredictionbasedonimprovedlstmarimaalgorithms AT qiansun modelsforcovid19datapredictionbasedonimprovedlstmarimaalgorithms AT yelin modelsforcovid19datapredictionbasedonimprovedlstmarimaalgorithms AT dongmeiliu modelsforcovid19datapredictionbasedonimprovedlstmarimaalgorithms AT shanyu modelsforcovid19datapredictionbasedonimprovedlstmarimaalgorithms AT chenxiwang modelsforcovid19datapredictionbasedonimprovedlstmarimaalgorithms AT xiaolingwang modelsforcovid19datapredictionbasedonimprovedlstmarimaalgorithms AT xiyinwang modelsforcovid19datapredictionbasedonimprovedlstmarimaalgorithms |