Random forest regression for prediction of Covid-19 daily cases and deaths in Turkey

During pandemic periods, there is an intense flow of patients to hospitals. Depending on the disease, many patients may require hospitalization. In some cases, these patients must be taken to intensive care units and emergency interventions must be performed. However, finding a sufficient number of...

Full description

Bibliographic Details
Main Author: Figen Özen
Format: Article
Language:English
Published: Elsevier 2024-02-01
Series:Heliyon
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2405844024017778
_version_ 1827322343590985728
author Figen Özen
author_facet Figen Özen
author_sort Figen Özen
collection DOAJ
description During pandemic periods, there is an intense flow of patients to hospitals. Depending on the disease, many patients may require hospitalization. In some cases, these patients must be taken to intensive care units and emergency interventions must be performed. However, finding a sufficient number of hospital beds or intensive care units during pandemic periods poses a big problem. In these periods, fast and effective planning is more important than ever. Another problem experienced during pandemic periods is the burial of the dead in case the number of deaths increases. This is also a situation that requires due planning. We can learn some lessons from Covid 19 pandemic and be prepared for the future ones. In this paper, statistical properties of the daily cases and daily deaths in Turkey, which is one of the most affected countries by the pandemic in the World, are studied. It is found that the characteristics are nonstationary. Then, random forest regression is applied to predict Covid-19 daily cases and deaths. In addition, seven other machine learning models, namely bagging, AdaBoost, gradient boosting, XGBoost, decision tree, LSTM and ARIMA regressors are built for comparison. The performance of the models are measured using accuracy, coefficient of variation, root-mean-square score and relative error metrics. When random forest regressors are employed, test data related to daily cases are predicted with an accuracy of 92.30% and with an r2 score of 0.9893. Besides, daily deaths are predicted with an accuracy of 91.39% and with an r2 score of 0.9834. The closest rival in predictions is the bagging regressor. Nevertheless, the results provided by this algoritm changed in different runs and this fact is shown in the study, as well. Comparisons are based on test data. Comparisons with the earlier works are also provided.
first_indexed 2024-03-08T03:29:47Z
format Article
id doaj.art-de31c3ea3be24704a2467b5796afd584
institution Directory Open Access Journal
issn 2405-8440
language English
last_indexed 2024-04-25T01:23:03Z
publishDate 2024-02-01
publisher Elsevier
record_format Article
series Heliyon
spelling doaj.art-de31c3ea3be24704a2467b5796afd5842024-03-09T09:26:06ZengElsevierHeliyon2405-84402024-02-01104e25746Random forest regression for prediction of Covid-19 daily cases and deaths in TurkeyFigen Özen0Department of Electrical and Electronics Engineering, Haliç University, Istanbul, TurkeyDuring pandemic periods, there is an intense flow of patients to hospitals. Depending on the disease, many patients may require hospitalization. In some cases, these patients must be taken to intensive care units and emergency interventions must be performed. However, finding a sufficient number of hospital beds or intensive care units during pandemic periods poses a big problem. In these periods, fast and effective planning is more important than ever. Another problem experienced during pandemic periods is the burial of the dead in case the number of deaths increases. This is also a situation that requires due planning. We can learn some lessons from Covid 19 pandemic and be prepared for the future ones. In this paper, statistical properties of the daily cases and daily deaths in Turkey, which is one of the most affected countries by the pandemic in the World, are studied. It is found that the characteristics are nonstationary. Then, random forest regression is applied to predict Covid-19 daily cases and deaths. In addition, seven other machine learning models, namely bagging, AdaBoost, gradient boosting, XGBoost, decision tree, LSTM and ARIMA regressors are built for comparison. The performance of the models are measured using accuracy, coefficient of variation, root-mean-square score and relative error metrics. When random forest regressors are employed, test data related to daily cases are predicted with an accuracy of 92.30% and with an r2 score of 0.9893. Besides, daily deaths are predicted with an accuracy of 91.39% and with an r2 score of 0.9834. The closest rival in predictions is the bagging regressor. Nevertheless, the results provided by this algoritm changed in different runs and this fact is shown in the study, as well. Comparisons are based on test data. Comparisons with the earlier works are also provided.http://www.sciencedirect.com/science/article/pii/S2405844024017778Covid-19 pandemicMachine learningEnsemble learningRandom forest regressorBagging regressorBoosting regressor
spellingShingle Figen Özen
Random forest regression for prediction of Covid-19 daily cases and deaths in Turkey
Heliyon
Covid-19 pandemic
Machine learning
Ensemble learning
Random forest regressor
Bagging regressor
Boosting regressor
title Random forest regression for prediction of Covid-19 daily cases and deaths in Turkey
title_full Random forest regression for prediction of Covid-19 daily cases and deaths in Turkey
title_fullStr Random forest regression for prediction of Covid-19 daily cases and deaths in Turkey
title_full_unstemmed Random forest regression for prediction of Covid-19 daily cases and deaths in Turkey
title_short Random forest regression for prediction of Covid-19 daily cases and deaths in Turkey
title_sort random forest regression for prediction of covid 19 daily cases and deaths in turkey
topic Covid-19 pandemic
Machine learning
Ensemble learning
Random forest regressor
Bagging regressor
Boosting regressor
url http://www.sciencedirect.com/science/article/pii/S2405844024017778
work_keys_str_mv AT figenozen randomforestregressionforpredictionofcovid19dailycasesanddeathsinturkey