Machine Learning-Based Screening Solution for COVID-19 Cases Investigation: Socio-Demographic and Behavioral Factors Analysis and COVID-19 Detection

Abstract The COVID-19 pandemic has unleashed an unprecedented global crisis, releasing a wave of illness, mortality, and economic disarray of unparalleled proportions. Numerous societal and behavioral aspects have conspired to fuel the rampant spread of COVID-19 across the globe. These factors encom...

Full description

Bibliographic Details
Main Authors: K. M. Aslam Uddin, Farida Siddiqi Prity, Maisha Tasnim, Sumiya Nur Jannat, Mohammad Omar Faruk, Jahirul Islam, Saydul Akbar Murad, Apurba Adhikary, Anupam Kumar Bairagi
Format: Article
Language:English
Published: Springer Nature 2023-10-01
Series:Human-Centric Intelligent Systems
Subjects:
Online Access:https://doi.org/10.1007/s44230-023-00049-9
_version_ 1797376914245025792
author K. M. Aslam Uddin
Farida Siddiqi Prity
Maisha Tasnim
Sumiya Nur Jannat
Mohammad Omar Faruk
Jahirul Islam
Saydul Akbar Murad
Apurba Adhikary
Anupam Kumar Bairagi
author_facet K. M. Aslam Uddin
Farida Siddiqi Prity
Maisha Tasnim
Sumiya Nur Jannat
Mohammad Omar Faruk
Jahirul Islam
Saydul Akbar Murad
Apurba Adhikary
Anupam Kumar Bairagi
author_sort K. M. Aslam Uddin
collection DOAJ
description Abstract The COVID-19 pandemic has unleashed an unprecedented global crisis, releasing a wave of illness, mortality, and economic disarray of unparalleled proportions. Numerous societal and behavioral aspects have conspired to fuel the rampant spread of COVID-19 across the globe. These factors encompass densely populated areas, adherence to mask-wearing protocols, inadequate awareness levels, and various behavioral and social practices. Despite the extensive research surrounding COVID-19 detection, an unfortunate dearth of studies has emerged to meticulously evaluate the intricate interplay between socio-demographic and behavioral factors and the likelihood of COVID-19 infection. Thus, a comprehensive online-based cross-sectional survey was methodically orchestrated, amassing data from a substantial sample size of 500 respondents. The precisely designed survey questionnaire encompassed various variables encompassing socio-demographics, behaviors, and social factors. The Bivariate Pearson’s Chi-square association test was deftly employed to unravel the complex associations between the explanatory variables and COVID-19 infection. The feature importance approach was also introduced to discern the utmost critical features underpinning this infectious predicament. Four distinct Machine Learning (ML) algorithms, specifically Decision Tree, Random Forest, CatBoost, and XGBoost, were employed to accurately predict COVID-19 infection based on a comprehensive analysis of socio-demographic and behavioral factors. The performance of these models was rigorously assessed using a range of evaluation metrics, including accuracy, recall, precision, ROC-AUC score, and F1 score. Pearson’s Chi-square test revealed a statistically significant association between vaccination status and COVID-19 infection. The use of sanitizer and masks, the timing of infection, and the interval between the first and second vaccine doses were significantly correlated with the likelihood of contracting the COVID-19 virus. Among the ML models tested, the XGBoost classifier demonstrated the highest classification accuracy, achieving an impressive 97.6%. These findings provide valuable insights for individuals, communities, and policymakers to implement targeted strategies aimed at mitigating the impact of the COVID-19 pandemic.
first_indexed 2024-03-08T19:46:24Z
format Article
id doaj.art-790f8169838c444ea4f978ece67aa2f2
institution Directory Open Access Journal
issn 2667-1336
language English
last_indexed 2024-03-08T19:46:24Z
publishDate 2023-10-01
publisher Springer Nature
record_format Article
series Human-Centric Intelligent Systems
spelling doaj.art-790f8169838c444ea4f978ece67aa2f22023-12-24T12:20:56ZengSpringer NatureHuman-Centric Intelligent Systems2667-13362023-10-013444146010.1007/s44230-023-00049-9Machine Learning-Based Screening Solution for COVID-19 Cases Investigation: Socio-Demographic and Behavioral Factors Analysis and COVID-19 DetectionK. M. Aslam Uddin0Farida Siddiqi Prity1Maisha Tasnim2Sumiya Nur Jannat3Mohammad Omar Faruk4Jahirul Islam5Saydul Akbar Murad6Apurba Adhikary7Anupam Kumar Bairagi8Department of Information and Communication Engineering, Noakhali Science and Technology UniversityDepartment of Information and Communication Engineering, Noakhali Science and Technology UniversityDepartment of Information and Communication Engineering, Noakhali Science and Technology UniversityDepartment of Statistics, Noakhali Science and Technology UniversityDepartment of Statistics, Noakhali Science and Technology UniversityDepartment of Computer Science and Engineering, New Mexico Institute of Mining and TechnologySchool of Computing Sciences and Engineering, University of Southern MississippiDepartment of Information and Communication Engineering, Noakhali Science and Technology UniversityComputing Sciences and Engineering Discipline, Khulna UniversityAbstract The COVID-19 pandemic has unleashed an unprecedented global crisis, releasing a wave of illness, mortality, and economic disarray of unparalleled proportions. Numerous societal and behavioral aspects have conspired to fuel the rampant spread of COVID-19 across the globe. These factors encompass densely populated areas, adherence to mask-wearing protocols, inadequate awareness levels, and various behavioral and social practices. Despite the extensive research surrounding COVID-19 detection, an unfortunate dearth of studies has emerged to meticulously evaluate the intricate interplay between socio-demographic and behavioral factors and the likelihood of COVID-19 infection. Thus, a comprehensive online-based cross-sectional survey was methodically orchestrated, amassing data from a substantial sample size of 500 respondents. The precisely designed survey questionnaire encompassed various variables encompassing socio-demographics, behaviors, and social factors. The Bivariate Pearson’s Chi-square association test was deftly employed to unravel the complex associations between the explanatory variables and COVID-19 infection. The feature importance approach was also introduced to discern the utmost critical features underpinning this infectious predicament. Four distinct Machine Learning (ML) algorithms, specifically Decision Tree, Random Forest, CatBoost, and XGBoost, were employed to accurately predict COVID-19 infection based on a comprehensive analysis of socio-demographic and behavioral factors. The performance of these models was rigorously assessed using a range of evaluation metrics, including accuracy, recall, precision, ROC-AUC score, and F1 score. Pearson’s Chi-square test revealed a statistically significant association between vaccination status and COVID-19 infection. The use of sanitizer and masks, the timing of infection, and the interval between the first and second vaccine doses were significantly correlated with the likelihood of contracting the COVID-19 virus. Among the ML models tested, the XGBoost classifier demonstrated the highest classification accuracy, achieving an impressive 97.6%. These findings provide valuable insights for individuals, communities, and policymakers to implement targeted strategies aimed at mitigating the impact of the COVID-19 pandemic.https://doi.org/10.1007/s44230-023-00049-9COVID-19Socio-demographicBehaviorPearson Chi-squareMachine Learning
spellingShingle K. M. Aslam Uddin
Farida Siddiqi Prity
Maisha Tasnim
Sumiya Nur Jannat
Mohammad Omar Faruk
Jahirul Islam
Saydul Akbar Murad
Apurba Adhikary
Anupam Kumar Bairagi
Machine Learning-Based Screening Solution for COVID-19 Cases Investigation: Socio-Demographic and Behavioral Factors Analysis and COVID-19 Detection
Human-Centric Intelligent Systems
COVID-19
Socio-demographic
Behavior
Pearson Chi-square
Machine Learning
title Machine Learning-Based Screening Solution for COVID-19 Cases Investigation: Socio-Demographic and Behavioral Factors Analysis and COVID-19 Detection
title_full Machine Learning-Based Screening Solution for COVID-19 Cases Investigation: Socio-Demographic and Behavioral Factors Analysis and COVID-19 Detection
title_fullStr Machine Learning-Based Screening Solution for COVID-19 Cases Investigation: Socio-Demographic and Behavioral Factors Analysis and COVID-19 Detection
title_full_unstemmed Machine Learning-Based Screening Solution for COVID-19 Cases Investigation: Socio-Demographic and Behavioral Factors Analysis and COVID-19 Detection
title_short Machine Learning-Based Screening Solution for COVID-19 Cases Investigation: Socio-Demographic and Behavioral Factors Analysis and COVID-19 Detection
title_sort machine learning based screening solution for covid 19 cases investigation socio demographic and behavioral factors analysis and covid 19 detection
topic COVID-19
Socio-demographic
Behavior
Pearson Chi-square
Machine Learning
url https://doi.org/10.1007/s44230-023-00049-9
work_keys_str_mv AT kmaslamuddin machinelearningbasedscreeningsolutionforcovid19casesinvestigationsociodemographicandbehavioralfactorsanalysisandcovid19detection
AT faridasiddiqiprity machinelearningbasedscreeningsolutionforcovid19casesinvestigationsociodemographicandbehavioralfactorsanalysisandcovid19detection
AT maishatasnim machinelearningbasedscreeningsolutionforcovid19casesinvestigationsociodemographicandbehavioralfactorsanalysisandcovid19detection
AT sumiyanurjannat machinelearningbasedscreeningsolutionforcovid19casesinvestigationsociodemographicandbehavioralfactorsanalysisandcovid19detection
AT mohammadomarfaruk machinelearningbasedscreeningsolutionforcovid19casesinvestigationsociodemographicandbehavioralfactorsanalysisandcovid19detection
AT jahirulislam machinelearningbasedscreeningsolutionforcovid19casesinvestigationsociodemographicandbehavioralfactorsanalysisandcovid19detection
AT saydulakbarmurad machinelearningbasedscreeningsolutionforcovid19casesinvestigationsociodemographicandbehavioralfactorsanalysisandcovid19detection
AT apurbaadhikary machinelearningbasedscreeningsolutionforcovid19casesinvestigationsociodemographicandbehavioralfactorsanalysisandcovid19detection
AT anupamkumarbairagi machinelearningbasedscreeningsolutionforcovid19casesinvestigationsociodemographicandbehavioralfactorsanalysisandcovid19detection