Spatial distribution and machine learning prediction of sexually transmitted infections and associated factors among sexually active men and women in Ethiopia, evidence from EDHS 2016

Abstract Introduction Sexually transmitted infections (STIs) are the major public health problem globally, affecting millions of people every day. The burden is high in the Sub-Saharan region, including Ethiopia. Besides, there is little evidence on the distribution of STIs across Ethiopian regions....

Full description

Bibliographic Details
Main Authors: Abdul-Aziz Kebede Kassaw, Tesfahun Melese Yilma, Yakub Sebastian, Abraham Yeneneh Birhanu, Mequannent Sharew Melaku, Sebwedin Surur Jemal
Format: Article
Language:English
Published: BMC 2023-01-01
Series:BMC Infectious Diseases
Subjects:
Online Access:https://doi.org/10.1186/s12879-023-07987-6
_version_ 1811175988207288320
author Abdul-Aziz Kebede Kassaw
Tesfahun Melese Yilma
Yakub Sebastian
Abraham Yeneneh Birhanu
Mequannent Sharew Melaku
Sebwedin Surur Jemal
author_facet Abdul-Aziz Kebede Kassaw
Tesfahun Melese Yilma
Yakub Sebastian
Abraham Yeneneh Birhanu
Mequannent Sharew Melaku
Sebwedin Surur Jemal
author_sort Abdul-Aziz Kebede Kassaw
collection DOAJ
description Abstract Introduction Sexually transmitted infections (STIs) are the major public health problem globally, affecting millions of people every day. The burden is high in the Sub-Saharan region, including Ethiopia. Besides, there is little evidence on the distribution of STIs across Ethiopian regions. Hence, having a better understanding of the infections is of great importance to lessen their burden on society. Therefore, this article aimed to assess predictors of STIs using machine learning techniques and their geographic distribution across Ethiopian regions. Assessing the predictors of STIs and their spatial distribution could help policymakers to understand the problems better and design interventions accordingly. Methods A community-based cross-sectional study was conducted from January 18, 2016, to June 27, 2016, using the 2016 Ethiopian Demography and Health Survey (EDHS) dataset. We applied spatial autocorrelation analysis using Global Moran’s I statistics to detect latent STI clusters. Spatial scan statics was done to identify local significant clusters based on the Bernoulli model using the SaTScan™ for spatial distribution and Supervised machine learning models such as C5.0 Decision tree, Random Forest, Support Vector Machine, Naïve Bayes, and Logistic regression were applied to the 2016 EDHS dataset for STI prediction and their performances were analyzed. Association rules were done using an unsupervised machine learning algorithm. Results The spatial distribution of STI in Ethiopia was clustered across the country with a global Moran’s index = 0.06 and p value = 0.04. The Random Forest algorithm was best for STI prediction with 69.48% balanced accuracy and 68.50% area under the curve. The random forest model showed that region, wealth, age category, educational level, age at first sex, working status, marital status, media access, alcohol drinking, chat chewing, and sex of the respondent were the top 11 predictors of STI in Ethiopia. Conclusion Applying random forest machine learning algorithm for STI prediction in Ethiopia is the proposed model to identify the predictors of STIs.
first_indexed 2024-04-10T19:45:46Z
format Article
id doaj.art-e253bdba4efa48d6b5dea2c6ac052910
institution Directory Open Access Journal
issn 1471-2334
language English
last_indexed 2024-04-10T19:45:46Z
publishDate 2023-01-01
publisher BMC
record_format Article
series BMC Infectious Diseases
spelling doaj.art-e253bdba4efa48d6b5dea2c6ac0529102023-01-29T12:05:20ZengBMCBMC Infectious Diseases1471-23342023-01-0123111610.1186/s12879-023-07987-6Spatial distribution and machine learning prediction of sexually transmitted infections and associated factors among sexually active men and women in Ethiopia, evidence from EDHS 2016Abdul-Aziz Kebede Kassaw0Tesfahun Melese Yilma1Yakub Sebastian2Abraham Yeneneh Birhanu3Mequannent Sharew Melaku4Sebwedin Surur Jemal5Department of Health Informatics, School of Public Health, College of Medicine and Health Sciences, Wollo UniversityDepartment of Health Informatics, Institute of Public Health, College of Medicine and Health Sciences, University of GondarCharles Darwin UniversityDepartment of Health Informatics, Institute of Public Health, College of Medicine and Health Sciences, University of GondarDepartment of Health Informatics, Institute of Public Health, College of Medicine and Health Sciences, University of GondarDepartment of Statistics, Collage of Natural and Computational Science, Mizan Tepi UniversityAbstract Introduction Sexually transmitted infections (STIs) are the major public health problem globally, affecting millions of people every day. The burden is high in the Sub-Saharan region, including Ethiopia. Besides, there is little evidence on the distribution of STIs across Ethiopian regions. Hence, having a better understanding of the infections is of great importance to lessen their burden on society. Therefore, this article aimed to assess predictors of STIs using machine learning techniques and their geographic distribution across Ethiopian regions. Assessing the predictors of STIs and their spatial distribution could help policymakers to understand the problems better and design interventions accordingly. Methods A community-based cross-sectional study was conducted from January 18, 2016, to June 27, 2016, using the 2016 Ethiopian Demography and Health Survey (EDHS) dataset. We applied spatial autocorrelation analysis using Global Moran’s I statistics to detect latent STI clusters. Spatial scan statics was done to identify local significant clusters based on the Bernoulli model using the SaTScan™ for spatial distribution and Supervised machine learning models such as C5.0 Decision tree, Random Forest, Support Vector Machine, Naïve Bayes, and Logistic regression were applied to the 2016 EDHS dataset for STI prediction and their performances were analyzed. Association rules were done using an unsupervised machine learning algorithm. Results The spatial distribution of STI in Ethiopia was clustered across the country with a global Moran’s index = 0.06 and p value = 0.04. The Random Forest algorithm was best for STI prediction with 69.48% balanced accuracy and 68.50% area under the curve. The random forest model showed that region, wealth, age category, educational level, age at first sex, working status, marital status, media access, alcohol drinking, chat chewing, and sex of the respondent were the top 11 predictors of STI in Ethiopia. Conclusion Applying random forest machine learning algorithm for STI prediction in Ethiopia is the proposed model to identify the predictors of STIs.https://doi.org/10.1186/s12879-023-07987-6Sexually transmitted infectionsSpatial distributionMachine learningPredictionEthiopia
spellingShingle Abdul-Aziz Kebede Kassaw
Tesfahun Melese Yilma
Yakub Sebastian
Abraham Yeneneh Birhanu
Mequannent Sharew Melaku
Sebwedin Surur Jemal
Spatial distribution and machine learning prediction of sexually transmitted infections and associated factors among sexually active men and women in Ethiopia, evidence from EDHS 2016
BMC Infectious Diseases
Sexually transmitted infections
Spatial distribution
Machine learning
Prediction
Ethiopia
title Spatial distribution and machine learning prediction of sexually transmitted infections and associated factors among sexually active men and women in Ethiopia, evidence from EDHS 2016
title_full Spatial distribution and machine learning prediction of sexually transmitted infections and associated factors among sexually active men and women in Ethiopia, evidence from EDHS 2016
title_fullStr Spatial distribution and machine learning prediction of sexually transmitted infections and associated factors among sexually active men and women in Ethiopia, evidence from EDHS 2016
title_full_unstemmed Spatial distribution and machine learning prediction of sexually transmitted infections and associated factors among sexually active men and women in Ethiopia, evidence from EDHS 2016
title_short Spatial distribution and machine learning prediction of sexually transmitted infections and associated factors among sexually active men and women in Ethiopia, evidence from EDHS 2016
title_sort spatial distribution and machine learning prediction of sexually transmitted infections and associated factors among sexually active men and women in ethiopia evidence from edhs 2016
topic Sexually transmitted infections
Spatial distribution
Machine learning
Prediction
Ethiopia
url https://doi.org/10.1186/s12879-023-07987-6
work_keys_str_mv AT abdulazizkebedekassaw spatialdistributionandmachinelearningpredictionofsexuallytransmittedinfectionsandassociatedfactorsamongsexuallyactivemenandwomeninethiopiaevidencefromedhs2016
AT tesfahunmeleseyilma spatialdistributionandmachinelearningpredictionofsexuallytransmittedinfectionsandassociatedfactorsamongsexuallyactivemenandwomeninethiopiaevidencefromedhs2016
AT yakubsebastian spatialdistributionandmachinelearningpredictionofsexuallytransmittedinfectionsandassociatedfactorsamongsexuallyactivemenandwomeninethiopiaevidencefromedhs2016
AT abrahamyenenehbirhanu spatialdistributionandmachinelearningpredictionofsexuallytransmittedinfectionsandassociatedfactorsamongsexuallyactivemenandwomeninethiopiaevidencefromedhs2016
AT mequannentsharewmelaku spatialdistributionandmachinelearningpredictionofsexuallytransmittedinfectionsandassociatedfactorsamongsexuallyactivemenandwomeninethiopiaevidencefromedhs2016
AT sebwedinsururjemal spatialdistributionandmachinelearningpredictionofsexuallytransmittedinfectionsandassociatedfactorsamongsexuallyactivemenandwomeninethiopiaevidencefromedhs2016