An Interpretable Machine Learning Approach for Hepatitis B Diagnosis

Hepatitis B is a potentially deadly liver infection caused by the hepatitis B virus. It is a serious public health problem globally. Substantial efforts have been made to apply machine learning in detecting the virus. However, the application of model interpretability is limited in the existing lite...

Full description

Bibliographic Details
Main Authors: George Obaido, Blessing Ogbuokiri, Theo G. Swart, Nimibofa Ayawei, Sydney Mambwe Kasongo, Kehinde Aruleba, Ibomoiye Domor Mienye, Idowu Aruleba, Williams Chukwu, Fadekemi Osaye, Oluwaseun F. Egbelowo, Simelane Simphiwe, Ebenezer Esenogho
Format: Article
Language:English
Published: MDPI AG 2022-11-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/12/21/11127
_version_ 1827647369177464832
author George Obaido
Blessing Ogbuokiri
Theo G. Swart
Nimibofa Ayawei
Sydney Mambwe Kasongo
Kehinde Aruleba
Ibomoiye Domor Mienye
Idowu Aruleba
Williams Chukwu
Fadekemi Osaye
Oluwaseun F. Egbelowo
Simelane Simphiwe
Ebenezer Esenogho
author_facet George Obaido
Blessing Ogbuokiri
Theo G. Swart
Nimibofa Ayawei
Sydney Mambwe Kasongo
Kehinde Aruleba
Ibomoiye Domor Mienye
Idowu Aruleba
Williams Chukwu
Fadekemi Osaye
Oluwaseun F. Egbelowo
Simelane Simphiwe
Ebenezer Esenogho
author_sort George Obaido
collection DOAJ
description Hepatitis B is a potentially deadly liver infection caused by the hepatitis B virus. It is a serious public health problem globally. Substantial efforts have been made to apply machine learning in detecting the virus. However, the application of model interpretability is limited in the existing literature. Model interpretability makes it easier for humans to understand and trust the machine-learning model. Therefore, in this study, we used SHapley Additive exPlanations (SHAP), a game-based theoretical approach to explain and visualize the predictions of machine learning models applied for hepatitis B diagnosis. The algorithms used in building the models include decision tree, logistic regression, support vector machines, random forest, adaptive boosting (AdaBoost), and extreme gradient boosting (XGBoost), and they achieved balanced accuracies of 75%, 82%, 75%, 86%, 92%, and 90%, respectively. Meanwhile, the SHAP values showed that bilirubin is the most significant feature contributing to a higher mortality rate. Consequently, older patients are more likely to die with elevated bilirubin levels. The outcome of this study can aid health practitioners and health policymakers in explaining the result of machine learning models for health-related problems.
first_indexed 2024-03-09T19:17:11Z
format Article
id doaj.art-7b9abae8064241768e6af3ed7aaf0a91
institution Directory Open Access Journal
issn 2076-3417
language English
last_indexed 2024-03-09T19:17:11Z
publishDate 2022-11-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj.art-7b9abae8064241768e6af3ed7aaf0a912023-11-24T03:38:27ZengMDPI AGApplied Sciences2076-34172022-11-0112211112710.3390/app122111127An Interpretable Machine Learning Approach for Hepatitis B DiagnosisGeorge Obaido0Blessing Ogbuokiri1Theo G. Swart2Nimibofa Ayawei3Sydney Mambwe Kasongo4Kehinde Aruleba5Ibomoiye Domor Mienye6Idowu Aruleba7Williams Chukwu8Fadekemi Osaye9Oluwaseun F. Egbelowo10Simelane Simphiwe11Ebenezer Esenogho12Center for Human-Compatible Artificial Intelligence (CHAI), Berkeley Institute for Data Science (BIDS), University of California, Berkeley, CA 94720, USADepartment of Mathematics and Statistics, York University, Toronto, ON M3J 1P3, CanadaCenter for Telecommunications, Department of Electrical and Electronic Engineering Science, University of Johannesburg, Johannesburg 2006, South AfricaDepartment of Chemistry, Bayelsa Medical University, Yenagoa PMB 178, NigeriaDepartment of Industrial Engineering, Faculty of Engineering, Stellenbosch University, Stellenbosch 7600, South AfricaSchool of Computing and Mathematical Sciences, University of Leicester, Leicester LE1 7RH, UKDepartment of Electrical and Electronic Engineering Science, University of Johannesburg, Johannesburg 2006, South AfricaDepartment of Electrical and Electronic Engineering Science, University of Johannesburg, Johannesburg 2006, South AfricaDepartment of Mathematics, Wake Forest University, Winston-Salem, NC 27109, USADepartment of Mathematics and Computer Science, Alabama State University, Montgomery, AL 36104, USADepartment of Integrative Biology, The University of Texas at Austin, Austin, TX 78712, USADepartment of Mathematics and Applied Mathematics, University of Johannesburg, Doornfontein 2028, South AfricaCenter for Telecommunications, Department of Electrical and Electronic Engineering Science, University of Johannesburg, Johannesburg 2006, South AfricaHepatitis B is a potentially deadly liver infection caused by the hepatitis B virus. It is a serious public health problem globally. Substantial efforts have been made to apply machine learning in detecting the virus. However, the application of model interpretability is limited in the existing literature. Model interpretability makes it easier for humans to understand and trust the machine-learning model. Therefore, in this study, we used SHapley Additive exPlanations (SHAP), a game-based theoretical approach to explain and visualize the predictions of machine learning models applied for hepatitis B diagnosis. The algorithms used in building the models include decision tree, logistic regression, support vector machines, random forest, adaptive boosting (AdaBoost), and extreme gradient boosting (XGBoost), and they achieved balanced accuracies of 75%, 82%, 75%, 86%, 92%, and 90%, respectively. Meanwhile, the SHAP values showed that bilirubin is the most significant feature contributing to a higher mortality rate. Consequently, older patients are more likely to die with elevated bilirubin levels. The outcome of this study can aid health practitioners and health policymakers in explaining the result of machine learning models for health-related problems.https://www.mdpi.com/2076-3417/12/21/11127disease predictionhepatitis Binterpretabilitymachine learning
spellingShingle George Obaido
Blessing Ogbuokiri
Theo G. Swart
Nimibofa Ayawei
Sydney Mambwe Kasongo
Kehinde Aruleba
Ibomoiye Domor Mienye
Idowu Aruleba
Williams Chukwu
Fadekemi Osaye
Oluwaseun F. Egbelowo
Simelane Simphiwe
Ebenezer Esenogho
An Interpretable Machine Learning Approach for Hepatitis B Diagnosis
Applied Sciences
disease prediction
hepatitis B
interpretability
machine learning
title An Interpretable Machine Learning Approach for Hepatitis B Diagnosis
title_full An Interpretable Machine Learning Approach for Hepatitis B Diagnosis
title_fullStr An Interpretable Machine Learning Approach for Hepatitis B Diagnosis
title_full_unstemmed An Interpretable Machine Learning Approach for Hepatitis B Diagnosis
title_short An Interpretable Machine Learning Approach for Hepatitis B Diagnosis
title_sort interpretable machine learning approach for hepatitis b diagnosis
topic disease prediction
hepatitis B
interpretability
machine learning
url https://www.mdpi.com/2076-3417/12/21/11127
work_keys_str_mv AT georgeobaido aninterpretablemachinelearningapproachforhepatitisbdiagnosis
AT blessingogbuokiri aninterpretablemachinelearningapproachforhepatitisbdiagnosis
AT theogswart aninterpretablemachinelearningapproachforhepatitisbdiagnosis
AT nimibofaayawei aninterpretablemachinelearningapproachforhepatitisbdiagnosis
AT sydneymambwekasongo aninterpretablemachinelearningapproachforhepatitisbdiagnosis
AT kehindearuleba aninterpretablemachinelearningapproachforhepatitisbdiagnosis
AT ibomoiyedomormienye aninterpretablemachinelearningapproachforhepatitisbdiagnosis
AT idowuaruleba aninterpretablemachinelearningapproachforhepatitisbdiagnosis
AT williamschukwu aninterpretablemachinelearningapproachforhepatitisbdiagnosis
AT fadekemiosaye aninterpretablemachinelearningapproachforhepatitisbdiagnosis
AT oluwaseunfegbelowo aninterpretablemachinelearningapproachforhepatitisbdiagnosis
AT simelanesimphiwe aninterpretablemachinelearningapproachforhepatitisbdiagnosis
AT ebenezeresenogho aninterpretablemachinelearningapproachforhepatitisbdiagnosis
AT georgeobaido interpretablemachinelearningapproachforhepatitisbdiagnosis
AT blessingogbuokiri interpretablemachinelearningapproachforhepatitisbdiagnosis
AT theogswart interpretablemachinelearningapproachforhepatitisbdiagnosis
AT nimibofaayawei interpretablemachinelearningapproachforhepatitisbdiagnosis
AT sydneymambwekasongo interpretablemachinelearningapproachforhepatitisbdiagnosis
AT kehindearuleba interpretablemachinelearningapproachforhepatitisbdiagnosis
AT ibomoiyedomormienye interpretablemachinelearningapproachforhepatitisbdiagnosis
AT idowuaruleba interpretablemachinelearningapproachforhepatitisbdiagnosis
AT williamschukwu interpretablemachinelearningapproachforhepatitisbdiagnosis
AT fadekemiosaye interpretablemachinelearningapproachforhepatitisbdiagnosis
AT oluwaseunfegbelowo interpretablemachinelearningapproachforhepatitisbdiagnosis
AT simelanesimphiwe interpretablemachinelearningapproachforhepatitisbdiagnosis
AT ebenezeresenogho interpretablemachinelearningapproachforhepatitisbdiagnosis