Exploration of black boxes of supervised machine learning models: A demonstration on development of predictive heart risk score

Machine learning (ML) often provides applicable high-performance models to facilitate decision-makers in various fields. However, this high performance is achieved at the expense of the interpretability of these models, which has been criticized by practitioners and has become a significant hindranc...

Full description

Bibliographic Details
Main Authors:	Sajid, Mirza Rizwan, Khan, Arshad Ali, Albar, Haitham M., Noryanti, Muhammad, Sami, Waqas, Bukhari, Syed Ahmad Chan, Wajahat, Iram
Format:	Article
Language:	English
Published:	Hindawi Limited 2022
Subjects:	QA Mathematics QA75 Electronic computers. Computer science QA76 Computer software T Technology (General) TA Engineering (General). Civil engineering (General)
Online Access:	http://umpir.ump.edu.my/id/eprint/35145/1/Exploration%20of%20black%20boxes%20of%20supervised%20machine%20learning%20models_A%20demonstration%20on%20development.pdf

_version_	1796995282704007168
author	Sajid, Mirza Rizwan Khan, Arshad Ali Albar, Haitham M. Noryanti, Muhammad Sami, Waqas Bukhari, Syed Ahmad Chan Wajahat, Iram
author_facet	Sajid, Mirza Rizwan Khan, Arshad Ali Albar, Haitham M. Noryanti, Muhammad Sami, Waqas Bukhari, Syed Ahmad Chan Wajahat, Iram
author_sort	Sajid, Mirza Rizwan
collection	UMP
description	Machine learning (ML) often provides applicable high-performance models to facilitate decision-makers in various fields. However, this high performance is achieved at the expense of the interpretability of these models, which has been criticized by practitioners and has become a significant hindrance in their application. Therefore, in highly sensitive decisions, black boxes of ML models are not recommended. We proposed a novel methodology that uses complex supervised ML models and transforms them into simple, interpretable, transparent statistical models. This methodology is like stacking ensemble ML in which the best ML models are used as a base learner to compute relative feature weights. The index of these weights is further used as a single covariate in the simple logistic regression model to estimate the likelihood of an event. We tested this methodology on the primary dataset related to cardiovascular diseases (CVDs), the leading cause of mortalities in recent times. Therefore, early risk assessment is an important dimension that can potentially reduce the burden of CVDs and their related mortality through accurate but interpretable risk prediction models. We developed an artificial neural network and support vector machines based on ML models and transformed them into a simple statistical model and heart risk scores. These simplified models were found transparent, reliable, valid, interpretable, and approximate in predictions. The findings of this study suggest that complex supervised ML models can be efficiently transformed into simple statistical models that can also be validated.
first_indexed	2024-03-06T13:00:02Z
format	Article
id	UMPir35145
institution	Universiti Malaysia Pahang
language	English
last_indexed	2024-03-06T13:00:02Z
publishDate	2022
publisher	Hindawi Limited
record_format	dspace
spelling	UMPir351452023-01-17T02:36:42Z http://umpir.ump.edu.my/id/eprint/35145/ Exploration of black boxes of supervised machine learning models: A demonstration on development of predictive heart risk score Sajid, Mirza Rizwan Khan, Arshad Ali Albar, Haitham M. Noryanti, Muhammad Sami, Waqas Bukhari, Syed Ahmad Chan Wajahat, Iram QA Mathematics QA75 Electronic computers. Computer science QA76 Computer software T Technology (General) TA Engineering (General). Civil engineering (General) Machine learning (ML) often provides applicable high-performance models to facilitate decision-makers in various fields. However, this high performance is achieved at the expense of the interpretability of these models, which has been criticized by practitioners and has become a significant hindrance in their application. Therefore, in highly sensitive decisions, black boxes of ML models are not recommended. We proposed a novel methodology that uses complex supervised ML models and transforms them into simple, interpretable, transparent statistical models. This methodology is like stacking ensemble ML in which the best ML models are used as a base learner to compute relative feature weights. The index of these weights is further used as a single covariate in the simple logistic regression model to estimate the likelihood of an event. We tested this methodology on the primary dataset related to cardiovascular diseases (CVDs), the leading cause of mortalities in recent times. Therefore, early risk assessment is an important dimension that can potentially reduce the burden of CVDs and their related mortality through accurate but interpretable risk prediction models. We developed an artificial neural network and support vector machines based on ML models and transformed them into a simple statistical model and heart risk scores. These simplified models were found transparent, reliable, valid, interpretable, and approximate in predictions. The findings of this study suggest that complex supervised ML models can be efficiently transformed into simple statistical models that can also be validated. Hindawi Limited 2022 Article PeerReviewed pdf en cc_by_4 http://umpir.ump.edu.my/id/eprint/35145/1/Exploration%20of%20black%20boxes%20of%20supervised%20machine%20learning%20models_A%20demonstration%20on%20development.pdf Sajid, Mirza Rizwan and Khan, Arshad Ali and Albar, Haitham M. and Noryanti, Muhammad and Sami, Waqas and Bukhari, Syed Ahmad Chan and Wajahat, Iram (2022) Exploration of black boxes of supervised machine learning models: A demonstration on development of predictive heart risk score. Computational Intelligence and Neuroscience, 2022 (5475313). pp. 1-11. ISSN 1687-5265. (Published) https://doi.org/10.1155/2022/5475313 https://doi.org/10.1155/2022/5475313
spellingShingle	QA Mathematics QA75 Electronic computers. Computer science QA76 Computer software T Technology (General) TA Engineering (General). Civil engineering (General) Sajid, Mirza Rizwan Khan, Arshad Ali Albar, Haitham M. Noryanti, Muhammad Sami, Waqas Bukhari, Syed Ahmad Chan Wajahat, Iram Exploration of black boxes of supervised machine learning models: A demonstration on development of predictive heart risk score
title	Exploration of black boxes of supervised machine learning models: A demonstration on development of predictive heart risk score
title_full	Exploration of black boxes of supervised machine learning models: A demonstration on development of predictive heart risk score
title_fullStr	Exploration of black boxes of supervised machine learning models: A demonstration on development of predictive heart risk score
title_full_unstemmed	Exploration of black boxes of supervised machine learning models: A demonstration on development of predictive heart risk score
title_short	Exploration of black boxes of supervised machine learning models: A demonstration on development of predictive heart risk score
title_sort	exploration of black boxes of supervised machine learning models a demonstration on development of predictive heart risk score
topic	QA Mathematics QA75 Electronic computers. Computer science QA76 Computer software T Technology (General) TA Engineering (General). Civil engineering (General)
url	http://umpir.ump.edu.my/id/eprint/35145/1/Exploration%20of%20black%20boxes%20of%20supervised%20machine%20learning%20models_A%20demonstration%20on%20development.pdf
work_keys_str_mv	AT sajidmirzarizwan explorationofblackboxesofsupervisedmachinelearningmodelsademonstrationondevelopmentofpredictiveheartriskscore AT khanarshadali explorationofblackboxesofsupervisedmachinelearningmodelsademonstrationondevelopmentofpredictiveheartriskscore AT albarhaithamm explorationofblackboxesofsupervisedmachinelearningmodelsademonstrationondevelopmentofpredictiveheartriskscore AT noryantimuhammad explorationofblackboxesofsupervisedmachinelearningmodelsademonstrationondevelopmentofpredictiveheartriskscore AT samiwaqas explorationofblackboxesofsupervisedmachinelearningmodelsademonstrationondevelopmentofpredictiveheartriskscore AT bukharisyedahmadchan explorationofblackboxesofsupervisedmachinelearningmodelsademonstrationondevelopmentofpredictiveheartriskscore AT wajahatiram explorationofblackboxesofsupervisedmachinelearningmodelsademonstrationondevelopmentofpredictiveheartriskscore

Exploration of black boxes of supervised machine learning models: A demonstration on development of predictive heart risk score

Similar Items