Shapley Values as a Strategy for Ensemble Weights Estimation

This study introduces a novel performance-based weighting scheme for ensemble learning using the Shapley value. The weighting uses the reciprocal of binary cross-entropy as a base learner’s performance metric and estimates its Shapley value to measure the overall contribution of a learner to an equa...

Full description

Bibliographic Details
Main Authors: Vaidotas Drungilas, Evaldas Vaičiukynas, Linas Ablonskis, Lina Čeponienė
Format: Article
Language:English
Published: MDPI AG 2023-06-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/13/12/7010
_version_ 1827738768062283776
author Vaidotas Drungilas
Evaldas Vaičiukynas
Linas Ablonskis
Lina Čeponienė
author_facet Vaidotas Drungilas
Evaldas Vaičiukynas
Linas Ablonskis
Lina Čeponienė
author_sort Vaidotas Drungilas
collection DOAJ
description This study introduces a novel performance-based weighting scheme for ensemble learning using the Shapley value. The weighting uses the reciprocal of binary cross-entropy as a base learner’s performance metric and estimates its Shapley value to measure the overall contribution of a learner to an equally weighted ensemble of various sizes. Two variants of this strategy were empirically compared with a single monolith model and other static weighting strategies using two large banking-related datasets. A variant that discards learners with a negative Shapley value was ranked as first or at least second when constructing homogeneous ensembles, whereas for heterogeneous ensembles this strategy resulted in a better or at least similar detection performance to other weighting strategies tested. The main limitation being the computational complexity of Shapley calculations, the explored weighting strategy could be considered as a generalization of performance-based weighting.
first_indexed 2024-03-11T02:48:58Z
format Article
id doaj.art-a48081a78458412a8573ad9899531bdf
institution Directory Open Access Journal
issn 2076-3417
language English
last_indexed 2024-03-11T02:48:58Z
publishDate 2023-06-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj.art-a48081a78458412a8573ad9899531bdf2023-11-18T09:07:42ZengMDPI AGApplied Sciences2076-34172023-06-011312701010.3390/app13127010Shapley Values as a Strategy for Ensemble Weights EstimationVaidotas Drungilas0Evaldas Vaičiukynas1Linas Ablonskis2Lina Čeponienė3Department of Information Systems, Faculty of Informatics, Kaunas University of Technology, 44249 Kaunas, LithuaniaDepartment of Information Systems, Faculty of Informatics, Kaunas University of Technology, 44249 Kaunas, LithuaniaDepartment of Information Systems, Faculty of Informatics, Kaunas University of Technology, 44249 Kaunas, LithuaniaDepartment of Information Systems, Faculty of Informatics, Kaunas University of Technology, 44249 Kaunas, LithuaniaThis study introduces a novel performance-based weighting scheme for ensemble learning using the Shapley value. The weighting uses the reciprocal of binary cross-entropy as a base learner’s performance metric and estimates its Shapley value to measure the overall contribution of a learner to an equally weighted ensemble of various sizes. Two variants of this strategy were empirically compared with a single monolith model and other static weighting strategies using two large banking-related datasets. A variant that discards learners with a negative Shapley value was ranked as first or at least second when constructing homogeneous ensembles, whereas for heterogeneous ensembles this strategy resulted in a better or at least similar detection performance to other weighting strategies tested. The main limitation being the computational complexity of Shapley calculations, the explored weighting strategy could be considered as a generalization of performance-based weighting.https://www.mdpi.com/2076-3417/13/12/7010machine learningensemble methodsShapley valueperformance weightingprivacy-preserving distributed learning
spellingShingle Vaidotas Drungilas
Evaldas Vaičiukynas
Linas Ablonskis
Lina Čeponienė
Shapley Values as a Strategy for Ensemble Weights Estimation
Applied Sciences
machine learning
ensemble methods
Shapley value
performance weighting
privacy-preserving distributed learning
title Shapley Values as a Strategy for Ensemble Weights Estimation
title_full Shapley Values as a Strategy for Ensemble Weights Estimation
title_fullStr Shapley Values as a Strategy for Ensemble Weights Estimation
title_full_unstemmed Shapley Values as a Strategy for Ensemble Weights Estimation
title_short Shapley Values as a Strategy for Ensemble Weights Estimation
title_sort shapley values as a strategy for ensemble weights estimation
topic machine learning
ensemble methods
Shapley value
performance weighting
privacy-preserving distributed learning
url https://www.mdpi.com/2076-3417/13/12/7010
work_keys_str_mv AT vaidotasdrungilas shapleyvaluesasastrategyforensembleweightsestimation
AT evaldasvaiciukynas shapleyvaluesasastrategyforensembleweightsestimation
AT linasablonskis shapleyvaluesasastrategyforensembleweightsestimation
AT linaceponiene shapleyvaluesasastrategyforensembleweightsestimation