When less is more powerful: Shapley value attributed ablation with augmented learning for practical time series sensor data classification

Time series sensor data classification tasks often suffer from training data scarcity because expert annotation is expensive. For example, electrocardiogram (ECG) data classification for cardiovascular disease (CVD) detection requires expensive labeling p...

Bibliographic Details
Main Authors: Arijit Ukil, Leandro Marin, Antonio J. Jara
Format: Article
Language: English
Published: Public Library of Science (PLoS) 2022-01-01
Series: PLoS ONE
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9683574/?tool=EBI
author Arijit Ukil
Leandro Marin
Antonio J. Jara
collection DOAJ
description Time series sensor data classification tasks often suffer from training data scarcity because expert annotation is expensive. For example, electrocardiogram (ECG) data classification for cardiovascular disease (CVD) detection requires expensive labeling procedures with the help of cardiologists. Current state-of-the-art algorithms such as deep learning models have shown outstanding performance, but they generally require a large set of training examples. In this paper, we propose Shapley Attributed Ablation with Augmented Learning (ShapAAL), which demonstrates that a deep learning algorithm trained on a suitably selected subset of the seen examples, that is, with the unimportant examples ablated from the given limited training dataset, can ensure consistently better classification performance under augmented training. In ShapAAL, additive perturbed training augments the input space with perturbation-induced inputs, using a Residual Network (ResNet) architecture, to compensate for the scarcity of training examples, while Shapley attribution seeks the subset of the augmented training space that offers better learnability, with the goal of better general predictive performance, thanks to the “efficiency” and “null player” axioms of transferable utility games upon which the Shapley value game is formulated. The subset of training examples that contribute positively to a supervised learning setup is derived from the notion of coalition games, using the Shapley value associated with each input’s contribution to the model prediction. ShapAAL is a novel push-pull deep architecture in which subset selection through Shapley value attribution pushes the model to a lower dimension while augmented training augments the learning capability of the model over unseen data. We perform an ablation study to provide empirical evidence for our claim, and we show that the proposed ShapAAL method consistently outperforms the current baselines and state-of-the-art algorithms on time series sensor data classification tasks from the publicly available UCR time series archive, which includes practically important problems such as the detection of CVDs from ECG data.
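The two ideas named in the description, additive perturbation of the inputs and Shapley-based selection of training examples, can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a nearest-centroid classifier in place of the ResNet, a held-out validation set as the utility of a coalition, and Monte Carlo permutation sampling to estimate Shapley values. All function names (`augment`, `accuracy`, `shapley_values`, `ablate`) and parameters (`sigma`, `copies`, `n_perm`) are hypothetical.

```python
import random


def augment(examples, sigma=0.05, copies=2, seed=0):
    """Additive perturbation (assumed Gaussian): keep each (series, label)
    pair and append `copies` noisy versions of it with the same label."""
    rng = random.Random(seed)
    out = list(examples)
    for x, y in examples:
        for _ in range(copies):
            out.append(([v + rng.gauss(0.0, sigma) for v in x], y))
    return out


def accuracy(train, valid):
    """Utility of a coalition of training examples: validation accuracy of
    a nearest-centroid classifier (a stand-in for the deep model)."""
    if not train:
        return 0.0
    sums, counts = {}, {}
    for x, y in train:
        acc = sums.setdefault(y, [0.0] * len(x))
        for i, v in enumerate(x):
            acc[i] += v
        counts[y] = counts.get(y, 0) + 1
    centroids = {y: [s / counts[y] for s in sums[y]] for y in sums}

    def predict(x):
        # Assign the label of the closest class centroid (squared distance).
        return min(
            centroids,
            key=lambda y: sum((a - b) ** 2 for a, b in zip(x, centroids[y])),
        )

    return sum(predict(x) == y for x, y in valid) / len(valid)


def shapley_values(train, valid, n_perm=200, seed=0):
    """Monte Carlo estimate of each training example's Shapley value:
    its average marginal gain in utility over random orderings."""
    rng = random.Random(seed)
    n = len(train)
    phi = [0.0] * n
    for _ in range(n_perm):
        order = list(range(n))
        rng.shuffle(order)
        coalition, prev = [], accuracy([], valid)
        for i in order:
            coalition.append(train[i])
            cur = accuracy(coalition, valid)
            phi[i] += (cur - prev) / n_perm
            prev = cur
    return phi


def ablate(train, phi):
    """Keep only the examples whose estimated contribution is positive."""
    return [ex for ex, p in zip(train, phi) if p > 0]
```

Under these assumptions, a call such as `ablate(aug, shapley_values(aug, valid))` with `aug = augment(train)` selects a subset of the augmented training space. Because the marginal gains telescope within each permutation, the estimates sum to the utility of the full set minus that of the empty set, which mirrors the "efficiency" axiom mentioned in the description.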
first_indexed 2024-04-11T15:46:04Z
format Article
id doaj.art-c18fa76c873f4c7e9831cd283dddcf6f
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-04-11T15:46:04Z
publishDate 2022-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
title When less is more powerful: Shapley value attributed ablation with augmented learning for practical time series sensor data classification
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9683574/?tool=EBI