Model stacking to improve prediction and variable importance robustness for soft sensor development

This paper presents an interpretable ensemble modelling method, in which the predictions of several individual base learners are combined together through Stacked generalisation, which makes use of a secondary layer model, or so called meta-learner, that is trained on the output cross-validation pre...

Full description

Bibliographic Details
Main Authors:	Maxwell Barton, Barry Lennox
Format:	Article
Language:	English
Published:	Elsevier 2022-06-01
Series:	Digital Chemical Engineering
Subjects:	Machine learning Soft sensor Model interpretability
Online Access:	http://www.sciencedirect.com/science/article/pii/S2772508122000254

_version_	1818202641826578432
author	Maxwell Barton Barry Lennox
author_facet	Maxwell Barton Barry Lennox
author_sort	Maxwell Barton
collection	DOAJ
description	This paper presents an interpretable ensemble modelling method, in which the predictions of several individual base learners are combined together through Stacked generalisation, which makes use of a secondary layer model, or so called meta-learner, that is trained on the output cross-validation predictions of each base learner. To provide interpretability, the permutation variable importance (PVI) is computed on the ensemble, wherein variables are randomly shuffled and the reduction in predictive performance for the ensemble is calculated for each variable. This is a novel contribution, as no previous attempts have been made in the soft sensor literature to investigate the interpretability of ensemble models that use heterogeneous base learners. The Stacked ensemble model also avoids model selection, which is the process of choosing among many candidate models. Model selection is often based on cross-validation, which is not guaranteed to select the best model in terms of true generalisation performance on the test set. Instead, the proposed method combines multiple models instead of choosing a singular model, avoiding the need for model selection. The efficacy of the proposed methodology in terms of both variable importance and predictive performance is shown on a synthetic dataset, in which the variable importance is already known, and an industrial dataset of a refinery process provided by Dow. For the synthetic dataset, it is shown that the proposed method chooses the correct casual variables, whereas the in-built variable importance provided by the individual models, namely Partial least squares, Lasso, Random forests & XGBoost, can give increased importance to non-causal, randomly generated variables. For the industrial study, the combined ensemble is shown to outperform all individual base models in terms of predictive performance, whilst also providing a new perspective in terms of variable importance compared to previous studies.
first_indexed	2024-12-12T03:12:41Z
format	Article
id	doaj.art-029142e3917141f6b9ad21debac5f1ab
institution	Directory Open Access Journal
issn	2772-5081
language	English
last_indexed	2024-12-12T03:12:41Z
publishDate	2022-06-01
publisher	Elsevier
record_format	Article
series	Digital Chemical Engineering
spelling	doaj.art-029142e3917141f6b9ad21debac5f1ab2022-12-22T00:40:21ZengElsevierDigital Chemical Engineering2772-50812022-06-013100034Model stacking to improve prediction and variable importance robustness for soft sensor developmentMaxwell Barton0Barry Lennox1Corresponding author.; University of Manchester, Manchester, Greater Manchester, United KingdomUniversity of Manchester, Manchester, Greater Manchester, United KingdomThis paper presents an interpretable ensemble modelling method, in which the predictions of several individual base learners are combined together through Stacked generalisation, which makes use of a secondary layer model, or so called meta-learner, that is trained on the output cross-validation predictions of each base learner. To provide interpretability, the permutation variable importance (PVI) is computed on the ensemble, wherein variables are randomly shuffled and the reduction in predictive performance for the ensemble is calculated for each variable. This is a novel contribution, as no previous attempts have been made in the soft sensor literature to investigate the interpretability of ensemble models that use heterogeneous base learners. The Stacked ensemble model also avoids model selection, which is the process of choosing among many candidate models. Model selection is often based on cross-validation, which is not guaranteed to select the best model in terms of true generalisation performance on the test set. Instead, the proposed method combines multiple models instead of choosing a singular model, avoiding the need for model selection. The efficacy of the proposed methodology in terms of both variable importance and predictive performance is shown on a synthetic dataset, in which the variable importance is already known, and an industrial dataset of a refinery process provided by Dow. For the synthetic dataset, it is shown that the proposed method chooses the correct casual variables, whereas the in-built variable importance provided by the individual models, namely Partial least squares, Lasso, Random forests & XGBoost, can give increased importance to non-causal, randomly generated variables. For the industrial study, the combined ensemble is shown to outperform all individual base models in terms of predictive performance, whilst also providing a new perspective in terms of variable importance compared to previous studies.http://www.sciencedirect.com/science/article/pii/S2772508122000254Machine learningSoft sensorModel interpretability
spellingShingle	Maxwell Barton Barry Lennox Model stacking to improve prediction and variable importance robustness for soft sensor development Digital Chemical Engineering Machine learning Soft sensor Model interpretability
title	Model stacking to improve prediction and variable importance robustness for soft sensor development
title_full	Model stacking to improve prediction and variable importance robustness for soft sensor development
title_fullStr	Model stacking to improve prediction and variable importance robustness for soft sensor development
title_full_unstemmed	Model stacking to improve prediction and variable importance robustness for soft sensor development
title_short	Model stacking to improve prediction and variable importance robustness for soft sensor development
title_sort	model stacking to improve prediction and variable importance robustness for soft sensor development
topic	Machine learning Soft sensor Model interpretability
url	http://www.sciencedirect.com/science/article/pii/S2772508122000254
work_keys_str_mv	AT maxwellbarton modelstackingtoimprovepredictionandvariableimportancerobustnessforsoftsensordevelopment AT barrylennox modelstackingtoimprovepredictionandvariableimportancerobustnessforsoftsensordevelopment

Model stacking to improve prediction and variable importance robustness for soft sensor development

Similar Items