Can We Trust Score Plots?

In this paper, we discuss the validity of using score plots of component models such as partial least squares regression, especially when these models are used for building classification models, and models derived from partial least squares regression for discriminant analysis (PLS-DA). Using examp...

Full description

Bibliographic Details
Main Authors: Marta Bevilacqua, Rasmus Bro
Format: Article
Language:English
Published: MDPI AG 2020-07-01
Series:Metabolites
Subjects:
Online Access:https://www.mdpi.com/2218-1989/10/7/278
_version_ 1827713514646536192
author Marta Bevilacqua
Rasmus Bro
author_facet Marta Bevilacqua
Rasmus Bro
author_sort Marta Bevilacqua
collection DOAJ
description In this paper, we discuss the validity of using score plots of component models such as partial least squares regression, especially when these models are used for building classification models, and models derived from partial least squares regression for discriminant analysis (PLS-DA). Using examples and simulations, it is shown that the currently accepted practice of showing score plots from calibration models may give misleading interpretations. It is suggested and shown that the problem can be solved by replacing the currently used calibrated score plots with cross-validated score plots.
first_indexed 2024-03-10T18:37:00Z
format Article
id doaj.art-dfaf5902d2064af29c52d9d86bf5da1b
institution Directory Open Access Journal
issn 2218-1989
language English
last_indexed 2024-03-10T18:37:00Z
publishDate 2020-07-01
publisher MDPI AG
record_format Article
series Metabolites
spelling doaj.art-dfaf5902d2064af29c52d9d86bf5da1b2023-11-20T06:09:56ZengMDPI AGMetabolites2218-19892020-07-0110727810.3390/metabo10070278Can We Trust Score Plots?Marta Bevilacqua0Rasmus Bro1Department of Food Science, University of Copenhagen, Rolighedsvej 26, 1958 Frederiksberg, DenmarkDepartment of Food Science, University of Copenhagen, Rolighedsvej 26, 1958 Frederiksberg, DenmarkIn this paper, we discuss the validity of using score plots of component models such as partial least squares regression, especially when these models are used for building classification models, and models derived from partial least squares regression for discriminant analysis (PLS-DA). Using examples and simulations, it is shown that the currently accepted practice of showing score plots from calibration models may give misleading interpretations. It is suggested and shown that the problem can be solved by replacing the currently used calibrated score plots with cross-validated score plots.https://www.mdpi.com/2218-1989/10/7/278overfittingscore plotsvalidation
spellingShingle Marta Bevilacqua
Rasmus Bro
Can We Trust Score Plots?
Metabolites
overfitting
score plots
validation
title Can We Trust Score Plots?
title_full Can We Trust Score Plots?
title_fullStr Can We Trust Score Plots?
title_full_unstemmed Can We Trust Score Plots?
title_short Can We Trust Score Plots?
title_sort can we trust score plots
topic overfitting
score plots
validation
url https://www.mdpi.com/2218-1989/10/7/278
work_keys_str_mv AT martabevilacqua canwetrustscoreplots
AT rasmusbro canwetrustscoreplots