Automated fitting process using robust reliable weighted average on near infrared spectral data analysis

With the complexity of Near Infrared (NIR) spectral data, the selection of the optimal number of Partial Least Squares (PLS) components in the fitted Partial Least Squares Regression (PLSR) model is very important. Selecting a small number of PLS components leads to under fitting, whereas selecting...

Full description

Bibliographic Details
Main Authors: Silalahi, Divo Dharma, Midi, Habshah, Arasan, Jayanthi, Mustafa, Mohd Shafie, Caliman, Jean Pierre
Format: Article
Language:English
Published: Multidisciplinary Digital Publishing Institute 2020
Online Access:http://psasir.upm.edu.my/id/eprint/87995/1/ABSTRACT.pdf
_version_ 1796981912959451136
author Silalahi, Divo Dharma
Midi, Habshah
Arasan, Jayanthi
Mustafa, Mohd Shafie
Caliman, Jean Pierre
author_facet Silalahi, Divo Dharma
Midi, Habshah
Arasan, Jayanthi
Mustafa, Mohd Shafie
Caliman, Jean Pierre
author_sort Silalahi, Divo Dharma
collection UPM
description With the complexity of Near Infrared (NIR) spectral data, the selection of the optimal number of Partial Least Squares (PLS) components in the fitted Partial Least Squares Regression (PLSR) model is very important. Selecting a small number of PLS components leads to under fitting, whereas selecting a large number of PLS components results in over fitting. Several methods exist in the selection procedure, and each yields a different result. However, so far no one has been able to determine the more superior method. In addition, the current methods are susceptible to the presence of outliers and High Leverage Points (HLP) in a dataset. In this study, a new automated fitting process method on PLSR model is introduced. The method is called the Robust Reliable Weighted Average—PLS (RRWA-PLS), and it is less sensitive to the optimum number of PLS components. The RRWA-PLS uses the weighted average strategy from multiple PLSR models generated by the different complexities of the PLS components. The method assigns robust procedures in the weighing schemes as an improvement to the existing Weighted Average—PLS (WA-PLS) method. The weighing schemes in the proposed method are resistant to outliers and HLP and thus, preserve the contribution of the most relevant variables in the fitted model. The evaluation was done by utilizing artificial data with the Monte Carlo simulation and NIR spectral data of oil palm (Elaeis guineensis Jacq.) fruit mesocarp. Based on the results, the method claims to have shown its superiority in the improvement of the weight and variable selection procedures in the WA-PLS. It is also resistant to the influence of outliers and HLP in the dataset. The RRWA-PLS method provides a promising robust solution for the automated fitting process in the PLSR model as unlike the classical PLS, it does not require the selection of an optimal number of PLS components.
first_indexed 2024-03-06T10:44:36Z
format Article
id upm.eprints-87995
institution Universiti Putra Malaysia
language English
last_indexed 2024-03-06T10:44:36Z
publishDate 2020
publisher Multidisciplinary Digital Publishing Institute
record_format dspace
spelling upm.eprints-879952022-05-24T08:15:54Z http://psasir.upm.edu.my/id/eprint/87995/ Automated fitting process using robust reliable weighted average on near infrared spectral data analysis Silalahi, Divo Dharma Midi, Habshah Arasan, Jayanthi Mustafa, Mohd Shafie Caliman, Jean Pierre With the complexity of Near Infrared (NIR) spectral data, the selection of the optimal number of Partial Least Squares (PLS) components in the fitted Partial Least Squares Regression (PLSR) model is very important. Selecting a small number of PLS components leads to under fitting, whereas selecting a large number of PLS components results in over fitting. Several methods exist in the selection procedure, and each yields a different result. However, so far no one has been able to determine the more superior method. In addition, the current methods are susceptible to the presence of outliers and High Leverage Points (HLP) in a dataset. In this study, a new automated fitting process method on PLSR model is introduced. The method is called the Robust Reliable Weighted Average—PLS (RRWA-PLS), and it is less sensitive to the optimum number of PLS components. The RRWA-PLS uses the weighted average strategy from multiple PLSR models generated by the different complexities of the PLS components. The method assigns robust procedures in the weighing schemes as an improvement to the existing Weighted Average—PLS (WA-PLS) method. The weighing schemes in the proposed method are resistant to outliers and HLP and thus, preserve the contribution of the most relevant variables in the fitted model. The evaluation was done by utilizing artificial data with the Monte Carlo simulation and NIR spectral data of oil palm (Elaeis guineensis Jacq.) fruit mesocarp. Based on the results, the method claims to have shown its superiority in the improvement of the weight and variable selection procedures in the WA-PLS. It is also resistant to the influence of outliers and HLP in the dataset. The RRWA-PLS method provides a promising robust solution for the automated fitting process in the PLSR model as unlike the classical PLS, it does not require the selection of an optimal number of PLS components. Multidisciplinary Digital Publishing Institute 2020 Article PeerReviewed text en http://psasir.upm.edu.my/id/eprint/87995/1/ABSTRACT.pdf Silalahi, Divo Dharma and Midi, Habshah and Arasan, Jayanthi and Mustafa, Mohd Shafie and Caliman, Jean Pierre (2020) Automated fitting process using robust reliable weighted average on near infrared spectral data analysis. Symmetry-Basel, 12 (12). pp. 1-27. ISSN 2073-8994 https://www.mdpi.com/2073-8994/12/12/2099 10.3390/sym12122099
spellingShingle Silalahi, Divo Dharma
Midi, Habshah
Arasan, Jayanthi
Mustafa, Mohd Shafie
Caliman, Jean Pierre
Automated fitting process using robust reliable weighted average on near infrared spectral data analysis
title Automated fitting process using robust reliable weighted average on near infrared spectral data analysis
title_full Automated fitting process using robust reliable weighted average on near infrared spectral data analysis
title_fullStr Automated fitting process using robust reliable weighted average on near infrared spectral data analysis
title_full_unstemmed Automated fitting process using robust reliable weighted average on near infrared spectral data analysis
title_short Automated fitting process using robust reliable weighted average on near infrared spectral data analysis
title_sort automated fitting process using robust reliable weighted average on near infrared spectral data analysis
url http://psasir.upm.edu.my/id/eprint/87995/1/ABSTRACT.pdf
work_keys_str_mv AT silalahidivodharma automatedfittingprocessusingrobustreliableweightedaverageonnearinfraredspectraldataanalysis
AT midihabshah automatedfittingprocessusingrobustreliableweightedaverageonnearinfraredspectraldataanalysis
AT arasanjayanthi automatedfittingprocessusingrobustreliableweightedaverageonnearinfraredspectraldataanalysis
AT mustafamohdshafie automatedfittingprocessusingrobustreliableweightedaverageonnearinfraredspectraldataanalysis
AT calimanjeanpierre automatedfittingprocessusingrobustreliableweightedaverageonnearinfraredspectraldataanalysis