Research on Apple Origins Classification Optimization Based on Least-Angle Regression in Instance Selection

Machine learning is used widely in near-infrared spectroscopy (NIRS) for fruit qualification. However, the directly split training set used contains redundant samples, and errors may be introduced into the model. Euclidean distance-based and K-nearest neighbor-based instance selection (IS) methods a...

Full description

Bibliographic Details
Main Authors: Bin Li, Yuqi Wang, Lisha Li, Yande Liu
Format: Article
Language:English
Published: MDPI AG 2023-09-01
Series:Agriculture
Subjects:
Online Access:https://www.mdpi.com/2077-0472/13/10/1868
_version_ 1797575111882047488
author Bin Li
Yuqi Wang
Lisha Li
Yande Liu
author_facet Bin Li
Yuqi Wang
Lisha Li
Yande Liu
author_sort Bin Li
collection DOAJ
description Machine learning is used widely in near-infrared spectroscopy (NIRS) for fruit qualification. However, the directly split training set used contains redundant samples, and errors may be introduced into the model. Euclidean distance-based and K-nearest neighbor-based instance selection (IS) methods are widely used to remove useless samples because of their accessibility. However, they either have high accuracy and low compression or vice versa. To compress the sample size while improving the accuracy, the least-angle regression (LAR) method was proposed for classification instance selection, and a discrimination experiment was conducted on a total of four origins of 952 apples. The sample sets were split into the raw training set and testing set; the optimal training samples were selected using the LAR-based instance selection (LARIS) method, and the four other selection methods were compared. The results showed that 26.9% of the raw training samples were selected using LARIS, and the model based on these training samples had the highest accuracy. Thus, the apple origin classification model based on LARIS can achieve the goal of high accuracy and compression and provide experimental support for the least-angle regression algorithm in classification instance selection.
first_indexed 2024-03-10T21:31:44Z
format Article
id doaj.art-ccbbe3f0d11e4bb39b96683437a9f6d9
institution Directory Open Access Journal
issn 2077-0472
language English
last_indexed 2024-03-10T21:31:44Z
publishDate 2023-09-01
publisher MDPI AG
record_format Article
series Agriculture
spelling doaj.art-ccbbe3f0d11e4bb39b96683437a9f6d92023-11-19T15:17:56ZengMDPI AGAgriculture2077-04722023-09-011310186810.3390/agriculture13101868Research on Apple Origins Classification Optimization Based on Least-Angle Regression in Instance SelectionBin Li0Yuqi Wang1Lisha Li2Yande Liu3School of Mechatronics and Vehicle Engineering, East China Jiaotong University, Nanchang 330013, ChinaSchool of Mechatronics and Vehicle Engineering, East China Jiaotong University, Nanchang 330013, ChinaSchool of Mechatronics and Vehicle Engineering, East China Jiaotong University, Nanchang 330013, ChinaSchool of Mechatronics and Vehicle Engineering, East China Jiaotong University, Nanchang 330013, ChinaMachine learning is used widely in near-infrared spectroscopy (NIRS) for fruit qualification. However, the directly split training set used contains redundant samples, and errors may be introduced into the model. Euclidean distance-based and K-nearest neighbor-based instance selection (IS) methods are widely used to remove useless samples because of their accessibility. However, they either have high accuracy and low compression or vice versa. To compress the sample size while improving the accuracy, the least-angle regression (LAR) method was proposed for classification instance selection, and a discrimination experiment was conducted on a total of four origins of 952 apples. The sample sets were split into the raw training set and testing set; the optimal training samples were selected using the LAR-based instance selection (LARIS) method, and the four other selection methods were compared. The results showed that 26.9% of the raw training samples were selected using LARIS, and the model based on these training samples had the highest accuracy. Thus, the apple origin classification model based on LARIS can achieve the goal of high accuracy and compression and provide experimental support for the least-angle regression algorithm in classification instance selection.https://www.mdpi.com/2077-0472/13/10/1868instance selectionleast-angle regressionclassificationSVMnear-infrared spectroscopy
spellingShingle Bin Li
Yuqi Wang
Lisha Li
Yande Liu
Research on Apple Origins Classification Optimization Based on Least-Angle Regression in Instance Selection
Agriculture
instance selection
least-angle regression
classification
SVM
near-infrared spectroscopy
title Research on Apple Origins Classification Optimization Based on Least-Angle Regression in Instance Selection
title_full Research on Apple Origins Classification Optimization Based on Least-Angle Regression in Instance Selection
title_fullStr Research on Apple Origins Classification Optimization Based on Least-Angle Regression in Instance Selection
title_full_unstemmed Research on Apple Origins Classification Optimization Based on Least-Angle Regression in Instance Selection
title_short Research on Apple Origins Classification Optimization Based on Least-Angle Regression in Instance Selection
title_sort research on apple origins classification optimization based on least angle regression in instance selection
topic instance selection
least-angle regression
classification
SVM
near-infrared spectroscopy
url https://www.mdpi.com/2077-0472/13/10/1868
work_keys_str_mv AT binli researchonappleoriginsclassificationoptimizationbasedonleastangleregressionininstanceselection
AT yuqiwang researchonappleoriginsclassificationoptimizationbasedonleastangleregressionininstanceselection
AT lishali researchonappleoriginsclassificationoptimizationbasedonleastangleregressionininstanceselection
AT yandeliu researchonappleoriginsclassificationoptimizationbasedonleastangleregressionininstanceselection