Non-destructive prediction of protein contents of soybean seeds using near-infrared hyperspectral imaging

Protein content is one of the most crucial factors in soybean quality. However, the breeding procedure necessitates the time-consuming and costly selection of elite genotypes from many experimental lines in a destructive manner. The present work aims to predict protein content in single soybean seed...

Full description

Bibliographic Details
Main Authors: Aulia, Rizkiana, Kim, Yena, Zuhrotul Amanah, Hanim, Muhammad Akbar Andi, Arief, Kim, Haeun, Kim, Hangi, Lee, Wang-Hee, Kim, Kyung-Hwan, Baek, Jeong-Ho, Cho, Byoung-Kwan
Format: Article
Language:English
Published: Elsevier B.V. 2022
Subjects:
Online Access:https://repository.ugm.ac.id/282850/1/56_Non%20destructive%20prediction.pdf
Description
Summary:Protein content is one of the most crucial factors in soybean quality. However, the breeding procedure necessitates the time-consuming and costly selection of elite genotypes from many experimental lines in a destructive manner. The present work aims to predict protein content in single soybean seeds non-destructively using Near-Infrared (NIR) Hyperspectral Imaging (HSI). 1491 seed samples from 3 varieties of the low, medium, and high protein content (consisting of 371, 560, and 560 samples, respectively) were measured using the NIR-HSI system with a range of 900–1800 nm. The spectral data extracted from the HSI 3D hypercube were synchronised to the reference values examined from chemical analysis. The calibration model was constructed using partial least square regression (PLSR) methods based on the 70% spectral data and then validated using the remaining 30% of data. The result showed that the NIR-HSI technique is a promising method to predict protein content in soybean seeds, as shown by an R2 of 0.92 and a root mean square error (RMSE) of 1.08% . In addition, the chemical images visualised the distribution of protein content for the multiple soybean seed showed the possibility of the developed technique for the use of rapid evaluation of massive samples in the processing line.