Soybean Yield Estimation and Its Components: A Linear Regression Approach

Soybean yield estimation is either based on yield monitors or agro-meteorological and satellite imagery data, but they present several limiting factors regarding on-farm decision level. Aware that machine learning approaches have been largely applied to estimate soybean yield and the availability of...

Full description

Bibliographic Details
Main Authors: Marcelo Chan Fu Wei, José Paulo Molin
Format: Article
Language:English
Published: MDPI AG 2020-08-01
Series:Agriculture
Subjects:
Online Access:https://www.mdpi.com/2077-0472/10/8/348
_version_ 1797558931940179968
author Marcelo Chan Fu Wei
José Paulo Molin
author_facet Marcelo Chan Fu Wei
José Paulo Molin
author_sort Marcelo Chan Fu Wei
collection DOAJ
description Soybean yield estimation is either based on yield monitors or agro-meteorological and satellite imagery data, but they present several limiting factors regarding on-farm decision level. Aware that machine learning approaches have been largely applied to estimate soybean yield and the availability of data regarding soybean yield and its components (number of grains (NG) and thousand grains weight (TGW)), there is an opportunity to study their relationships. The objective was to explore the relationships between soybean yield and its components, generate equations to estimate yield and evaluate its prediction accuracy. The training dataset was composed of soybean yield and its components’ data from 2010 to 2019. Linear regression models based on NG, TGW and yield were fitted on the training dataset and applied to a validation dataset composed of 58 on-field collected samples. It was found that globally TGW and NG presented weak (r = 0.50) and strong (r = 0.92) linear relationships with yield, respectively. In addition to that, applying the fitted models to the validation dataset, model based on NG presented the highest accuracy, coefficient of determination (R<sup>2</sup>) of 0.70, mean absolute error (MAE) of 639.99 kg ha<sup>−1</sup> and root mean squared error (RMSE) of 726.67 kg ha<sup>−1</sup>.
first_indexed 2024-03-10T17:38:26Z
format Article
id doaj.art-437e9c5aedb44f3bbaa9501b21e79858
institution Directory Open Access Journal
issn 2077-0472
language English
last_indexed 2024-03-10T17:38:26Z
publishDate 2020-08-01
publisher MDPI AG
record_format Article
series Agriculture
spelling doaj.art-437e9c5aedb44f3bbaa9501b21e798582023-11-20T09:46:25ZengMDPI AGAgriculture2077-04722020-08-0110834810.3390/agriculture10080348Soybean Yield Estimation and Its Components: A Linear Regression ApproachMarcelo Chan Fu Wei0José Paulo Molin1College of Agriculture “Luiz de Queiroz”, University of Sao Paulo, 11 Padua Dias Avenue, Piracicaba 13418-900, BrazilCollege of Agriculture “Luiz de Queiroz”, University of Sao Paulo, 11 Padua Dias Avenue, Piracicaba 13418-900, BrazilSoybean yield estimation is either based on yield monitors or agro-meteorological and satellite imagery data, but they present several limiting factors regarding on-farm decision level. Aware that machine learning approaches have been largely applied to estimate soybean yield and the availability of data regarding soybean yield and its components (number of grains (NG) and thousand grains weight (TGW)), there is an opportunity to study their relationships. The objective was to explore the relationships between soybean yield and its components, generate equations to estimate yield and evaluate its prediction accuracy. The training dataset was composed of soybean yield and its components’ data from 2010 to 2019. Linear regression models based on NG, TGW and yield were fitted on the training dataset and applied to a validation dataset composed of 58 on-field collected samples. It was found that globally TGW and NG presented weak (r = 0.50) and strong (r = 0.92) linear relationships with yield, respectively. In addition to that, applying the fitted models to the validation dataset, model based on NG presented the highest accuracy, coefficient of determination (R<sup>2</sup>) of 0.70, mean absolute error (MAE) of 639.99 kg ha<sup>−1</sup> and root mean squared error (RMSE) of 726.67 kg ha<sup>−1</sup>.https://www.mdpi.com/2077-0472/10/8/348hundred grains weightmachine learningnumber of grainsprecision agriculturethousand grains weight
spellingShingle Marcelo Chan Fu Wei
José Paulo Molin
Soybean Yield Estimation and Its Components: A Linear Regression Approach
Agriculture
hundred grains weight
machine learning
number of grains
precision agriculture
thousand grains weight
title Soybean Yield Estimation and Its Components: A Linear Regression Approach
title_full Soybean Yield Estimation and Its Components: A Linear Regression Approach
title_fullStr Soybean Yield Estimation and Its Components: A Linear Regression Approach
title_full_unstemmed Soybean Yield Estimation and Its Components: A Linear Regression Approach
title_short Soybean Yield Estimation and Its Components: A Linear Regression Approach
title_sort soybean yield estimation and its components a linear regression approach
topic hundred grains weight
machine learning
number of grains
precision agriculture
thousand grains weight
url https://www.mdpi.com/2077-0472/10/8/348
work_keys_str_mv AT marcelochanfuwei soybeanyieldestimationanditscomponentsalinearregressionapproach
AT josepaulomolin soybeanyieldestimationanditscomponentsalinearregressionapproach