Comparing artificial‐intelligence techniques with state‐of‐the‐art parametric prediction models for predicting soybean traits

Abstract Soybean [Glycine max (L.) Merr.] is a significant source of protein and oil and is also widely used as animal feed. Thus, developing lines that are superior in terms of yield, protein, and oil content is important to feed the ever‐growing population. As opposed to high‐cost phenotyping, gen...

Full description

Bibliographic Details
Main Authors: Susweta Ray, Diego Jarquin, Reka Howard
Format: Article
Language:English
Published: Wiley 2023-03-01
Series:The Plant Genome
Online Access:https://doi.org/10.1002/tpg2.20263
_version_ 1827993424558555136
author Susweta Ray
Diego Jarquin
Reka Howard
author_facet Susweta Ray
Diego Jarquin
Reka Howard
author_sort Susweta Ray
collection DOAJ
description Abstract Soybean [Glycine max (L.) Merr.] is a significant source of protein and oil and is also widely used as animal feed. Thus, developing lines that are superior in terms of yield, protein, and oil content is important to feed the ever‐growing population. As opposed to high‐cost phenotyping, genotyping is both cost and time efficient for breeders because evaluating new lines in different environments (location–year combinations) can be costly. Several genomic prediction (GP) methods have been developed to use the marker and environment data effectively to predict the yield or other relevant phenotypic traits of crops. Our study compares a conventional GP method (genomic best linear unbiased predictor [GBLUP]), a kernel method (Gaussian kernel [GK]), an artificial‐intelligence (AI) method (deep learning [DL]), and a hybrid method that corresponds to the emulation of a DL model using a kernel method (an arc‐cosine kernel [AK]) in terms of their prediction accuracies for predicting grain yield, oil, and protein using data from the soybean nested association mapping experiment (1,379 genotypes tested in six environments, all genotypes in all environments). The relative performance of the four methods varied with the response variable and whether the model includes the genotype × environmental interaction (G×E) effects or not. The GBLUP consistently showed better performances, whereas GK and AK followed a similar pattern to GBLUP and DL performed slightly worse than the other three methods in most of the cases; however, this may also be attributed to suboptimal hyperparameters. The DL method performed particularly worse than the other three methods in presence of the G×E effects.
first_indexed 2024-04-10T04:25:48Z
format Article
id doaj.art-20eb285bc227477aa40f236c61d762b2
institution Directory Open Access Journal
issn 1940-3372
language English
last_indexed 2024-04-10T04:25:48Z
publishDate 2023-03-01
publisher Wiley
record_format Article
series The Plant Genome
spelling doaj.art-20eb285bc227477aa40f236c61d762b22023-03-10T14:45:44ZengWileyThe Plant Genome1940-33722023-03-01161n/an/a10.1002/tpg2.20263Comparing artificial‐intelligence techniques with state‐of‐the‐art parametric prediction models for predicting soybean traitsSusweta Ray0Diego Jarquin1Reka Howard2Dep. of Statistics Univ. of Nebraska–Lincoln Lincoln NE 68583 USADep. of AgronomyUniv. of Florida GainesvilleFL32611USADep. of Statistics Univ. of Nebraska–Lincoln Lincoln NE 68583 USAAbstract Soybean [Glycine max (L.) Merr.] is a significant source of protein and oil and is also widely used as animal feed. Thus, developing lines that are superior in terms of yield, protein, and oil content is important to feed the ever‐growing population. As opposed to high‐cost phenotyping, genotyping is both cost and time efficient for breeders because evaluating new lines in different environments (location–year combinations) can be costly. Several genomic prediction (GP) methods have been developed to use the marker and environment data effectively to predict the yield or other relevant phenotypic traits of crops. Our study compares a conventional GP method (genomic best linear unbiased predictor [GBLUP]), a kernel method (Gaussian kernel [GK]), an artificial‐intelligence (AI) method (deep learning [DL]), and a hybrid method that corresponds to the emulation of a DL model using a kernel method (an arc‐cosine kernel [AK]) in terms of their prediction accuracies for predicting grain yield, oil, and protein using data from the soybean nested association mapping experiment (1,379 genotypes tested in six environments, all genotypes in all environments). The relative performance of the four methods varied with the response variable and whether the model includes the genotype × environmental interaction (G×E) effects or not. The GBLUP consistently showed better performances, whereas GK and AK followed a similar pattern to GBLUP and DL performed slightly worse than the other three methods in most of the cases; however, this may also be attributed to suboptimal hyperparameters. The DL method performed particularly worse than the other three methods in presence of the G×E effects.https://doi.org/10.1002/tpg2.20263
spellingShingle Susweta Ray
Diego Jarquin
Reka Howard
Comparing artificial‐intelligence techniques with state‐of‐the‐art parametric prediction models for predicting soybean traits
The Plant Genome
title Comparing artificial‐intelligence techniques with state‐of‐the‐art parametric prediction models for predicting soybean traits
title_full Comparing artificial‐intelligence techniques with state‐of‐the‐art parametric prediction models for predicting soybean traits
title_fullStr Comparing artificial‐intelligence techniques with state‐of‐the‐art parametric prediction models for predicting soybean traits
title_full_unstemmed Comparing artificial‐intelligence techniques with state‐of‐the‐art parametric prediction models for predicting soybean traits
title_short Comparing artificial‐intelligence techniques with state‐of‐the‐art parametric prediction models for predicting soybean traits
title_sort comparing artificial intelligence techniques with state of the art parametric prediction models for predicting soybean traits
url https://doi.org/10.1002/tpg2.20263
work_keys_str_mv AT suswetaray comparingartificialintelligencetechniqueswithstateoftheartparametricpredictionmodelsforpredictingsoybeantraits
AT diegojarquin comparingartificialintelligencetechniqueswithstateoftheartparametricpredictionmodelsforpredictingsoybeantraits
AT rekahoward comparingartificialintelligencetechniqueswithstateoftheartparametricpredictionmodelsforpredictingsoybeantraits