Modeling heterogeneous (co)variances from adjacent-SNP groups improves genomic prediction for milk protein composition traits

Abstract Background Accurate genomic prediction requires a large reference population, which is problematic for traits that are expensive to measure. Traits related to milk protein composition are not routinely recorded due to costly procedures and are considered to be controlled by a few quantitati...

Full description

Bibliographic Details
Main Authors: Grum Gebreyesus, Mogens S. Lund, Bart Buitenhuis, Henk Bovenhuis, Nina A. Poulsen, Luc G. Janss
Format: Article
Language:deu
Published: BMC 2017-12-01
Series:Genetics Selection Evolution
Online Access:http://link.springer.com/article/10.1186/s12711-017-0364-8
_version_ 1828357254019022848
author Grum Gebreyesus
Mogens S. Lund
Bart Buitenhuis
Henk Bovenhuis
Nina A. Poulsen
Luc G. Janss
author_facet Grum Gebreyesus
Mogens S. Lund
Bart Buitenhuis
Henk Bovenhuis
Nina A. Poulsen
Luc G. Janss
author_sort Grum Gebreyesus
collection DOAJ
description Abstract Background Accurate genomic prediction requires a large reference population, which is problematic for traits that are expensive to measure. Traits related to milk protein composition are not routinely recorded due to costly procedures and are considered to be controlled by a few quantitative trait loci of large effect. The amount of variation explained may vary between regions leading to heterogeneous (co)variance patterns across the genome. Genomic prediction models that can efficiently take such heterogeneity of (co)variances into account can result in improved prediction reliability. In this study, we developed and implemented novel univariate and bivariate Bayesian prediction models, based on estimates of heterogeneous (co)variances for genome segments (BayesAS). Available data consisted of milk protein composition traits measured on cows and de-regressed proofs of total protein yield derived for bulls. Single-nucleotide polymorphisms (SNPs), from 50K SNP arrays, were grouped into non-overlapping genome segments. A segment was defined as one SNP, or a group of 50, 100, or 200 adjacent SNPs, or one chromosome, or the whole genome. Traditional univariate and bivariate genomic best linear unbiased prediction (GBLUP) models were also run for comparison. Reliabilities were calculated through a resampling strategy and using deterministic formula. Results BayesAS models improved prediction reliability for most of the traits compared to GBLUP models and this gain depended on segment size and genetic architecture of the traits. The gain in prediction reliability was especially marked for the protein composition traits β-CN, κ-CN and β-LG, for which prediction reliabilities were improved by 49 percentage points on average using the MT-BayesAS model with a 100-SNP segment size compared to the bivariate GBLUP. Prediction reliabilities were highest with the BayesAS model that uses a 100-SNP segment size. The bivariate versions of our BayesAS models resulted in extra gains of up to 6% in prediction reliability compared to the univariate versions. Conclusions Substantial improvement in prediction reliability was possible for most of the traits related to milk protein composition using our novel BayesAS models. Grouping adjacent SNPs into segments provided enhanced information to estimate parameters and allowing the segments to have different (co)variances helped disentangle heterogeneous (co)variances across the genome.
first_indexed 2024-04-14T03:13:22Z
format Article
id doaj.art-1cd5c22224524f08940efd34c1a76141
institution Directory Open Access Journal
issn 1297-9686
language deu
last_indexed 2024-04-14T03:13:22Z
publishDate 2017-12-01
publisher BMC
record_format Article
series Genetics Selection Evolution
spelling doaj.art-1cd5c22224524f08940efd34c1a761412022-12-22T02:15:33ZdeuBMCGenetics Selection Evolution1297-96862017-12-0149111310.1186/s12711-017-0364-8Modeling heterogeneous (co)variances from adjacent-SNP groups improves genomic prediction for milk protein composition traitsGrum Gebreyesus0Mogens S. Lund1Bart Buitenhuis2Henk Bovenhuis3Nina A. Poulsen4Luc G. Janss5Department of Molecular Biology and Genetics, Center for Quantitative Genetics and Genomics, Aarhus UniversityDepartment of Molecular Biology and Genetics, Center for Quantitative Genetics and Genomics, Aarhus UniversityDepartment of Molecular Biology and Genetics, Center for Quantitative Genetics and Genomics, Aarhus UniversityAnimal Breeding and Genomics Centre, Wageningen UniversityDepartment of Food Science, Aarhus UniversityDepartment of Molecular Biology and Genetics, Center for Quantitative Genetics and Genomics, Aarhus UniversityAbstract Background Accurate genomic prediction requires a large reference population, which is problematic for traits that are expensive to measure. Traits related to milk protein composition are not routinely recorded due to costly procedures and are considered to be controlled by a few quantitative trait loci of large effect. The amount of variation explained may vary between regions leading to heterogeneous (co)variance patterns across the genome. Genomic prediction models that can efficiently take such heterogeneity of (co)variances into account can result in improved prediction reliability. In this study, we developed and implemented novel univariate and bivariate Bayesian prediction models, based on estimates of heterogeneous (co)variances for genome segments (BayesAS). Available data consisted of milk protein composition traits measured on cows and de-regressed proofs of total protein yield derived for bulls. Single-nucleotide polymorphisms (SNPs), from 50K SNP arrays, were grouped into non-overlapping genome segments. A segment was defined as one SNP, or a group of 50, 100, or 200 adjacent SNPs, or one chromosome, or the whole genome. Traditional univariate and bivariate genomic best linear unbiased prediction (GBLUP) models were also run for comparison. Reliabilities were calculated through a resampling strategy and using deterministic formula. Results BayesAS models improved prediction reliability for most of the traits compared to GBLUP models and this gain depended on segment size and genetic architecture of the traits. The gain in prediction reliability was especially marked for the protein composition traits β-CN, κ-CN and β-LG, for which prediction reliabilities were improved by 49 percentage points on average using the MT-BayesAS model with a 100-SNP segment size compared to the bivariate GBLUP. Prediction reliabilities were highest with the BayesAS model that uses a 100-SNP segment size. The bivariate versions of our BayesAS models resulted in extra gains of up to 6% in prediction reliability compared to the univariate versions. Conclusions Substantial improvement in prediction reliability was possible for most of the traits related to milk protein composition using our novel BayesAS models. Grouping adjacent SNPs into segments provided enhanced information to estimate parameters and allowing the segments to have different (co)variances helped disentangle heterogeneous (co)variances across the genome.http://link.springer.com/article/10.1186/s12711-017-0364-8
spellingShingle Grum Gebreyesus
Mogens S. Lund
Bart Buitenhuis
Henk Bovenhuis
Nina A. Poulsen
Luc G. Janss
Modeling heterogeneous (co)variances from adjacent-SNP groups improves genomic prediction for milk protein composition traits
Genetics Selection Evolution
title Modeling heterogeneous (co)variances from adjacent-SNP groups improves genomic prediction for milk protein composition traits
title_full Modeling heterogeneous (co)variances from adjacent-SNP groups improves genomic prediction for milk protein composition traits
title_fullStr Modeling heterogeneous (co)variances from adjacent-SNP groups improves genomic prediction for milk protein composition traits
title_full_unstemmed Modeling heterogeneous (co)variances from adjacent-SNP groups improves genomic prediction for milk protein composition traits
title_short Modeling heterogeneous (co)variances from adjacent-SNP groups improves genomic prediction for milk protein composition traits
title_sort modeling heterogeneous co variances from adjacent snp groups improves genomic prediction for milk protein composition traits
url http://link.springer.com/article/10.1186/s12711-017-0364-8
work_keys_str_mv AT grumgebreyesus modelingheterogeneouscovariancesfromadjacentsnpgroupsimprovesgenomicpredictionformilkproteincompositiontraits
AT mogensslund modelingheterogeneouscovariancesfromadjacentsnpgroupsimprovesgenomicpredictionformilkproteincompositiontraits
AT bartbuitenhuis modelingheterogeneouscovariancesfromadjacentsnpgroupsimprovesgenomicpredictionformilkproteincompositiontraits
AT henkbovenhuis modelingheterogeneouscovariancesfromadjacentsnpgroupsimprovesgenomicpredictionformilkproteincompositiontraits
AT ninaapoulsen modelingheterogeneouscovariancesfromadjacentsnpgroupsimprovesgenomicpredictionformilkproteincompositiontraits
AT lucgjanss modelingheterogeneouscovariancesfromadjacentsnpgroupsimprovesgenomicpredictionformilkproteincompositiontraits