Meta-analysis for milk fat and protein percentage using imputed sequence variant genotypes in 94,321 cattle from eight cattle breeds

Abstract Background Sequence-based genome-wide association studies (GWAS) provide high statistical power to identify candidate causal mutations when a large number of individuals with both sequence variant genotypes and phenotypes is available. A meta-analysis combines summary statistics from multip...

Full description

Bibliographic Details
Main Authors: Irene van den Berg, Ruidong Xiang, Janez Jenko, Hubert Pausch, Mekki Boussaha, Chris Schrooten, Thierry Tribout, Arne B. Gjuvsland, Didier Boichard, Øyvind Nordbø, Marie-Pierre Sanchez, Mike E. Goddard
Format: Article
Language:deu
Published: BMC 2020-07-01
Series:Genetics Selection Evolution
Online Access:http://link.springer.com/article/10.1186/s12711-020-00556-4
_version_ 1819039293559013376
author Irene van den Berg
Ruidong Xiang
Janez Jenko
Hubert Pausch
Mekki Boussaha
Chris Schrooten
Thierry Tribout
Arne B. Gjuvsland
Didier Boichard
Øyvind Nordbø
Marie-Pierre Sanchez
Mike E. Goddard
author_facet Irene van den Berg
Ruidong Xiang
Janez Jenko
Hubert Pausch
Mekki Boussaha
Chris Schrooten
Thierry Tribout
Arne B. Gjuvsland
Didier Boichard
Øyvind Nordbø
Marie-Pierre Sanchez
Mike E. Goddard
author_sort Irene van den Berg
collection DOAJ
description Abstract Background Sequence-based genome-wide association studies (GWAS) provide high statistical power to identify candidate causal mutations when a large number of individuals with both sequence variant genotypes and phenotypes is available. A meta-analysis combines summary statistics from multiple GWAS and increases the power to detect trait-associated variants without requiring access to data at the individual level of the GWAS mapping cohorts. Because linkage disequilibrium between adjacent markers is conserved only over short distances across breeds, a multi-breed meta-analysis can improve mapping precision. Results To maximise the power to identify quantitative trait loci (QTL), we combined the results of nine within-population GWAS that used imputed sequence variant genotypes of 94,321 cattle from eight breeds, to perform a large-scale meta-analysis for fat and protein percentage in cattle. The meta-analysis detected (p ≤ 10−8) 138 QTL for fat percentage and 176 QTL for protein percentage. This was more than the number of QTL detected in all within-population GWAS together (124 QTL for fat percentage and 104 QTL for protein percentage). Among all the lead variants, 100 QTL for fat percentage and 114 QTL for protein percentage had the same direction of effect in all within-population GWAS. This indicates either persistence of the linkage phase between the causal variant and the lead variant across breeds or that some of the lead variants might indeed be causal or tightly linked with causal variants. The percentage of intergenic variants was substantially lower for significant variants than for non-significant variants, and significant variants had mostly moderate to high minor allele frequencies. Significant variants were also clustered in genes that are known to be relevant for fat and protein percentages in milk. Conclusions Our study identified a large number of QTL associated with fat and protein percentage in dairy cattle. We demonstrated that large-scale multi-breed meta-analysis reveals more QTL at the nucleotide resolution than within-population GWAS. Significant variants were more often located in genic regions than non-significant variants and a large part of them was located in potentially regulatory regions.
first_indexed 2024-12-21T08:50:54Z
format Article
id doaj.art-8fc569f86b0e419b804acf71d07792d8
institution Directory Open Access Journal
issn 1297-9686
language deu
last_indexed 2024-12-21T08:50:54Z
publishDate 2020-07-01
publisher BMC
record_format Article
series Genetics Selection Evolution
spelling doaj.art-8fc569f86b0e419b804acf71d07792d82022-12-21T19:09:42ZdeuBMCGenetics Selection Evolution1297-96862020-07-0152111610.1186/s12711-020-00556-4Meta-analysis for milk fat and protein percentage using imputed sequence variant genotypes in 94,321 cattle from eight cattle breedsIrene van den Berg0Ruidong Xiang1Janez Jenko2Hubert Pausch3Mekki Boussaha4Chris Schrooten5Thierry Tribout6Arne B. Gjuvsland7Didier Boichard8Øyvind Nordbø9Marie-Pierre Sanchez10Mike E. Goddard11Agriculture Victoria Research, AgriBioAgriculture Victoria Research, AgriBioGENO SAAnimal Genomics, ETH ZurichUniversité Paris-Saclay, INRAE, AgroParisTech, GABICRVUniversité Paris-Saclay, INRAE, AgroParisTech, GABIGENO SAUniversité Paris-Saclay, INRAE, AgroParisTech, GABIGENO SAUniversité Paris-Saclay, INRAE, AgroParisTech, GABIAgriculture Victoria Research, AgriBioAbstract Background Sequence-based genome-wide association studies (GWAS) provide high statistical power to identify candidate causal mutations when a large number of individuals with both sequence variant genotypes and phenotypes is available. A meta-analysis combines summary statistics from multiple GWAS and increases the power to detect trait-associated variants without requiring access to data at the individual level of the GWAS mapping cohorts. Because linkage disequilibrium between adjacent markers is conserved only over short distances across breeds, a multi-breed meta-analysis can improve mapping precision. Results To maximise the power to identify quantitative trait loci (QTL), we combined the results of nine within-population GWAS that used imputed sequence variant genotypes of 94,321 cattle from eight breeds, to perform a large-scale meta-analysis for fat and protein percentage in cattle. The meta-analysis detected (p ≤ 10−8) 138 QTL for fat percentage and 176 QTL for protein percentage. This was more than the number of QTL detected in all within-population GWAS together (124 QTL for fat percentage and 104 QTL for protein percentage). Among all the lead variants, 100 QTL for fat percentage and 114 QTL for protein percentage had the same direction of effect in all within-population GWAS. This indicates either persistence of the linkage phase between the causal variant and the lead variant across breeds or that some of the lead variants might indeed be causal or tightly linked with causal variants. The percentage of intergenic variants was substantially lower for significant variants than for non-significant variants, and significant variants had mostly moderate to high minor allele frequencies. Significant variants were also clustered in genes that are known to be relevant for fat and protein percentages in milk. Conclusions Our study identified a large number of QTL associated with fat and protein percentage in dairy cattle. We demonstrated that large-scale multi-breed meta-analysis reveals more QTL at the nucleotide resolution than within-population GWAS. Significant variants were more often located in genic regions than non-significant variants and a large part of them was located in potentially regulatory regions.http://link.springer.com/article/10.1186/s12711-020-00556-4
spellingShingle Irene van den Berg
Ruidong Xiang
Janez Jenko
Hubert Pausch
Mekki Boussaha
Chris Schrooten
Thierry Tribout
Arne B. Gjuvsland
Didier Boichard
Øyvind Nordbø
Marie-Pierre Sanchez
Mike E. Goddard
Meta-analysis for milk fat and protein percentage using imputed sequence variant genotypes in 94,321 cattle from eight cattle breeds
Genetics Selection Evolution
title Meta-analysis for milk fat and protein percentage using imputed sequence variant genotypes in 94,321 cattle from eight cattle breeds
title_full Meta-analysis for milk fat and protein percentage using imputed sequence variant genotypes in 94,321 cattle from eight cattle breeds
title_fullStr Meta-analysis for milk fat and protein percentage using imputed sequence variant genotypes in 94,321 cattle from eight cattle breeds
title_full_unstemmed Meta-analysis for milk fat and protein percentage using imputed sequence variant genotypes in 94,321 cattle from eight cattle breeds
title_short Meta-analysis for milk fat and protein percentage using imputed sequence variant genotypes in 94,321 cattle from eight cattle breeds
title_sort meta analysis for milk fat and protein percentage using imputed sequence variant genotypes in 94 321 cattle from eight cattle breeds
url http://link.springer.com/article/10.1186/s12711-020-00556-4
work_keys_str_mv AT irenevandenberg metaanalysisformilkfatandproteinpercentageusingimputedsequencevariantgenotypesin94321cattlefromeightcattlebreeds
AT ruidongxiang metaanalysisformilkfatandproteinpercentageusingimputedsequencevariantgenotypesin94321cattlefromeightcattlebreeds
AT janezjenko metaanalysisformilkfatandproteinpercentageusingimputedsequencevariantgenotypesin94321cattlefromeightcattlebreeds
AT hubertpausch metaanalysisformilkfatandproteinpercentageusingimputedsequencevariantgenotypesin94321cattlefromeightcattlebreeds
AT mekkiboussaha metaanalysisformilkfatandproteinpercentageusingimputedsequencevariantgenotypesin94321cattlefromeightcattlebreeds
AT chrisschrooten metaanalysisformilkfatandproteinpercentageusingimputedsequencevariantgenotypesin94321cattlefromeightcattlebreeds
AT thierrytribout metaanalysisformilkfatandproteinpercentageusingimputedsequencevariantgenotypesin94321cattlefromeightcattlebreeds
AT arnebgjuvsland metaanalysisformilkfatandproteinpercentageusingimputedsequencevariantgenotypesin94321cattlefromeightcattlebreeds
AT didierboichard metaanalysisformilkfatandproteinpercentageusingimputedsequencevariantgenotypesin94321cattlefromeightcattlebreeds
AT øyvindnordbø metaanalysisformilkfatandproteinpercentageusingimputedsequencevariantgenotypesin94321cattlefromeightcattlebreeds
AT mariepierresanchez metaanalysisformilkfatandproteinpercentageusingimputedsequencevariantgenotypesin94321cattlefromeightcattlebreeds
AT mikeegoddard metaanalysisformilkfatandproteinpercentageusingimputedsequencevariantgenotypesin94321cattlefromeightcattlebreeds