Genomic heritability: what is it?

Whole-genome regression methods are being increasingly used for the analysis and prediction of complex traits and diseases. In human genetics, these methods are commonly used for inferences about genetic parameters, such as the amount of genetic variance among individuals or the proportion of phenot...

Full description

Bibliographic Details
Main Authors: Gustavo de Los Campos, Daniel Sorensen, Daniel Gianola
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2015-05-01
Series:PLoS Genetics
Online Access:http://europepmc.org/articles/PMC4420472?pdf=render
_version_ 1828878304201932800
author Gustavo de Los Campos
Daniel Sorensen
Daniel Gianola
author_facet Gustavo de Los Campos
Daniel Sorensen
Daniel Gianola
author_sort Gustavo de Los Campos
collection DOAJ
description Whole-genome regression methods are being increasingly used for the analysis and prediction of complex traits and diseases. In human genetics, these methods are commonly used for inferences about genetic parameters, such as the amount of genetic variance among individuals or the proportion of phenotypic variance that can be explained by regression on molecular markers. This is so even though some of the assumptions commonly adopted for data analysis are at odds with important quantitative genetic concepts. In this article we develop theory that leads to a precise definition of parameters arising in high dimensional genomic regressions; we focus on the so-called genomic heritability: the proportion of variance of a trait that can be explained (in the population) by a linear regression on a set of markers. We propose a definition of this parameter that is framed within the classical quantitative genetics theory and show that the genomic heritability and the trait heritability parameters are equal only when all causal variants are typed. Further, we discuss how the genomic variance and genomic heritability, defined as quantitative genetic parameters, relate to parameters of statistical models commonly used for inferences, and indicate potential inferential problems that are assessed further using simulations. When a large proportion of the markers used in the analysis are in LE with QTL the likelihood function can be misspecified. This can induce a sizable finite-sample bias and, possibly, lack of consistency of likelihood (or Bayesian) estimates. This situation can be encountered if the individuals in the sample are distantly related and linkage disequilibrium spans over short regions. This bias does not negate the use of whole-genome regression models as predictive machines; however, our results indicate that caution is needed when using marker-based regressions for inferences about population parameters such as the genomic heritability.
first_indexed 2024-12-13T09:02:55Z
format Article
id doaj.art-3dd3586e135048bf8c0822224a199ee1
institution Directory Open Access Journal
issn 1553-7390
1553-7404
language English
last_indexed 2024-12-13T09:02:55Z
publishDate 2015-05-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS Genetics
spelling doaj.art-3dd3586e135048bf8c0822224a199ee12022-12-21T23:53:07ZengPublic Library of Science (PLoS)PLoS Genetics1553-73901553-74042015-05-01115e100504810.1371/journal.pgen.1005048Genomic heritability: what is it?Gustavo de Los CamposDaniel SorensenDaniel GianolaWhole-genome regression methods are being increasingly used for the analysis and prediction of complex traits and diseases. In human genetics, these methods are commonly used for inferences about genetic parameters, such as the amount of genetic variance among individuals or the proportion of phenotypic variance that can be explained by regression on molecular markers. This is so even though some of the assumptions commonly adopted for data analysis are at odds with important quantitative genetic concepts. In this article we develop theory that leads to a precise definition of parameters arising in high dimensional genomic regressions; we focus on the so-called genomic heritability: the proportion of variance of a trait that can be explained (in the population) by a linear regression on a set of markers. We propose a definition of this parameter that is framed within the classical quantitative genetics theory and show that the genomic heritability and the trait heritability parameters are equal only when all causal variants are typed. Further, we discuss how the genomic variance and genomic heritability, defined as quantitative genetic parameters, relate to parameters of statistical models commonly used for inferences, and indicate potential inferential problems that are assessed further using simulations. When a large proportion of the markers used in the analysis are in LE with QTL the likelihood function can be misspecified. This can induce a sizable finite-sample bias and, possibly, lack of consistency of likelihood (or Bayesian) estimates. This situation can be encountered if the individuals in the sample are distantly related and linkage disequilibrium spans over short regions. This bias does not negate the use of whole-genome regression models as predictive machines; however, our results indicate that caution is needed when using marker-based regressions for inferences about population parameters such as the genomic heritability.http://europepmc.org/articles/PMC4420472?pdf=render
spellingShingle Gustavo de Los Campos
Daniel Sorensen
Daniel Gianola
Genomic heritability: what is it?
PLoS Genetics
title Genomic heritability: what is it?
title_full Genomic heritability: what is it?
title_fullStr Genomic heritability: what is it?
title_full_unstemmed Genomic heritability: what is it?
title_short Genomic heritability: what is it?
title_sort genomic heritability what is it
url http://europepmc.org/articles/PMC4420472?pdf=render
work_keys_str_mv AT gustavodeloscampos genomicheritabilitywhatisit
AT danielsorensen genomicheritabilitywhatisit
AT danielgianola genomicheritabilitywhatisit