Using whole-genome sequence data to predict quantitative trait phenotypes in Drosophila melanogaster.

Predicting organismal phenotypes from genotype data is important for plant and animal breeding, medicine, and evolutionary biology. Genomic-based phenotype prediction has been applied for single-nucleotide polymorphism (SNP) genotyping platforms, but not using complete genome sequences. Here, we rep...

Full description

Bibliographic Details
Main Authors: Ulrike Ober, Julien F Ayroles, Eric A Stone, Stephen Richards, Dianhui Zhu, Richard A Gibbs, Christian Stricker, Daniel Gianola, Martin Schlather, Trudy F C Mackay, Henner Simianer
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2012-01-01
Series:PLoS Genetics
Online Access:http://europepmc.org/articles/PMC3342952?pdf=render
_version_ 1818337041705861120
author Ulrike Ober
Julien F Ayroles
Eric A Stone
Stephen Richards
Dianhui Zhu
Richard A Gibbs
Christian Stricker
Daniel Gianola
Martin Schlather
Trudy F C Mackay
Henner Simianer
author_facet Ulrike Ober
Julien F Ayroles
Eric A Stone
Stephen Richards
Dianhui Zhu
Richard A Gibbs
Christian Stricker
Daniel Gianola
Martin Schlather
Trudy F C Mackay
Henner Simianer
author_sort Ulrike Ober
collection DOAJ
description Predicting organismal phenotypes from genotype data is important for plant and animal breeding, medicine, and evolutionary biology. Genomic-based phenotype prediction has been applied for single-nucleotide polymorphism (SNP) genotyping platforms, but not using complete genome sequences. Here, we report genomic prediction for starvation stress resistance and startle response in Drosophila melanogaster, using ∼2.5 million SNPs determined by sequencing the Drosophila Genetic Reference Panel population of inbred lines. We constructed a genomic relationship matrix from the SNP data and used it in a genomic best linear unbiased prediction (GBLUP) model. We assessed predictive ability as the correlation between predicted genetic values and observed phenotypes by cross-validation, and found a predictive ability of 0.239±0.008 (0.230±0.012) for starvation resistance (startle response). The predictive ability of BayesB, a Bayesian method with internal SNP selection, was not greater than GBLUP. Selection of the 5% SNPs with either the highest absolute effect or variance explained did not improve predictive ability. Predictive ability decreased only when fewer than 150,000 SNPs were used to construct the genomic relationship matrix. We hypothesize that predictive power in this population stems from the SNP-based modeling of the subtle relationship structure caused by long-range linkage disequilibrium and not from population structure or SNPs in linkage disequilibrium with causal variants. We discuss the implications of these results for genomic prediction in other organisms.
first_indexed 2024-12-13T14:48:55Z
format Article
id doaj.art-40f685790bf9479e9ec59164d94d0b36
institution Directory Open Access Journal
issn 1553-7390
1553-7404
language English
last_indexed 2024-12-13T14:48:55Z
publishDate 2012-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS Genetics
spelling doaj.art-40f685790bf9479e9ec59164d94d0b362022-12-21T23:41:23ZengPublic Library of Science (PLoS)PLoS Genetics1553-73901553-74042012-01-0185e100268510.1371/journal.pgen.1002685Using whole-genome sequence data to predict quantitative trait phenotypes in Drosophila melanogaster.Ulrike OberJulien F AyrolesEric A StoneStephen RichardsDianhui ZhuRichard A GibbsChristian StrickerDaniel GianolaMartin SchlatherTrudy F C MackayHenner SimianerPredicting organismal phenotypes from genotype data is important for plant and animal breeding, medicine, and evolutionary biology. Genomic-based phenotype prediction has been applied for single-nucleotide polymorphism (SNP) genotyping platforms, but not using complete genome sequences. Here, we report genomic prediction for starvation stress resistance and startle response in Drosophila melanogaster, using ∼2.5 million SNPs determined by sequencing the Drosophila Genetic Reference Panel population of inbred lines. We constructed a genomic relationship matrix from the SNP data and used it in a genomic best linear unbiased prediction (GBLUP) model. We assessed predictive ability as the correlation between predicted genetic values and observed phenotypes by cross-validation, and found a predictive ability of 0.239±0.008 (0.230±0.012) for starvation resistance (startle response). The predictive ability of BayesB, a Bayesian method with internal SNP selection, was not greater than GBLUP. Selection of the 5% SNPs with either the highest absolute effect or variance explained did not improve predictive ability. Predictive ability decreased only when fewer than 150,000 SNPs were used to construct the genomic relationship matrix. We hypothesize that predictive power in this population stems from the SNP-based modeling of the subtle relationship structure caused by long-range linkage disequilibrium and not from population structure or SNPs in linkage disequilibrium with causal variants. We discuss the implications of these results for genomic prediction in other organisms.http://europepmc.org/articles/PMC3342952?pdf=render
spellingShingle Ulrike Ober
Julien F Ayroles
Eric A Stone
Stephen Richards
Dianhui Zhu
Richard A Gibbs
Christian Stricker
Daniel Gianola
Martin Schlather
Trudy F C Mackay
Henner Simianer
Using whole-genome sequence data to predict quantitative trait phenotypes in Drosophila melanogaster.
PLoS Genetics
title Using whole-genome sequence data to predict quantitative trait phenotypes in Drosophila melanogaster.
title_full Using whole-genome sequence data to predict quantitative trait phenotypes in Drosophila melanogaster.
title_fullStr Using whole-genome sequence data to predict quantitative trait phenotypes in Drosophila melanogaster.
title_full_unstemmed Using whole-genome sequence data to predict quantitative trait phenotypes in Drosophila melanogaster.
title_short Using whole-genome sequence data to predict quantitative trait phenotypes in Drosophila melanogaster.
title_sort using whole genome sequence data to predict quantitative trait phenotypes in drosophila melanogaster
url http://europepmc.org/articles/PMC3342952?pdf=render
work_keys_str_mv AT ulrikeober usingwholegenomesequencedatatopredictquantitativetraitphenotypesindrosophilamelanogaster
AT julienfayroles usingwholegenomesequencedatatopredictquantitativetraitphenotypesindrosophilamelanogaster
AT ericastone usingwholegenomesequencedatatopredictquantitativetraitphenotypesindrosophilamelanogaster
AT stephenrichards usingwholegenomesequencedatatopredictquantitativetraitphenotypesindrosophilamelanogaster
AT dianhuizhu usingwholegenomesequencedatatopredictquantitativetraitphenotypesindrosophilamelanogaster
AT richardagibbs usingwholegenomesequencedatatopredictquantitativetraitphenotypesindrosophilamelanogaster
AT christianstricker usingwholegenomesequencedatatopredictquantitativetraitphenotypesindrosophilamelanogaster
AT danielgianola usingwholegenomesequencedatatopredictquantitativetraitphenotypesindrosophilamelanogaster
AT martinschlather usingwholegenomesequencedatatopredictquantitativetraitphenotypesindrosophilamelanogaster
AT trudyfcmackay usingwholegenomesequencedatatopredictquantitativetraitphenotypesindrosophilamelanogaster
AT hennersimianer usingwholegenomesequencedatatopredictquantitativetraitphenotypesindrosophilamelanogaster