Using whole-genome sequence data to predict quantitative trait phenotypes in Drosophila melanogaster.
Predicting organismal phenotypes from genotype data is important for plant and animal breeding, medicine, and evolutionary biology. Genomic-based phenotype prediction has been applied for single-nucleotide polymorphism (SNP) genotyping platforms, but not using complete genome sequences. Here, we rep...
Main Authors: | , , , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2012-01-01
|
Series: | PLoS Genetics |
Online Access: | http://europepmc.org/articles/PMC3342952?pdf=render |
_version_ | 1818337041705861120 |
---|---|
author | Ulrike Ober Julien F Ayroles Eric A Stone Stephen Richards Dianhui Zhu Richard A Gibbs Christian Stricker Daniel Gianola Martin Schlather Trudy F C Mackay Henner Simianer |
author_facet | Ulrike Ober Julien F Ayroles Eric A Stone Stephen Richards Dianhui Zhu Richard A Gibbs Christian Stricker Daniel Gianola Martin Schlather Trudy F C Mackay Henner Simianer |
author_sort | Ulrike Ober |
collection | DOAJ |
description | Predicting organismal phenotypes from genotype data is important for plant and animal breeding, medicine, and evolutionary biology. Genomic-based phenotype prediction has been applied for single-nucleotide polymorphism (SNP) genotyping platforms, but not using complete genome sequences. Here, we report genomic prediction for starvation stress resistance and startle response in Drosophila melanogaster, using ∼2.5 million SNPs determined by sequencing the Drosophila Genetic Reference Panel population of inbred lines. We constructed a genomic relationship matrix from the SNP data and used it in a genomic best linear unbiased prediction (GBLUP) model. We assessed predictive ability as the correlation between predicted genetic values and observed phenotypes by cross-validation, and found a predictive ability of 0.239±0.008 (0.230±0.012) for starvation resistance (startle response). The predictive ability of BayesB, a Bayesian method with internal SNP selection, was not greater than GBLUP. Selection of the 5% SNPs with either the highest absolute effect or variance explained did not improve predictive ability. Predictive ability decreased only when fewer than 150,000 SNPs were used to construct the genomic relationship matrix. We hypothesize that predictive power in this population stems from the SNP-based modeling of the subtle relationship structure caused by long-range linkage disequilibrium and not from population structure or SNPs in linkage disequilibrium with causal variants. We discuss the implications of these results for genomic prediction in other organisms. |
first_indexed | 2024-12-13T14:48:55Z |
format | Article |
id | doaj.art-40f685790bf9479e9ec59164d94d0b36 |
institution | Directory Open Access Journal |
issn | 1553-7390 1553-7404 |
language | English |
last_indexed | 2024-12-13T14:48:55Z |
publishDate | 2012-01-01 |
publisher | Public Library of Science (PLoS) |
record_format | Article |
series | PLoS Genetics |
spelling | doaj.art-40f685790bf9479e9ec59164d94d0b362022-12-21T23:41:23ZengPublic Library of Science (PLoS)PLoS Genetics1553-73901553-74042012-01-0185e100268510.1371/journal.pgen.1002685Using whole-genome sequence data to predict quantitative trait phenotypes in Drosophila melanogaster.Ulrike OberJulien F AyrolesEric A StoneStephen RichardsDianhui ZhuRichard A GibbsChristian StrickerDaniel GianolaMartin SchlatherTrudy F C MackayHenner SimianerPredicting organismal phenotypes from genotype data is important for plant and animal breeding, medicine, and evolutionary biology. Genomic-based phenotype prediction has been applied for single-nucleotide polymorphism (SNP) genotyping platforms, but not using complete genome sequences. Here, we report genomic prediction for starvation stress resistance and startle response in Drosophila melanogaster, using ∼2.5 million SNPs determined by sequencing the Drosophila Genetic Reference Panel population of inbred lines. We constructed a genomic relationship matrix from the SNP data and used it in a genomic best linear unbiased prediction (GBLUP) model. We assessed predictive ability as the correlation between predicted genetic values and observed phenotypes by cross-validation, and found a predictive ability of 0.239±0.008 (0.230±0.012) for starvation resistance (startle response). The predictive ability of BayesB, a Bayesian method with internal SNP selection, was not greater than GBLUP. Selection of the 5% SNPs with either the highest absolute effect or variance explained did not improve predictive ability. Predictive ability decreased only when fewer than 150,000 SNPs were used to construct the genomic relationship matrix. We hypothesize that predictive power in this population stems from the SNP-based modeling of the subtle relationship structure caused by long-range linkage disequilibrium and not from population structure or SNPs in linkage disequilibrium with causal variants. We discuss the implications of these results for genomic prediction in other organisms.http://europepmc.org/articles/PMC3342952?pdf=render |
spellingShingle | Ulrike Ober Julien F Ayroles Eric A Stone Stephen Richards Dianhui Zhu Richard A Gibbs Christian Stricker Daniel Gianola Martin Schlather Trudy F C Mackay Henner Simianer Using whole-genome sequence data to predict quantitative trait phenotypes in Drosophila melanogaster. PLoS Genetics |
title | Using whole-genome sequence data to predict quantitative trait phenotypes in Drosophila melanogaster. |
title_full | Using whole-genome sequence data to predict quantitative trait phenotypes in Drosophila melanogaster. |
title_fullStr | Using whole-genome sequence data to predict quantitative trait phenotypes in Drosophila melanogaster. |
title_full_unstemmed | Using whole-genome sequence data to predict quantitative trait phenotypes in Drosophila melanogaster. |
title_short | Using whole-genome sequence data to predict quantitative trait phenotypes in Drosophila melanogaster. |
title_sort | using whole genome sequence data to predict quantitative trait phenotypes in drosophila melanogaster |
url | http://europepmc.org/articles/PMC3342952?pdf=render |
work_keys_str_mv | AT ulrikeober usingwholegenomesequencedatatopredictquantitativetraitphenotypesindrosophilamelanogaster AT julienfayroles usingwholegenomesequencedatatopredictquantitativetraitphenotypesindrosophilamelanogaster AT ericastone usingwholegenomesequencedatatopredictquantitativetraitphenotypesindrosophilamelanogaster AT stephenrichards usingwholegenomesequencedatatopredictquantitativetraitphenotypesindrosophilamelanogaster AT dianhuizhu usingwholegenomesequencedatatopredictquantitativetraitphenotypesindrosophilamelanogaster AT richardagibbs usingwholegenomesequencedatatopredictquantitativetraitphenotypesindrosophilamelanogaster AT christianstricker usingwholegenomesequencedatatopredictquantitativetraitphenotypesindrosophilamelanogaster AT danielgianola usingwholegenomesequencedatatopredictquantitativetraitphenotypesindrosophilamelanogaster AT martinschlather usingwholegenomesequencedatatopredictquantitativetraitphenotypesindrosophilamelanogaster AT trudyfcmackay usingwholegenomesequencedatatopredictquantitativetraitphenotypesindrosophilamelanogaster AT hennersimianer usingwholegenomesequencedatatopredictquantitativetraitphenotypesindrosophilamelanogaster |