Genetic variation in an individual human exome.

There is much interest in characterizing the variation in a human individual, because this may elucidate what contributes significantly to a person's phenotype, thereby enabling personalized genomics. We focus here on the variants in a person's 'exome,' which is the set of exons...

Full description

Bibliographic Details
Main Authors: Pauline C Ng, Samuel Levy, Jiaqi Huang, Timothy B Stockwell, Brian P Walenz, Kelvin Li, Nelson Axelrod, Dana A Busam, Robert L Strausberg, J Craig Venter
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2008-08-01
Series:PLoS Genetics
Online Access:http://europepmc.org/articles/PMC2493042?pdf=render
_version_ 1811325506822340608
author Pauline C Ng
Samuel Levy
Jiaqi Huang
Timothy B Stockwell
Brian P Walenz
Kelvin Li
Nelson Axelrod
Dana A Busam
Robert L Strausberg
J Craig Venter
author_facet Pauline C Ng
Samuel Levy
Jiaqi Huang
Timothy B Stockwell
Brian P Walenz
Kelvin Li
Nelson Axelrod
Dana A Busam
Robert L Strausberg
J Craig Venter
author_sort Pauline C Ng
collection DOAJ
description There is much interest in characterizing the variation in a human individual, because this may elucidate what contributes significantly to a person's phenotype, thereby enabling personalized genomics. We focus here on the variants in a person's 'exome,' which is the set of exons in a genome, because the exome is believed to harbor much of the functional variation. We provide an analysis of the approximately 12,500 variants that affect the protein coding portion of an individual's genome. We identified approximately 10,400 nonsynonymous single nucleotide polymorphisms (nsSNPs) in this individual, of which approximately 15-20% are rare in the human population. We predict approximately 1,500 nsSNPs affect protein function and these tend be heterozygous, rare, or novel. Of the approximately 700 coding indels, approximately half tend to have lengths that are a multiple of three, which causes insertions/deletions of amino acids in the corresponding protein, rather than introducing frameshifts. Coding indels also occur frequently at the termini of genes, so even if an indel causes a frameshift, an alternative start or stop site in the gene can still be used to make a functional protein. In summary, we reduced the set of approximately 12,500 nonsilent coding variants by approximately 8-fold to a set of variants that are most likely to have major effects on their proteins' functions. This is our first glimpse of an individual's exome and a snapshot of the current state of personalized genomics. The majority of coding variants in this individual are common and appear to be functionally neutral. Our results also indicate that some variants can be used to improve the current NCBI human reference genome. As more genomes are sequenced, many rare variants and non-SNP variants will be discovered. We present an approach to analyze the coding variation in humans by proposing multiple bioinformatic methods to hone in on possible functional variation.
first_indexed 2024-04-13T14:34:24Z
format Article
id doaj.art-ec25641b3e3242fd9b4c301b74382959
institution Directory Open Access Journal
issn 1553-7390
1553-7404
language English
last_indexed 2024-04-13T14:34:24Z
publishDate 2008-08-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS Genetics
spelling doaj.art-ec25641b3e3242fd9b4c301b743829592022-12-22T02:43:06ZengPublic Library of Science (PLoS)PLoS Genetics1553-73901553-74042008-08-0148e100016010.1371/journal.pgen.1000160Genetic variation in an individual human exome.Pauline C NgSamuel LevyJiaqi HuangTimothy B StockwellBrian P WalenzKelvin LiNelson AxelrodDana A BusamRobert L StrausbergJ Craig VenterThere is much interest in characterizing the variation in a human individual, because this may elucidate what contributes significantly to a person's phenotype, thereby enabling personalized genomics. We focus here on the variants in a person's 'exome,' which is the set of exons in a genome, because the exome is believed to harbor much of the functional variation. We provide an analysis of the approximately 12,500 variants that affect the protein coding portion of an individual's genome. We identified approximately 10,400 nonsynonymous single nucleotide polymorphisms (nsSNPs) in this individual, of which approximately 15-20% are rare in the human population. We predict approximately 1,500 nsSNPs affect protein function and these tend be heterozygous, rare, or novel. Of the approximately 700 coding indels, approximately half tend to have lengths that are a multiple of three, which causes insertions/deletions of amino acids in the corresponding protein, rather than introducing frameshifts. Coding indels also occur frequently at the termini of genes, so even if an indel causes a frameshift, an alternative start or stop site in the gene can still be used to make a functional protein. In summary, we reduced the set of approximately 12,500 nonsilent coding variants by approximately 8-fold to a set of variants that are most likely to have major effects on their proteins' functions. This is our first glimpse of an individual's exome and a snapshot of the current state of personalized genomics. The majority of coding variants in this individual are common and appear to be functionally neutral. Our results also indicate that some variants can be used to improve the current NCBI human reference genome. As more genomes are sequenced, many rare variants and non-SNP variants will be discovered. We present an approach to analyze the coding variation in humans by proposing multiple bioinformatic methods to hone in on possible functional variation.http://europepmc.org/articles/PMC2493042?pdf=render
spellingShingle Pauline C Ng
Samuel Levy
Jiaqi Huang
Timothy B Stockwell
Brian P Walenz
Kelvin Li
Nelson Axelrod
Dana A Busam
Robert L Strausberg
J Craig Venter
Genetic variation in an individual human exome.
PLoS Genetics
title Genetic variation in an individual human exome.
title_full Genetic variation in an individual human exome.
title_fullStr Genetic variation in an individual human exome.
title_full_unstemmed Genetic variation in an individual human exome.
title_short Genetic variation in an individual human exome.
title_sort genetic variation in an individual human exome
url http://europepmc.org/articles/PMC2493042?pdf=render
work_keys_str_mv AT paulinecng geneticvariationinanindividualhumanexome
AT samuellevy geneticvariationinanindividualhumanexome
AT jiaqihuang geneticvariationinanindividualhumanexome
AT timothybstockwell geneticvariationinanindividualhumanexome
AT brianpwalenz geneticvariationinanindividualhumanexome
AT kelvinli geneticvariationinanindividualhumanexome
AT nelsonaxelrod geneticvariationinanindividualhumanexome
AT danaabusam geneticvariationinanindividualhumanexome
AT robertlstrausberg geneticvariationinanindividualhumanexome
AT jcraigventer geneticvariationinanindividualhumanexome