Target SNP selection in complex disease association studies

<p>Abstract</p> <p>Background</p> <p>The massive amount of SNP data stored at public internet sites provides unprecedented access to human genetic variation. Selecting target SNP for disease-gene association studies is currently done more or less randomly as decision ru...

Full description

Bibliographic Details
Main Author: Wjst Matthias
Format: Article
Language:English
Published: BMC 2004-07-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/5/92
_version_ 1818390298421624832
author Wjst Matthias
author_facet Wjst Matthias
author_sort Wjst Matthias
collection DOAJ
description <p>Abstract</p> <p>Background</p> <p>The massive amount of SNP data stored at public internet sites provides unprecedented access to human genetic variation. Selecting target SNP for disease-gene association studies is currently done more or less randomly as decision rules for the selection of functional relevant SNPs are not available.</p> <p>Results</p> <p>We implemented a computational pipeline that retrieves the genomic sequence of target genes, collects information about sequence variation and selects functional motifs containing SNPs. Motifs being considered are gene promoter, exon-intron structure, AU-rich mRNA elements, transcription factor binding motifs, cryptic and enhancer splice sites together with expression in target tissue.</p> <p>As a case study, 396 genes on chromosome 6p21 in the extended HLA region were selected that contributed nearly 20,000 SNPs. By computer annotation ~2,500 SNPs in functional motifs could be identified. Most of these SNPs are disrupting transcription factor binding sites but only those introducing new sites had a significant depressing effect on SNP allele frequency. Other decision rules concern position within motifs, the validity of SNP database entries, the unique occurrence in the genome and conserved sequence context in other mammalian genomes.</p> <p>Conclusion</p> <p>Only 10% of all gene-based SNPs have sequence-predicted functional relevance making them a primary target for genotyping in association studies.</p>
first_indexed 2024-12-14T04:55:24Z
format Article
id doaj.art-d4e87e567e884499b3609069578a12cf
institution Directory Open Access Journal
issn 1471-2105
language English
last_indexed 2024-12-14T04:55:24Z
publishDate 2004-07-01
publisher BMC
record_format Article
series BMC Bioinformatics
spelling doaj.art-d4e87e567e884499b3609069578a12cf2022-12-21T23:16:23ZengBMCBMC Bioinformatics1471-21052004-07-01519210.1186/1471-2105-5-92Target SNP selection in complex disease association studiesWjst Matthias<p>Abstract</p> <p>Background</p> <p>The massive amount of SNP data stored at public internet sites provides unprecedented access to human genetic variation. Selecting target SNP for disease-gene association studies is currently done more or less randomly as decision rules for the selection of functional relevant SNPs are not available.</p> <p>Results</p> <p>We implemented a computational pipeline that retrieves the genomic sequence of target genes, collects information about sequence variation and selects functional motifs containing SNPs. Motifs being considered are gene promoter, exon-intron structure, AU-rich mRNA elements, transcription factor binding motifs, cryptic and enhancer splice sites together with expression in target tissue.</p> <p>As a case study, 396 genes on chromosome 6p21 in the extended HLA region were selected that contributed nearly 20,000 SNPs. By computer annotation ~2,500 SNPs in functional motifs could be identified. Most of these SNPs are disrupting transcription factor binding sites but only those introducing new sites had a significant depressing effect on SNP allele frequency. Other decision rules concern position within motifs, the validity of SNP database entries, the unique occurrence in the genome and conserved sequence context in other mammalian genomes.</p> <p>Conclusion</p> <p>Only 10% of all gene-based SNPs have sequence-predicted functional relevance making them a primary target for genotyping in association studies.</p>http://www.biomedcentral.com/1471-2105/5/92
spellingShingle Wjst Matthias
Target SNP selection in complex disease association studies
BMC Bioinformatics
title Target SNP selection in complex disease association studies
title_full Target SNP selection in complex disease association studies
title_fullStr Target SNP selection in complex disease association studies
title_full_unstemmed Target SNP selection in complex disease association studies
title_short Target SNP selection in complex disease association studies
title_sort target snp selection in complex disease association studies
url http://www.biomedcentral.com/1471-2105/5/92
work_keys_str_mv AT wjstmatthias targetsnpselectionincomplexdiseaseassociationstudies