SNPs occur in regions with less genomic sequence conservation.

Rates of SNPs (single nucleotide polymorphisms) and cross-species genomic sequence conservation reflect intra- and inter-species variation, respectively. Here, I report SNP rates and genomic sequence conservation adjacent to mRNA processing regions and show that, as expected, more SNPs occur in less...

Full description

Bibliographic Details
Main Author: John C Castle
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2011-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC3108954?pdf=render
_version_ 1811327956782415872
author John C Castle
author_facet John C Castle
author_sort John C Castle
collection DOAJ
description Rates of SNPs (single nucleotide polymorphisms) and cross-species genomic sequence conservation reflect intra- and inter-species variation, respectively. Here, I report SNP rates and genomic sequence conservation adjacent to mRNA processing regions and show that, as expected, more SNPs occur in less conserved regions and that functional regions have fewer SNPs. Results are confirmed using both mouse and human data. Regions include protein start codons, 3' splice sites, 5' splice sites, protein stop codons, predicted miRNA binding sites, and polyadenylation sites. Throughout, SNP rates are lower and conservation is higher at regulatory sites. Within coding regions, SNP rates are highest and conservation is lowest at codon position three and the fewest SNPs are found at codon position two, reflecting codon degeneracy for amino acid encoding. Exon splice sites show high conservation and very low SNP rates, reflecting both splicing signals and protein coding. Relaxed constraint on the codon third position is dramatically seen when separating exonic SNP rates based on intron phase. At polyadenylation sites, a peak of conservation and low SNP rate occurs from 30 to 17 nt preceding the site. This region is highly enriched for the sequence AAUAAA, reflecting the location of the conserved polyA signal. miRNA 3' UTR target sites are predicted incorporating interspecies genomic sequence conservation; SNP rates are low in these sites, again showing fewer SNPs in conserved regions. Together, these results confirm that SNPs, reflecting recent genetic variation, occur more frequently in regions with less evolutionarily conservation.
first_indexed 2024-04-13T15:17:01Z
format Article
id doaj.art-ac07964d756e4a7b886fcfc355b548cd
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-04-13T15:17:01Z
publishDate 2011-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-ac07964d756e4a7b886fcfc355b548cd2022-12-22T02:41:50ZengPublic Library of Science (PLoS)PLoS ONE1932-62032011-01-0166e2066010.1371/journal.pone.0020660SNPs occur in regions with less genomic sequence conservation.John C CastleRates of SNPs (single nucleotide polymorphisms) and cross-species genomic sequence conservation reflect intra- and inter-species variation, respectively. Here, I report SNP rates and genomic sequence conservation adjacent to mRNA processing regions and show that, as expected, more SNPs occur in less conserved regions and that functional regions have fewer SNPs. Results are confirmed using both mouse and human data. Regions include protein start codons, 3' splice sites, 5' splice sites, protein stop codons, predicted miRNA binding sites, and polyadenylation sites. Throughout, SNP rates are lower and conservation is higher at regulatory sites. Within coding regions, SNP rates are highest and conservation is lowest at codon position three and the fewest SNPs are found at codon position two, reflecting codon degeneracy for amino acid encoding. Exon splice sites show high conservation and very low SNP rates, reflecting both splicing signals and protein coding. Relaxed constraint on the codon third position is dramatically seen when separating exonic SNP rates based on intron phase. At polyadenylation sites, a peak of conservation and low SNP rate occurs from 30 to 17 nt preceding the site. This region is highly enriched for the sequence AAUAAA, reflecting the location of the conserved polyA signal. miRNA 3' UTR target sites are predicted incorporating interspecies genomic sequence conservation; SNP rates are low in these sites, again showing fewer SNPs in conserved regions. Together, these results confirm that SNPs, reflecting recent genetic variation, occur more frequently in regions with less evolutionarily conservation.http://europepmc.org/articles/PMC3108954?pdf=render
spellingShingle John C Castle
SNPs occur in regions with less genomic sequence conservation.
PLoS ONE
title SNPs occur in regions with less genomic sequence conservation.
title_full SNPs occur in regions with less genomic sequence conservation.
title_fullStr SNPs occur in regions with less genomic sequence conservation.
title_full_unstemmed SNPs occur in regions with less genomic sequence conservation.
title_short SNPs occur in regions with less genomic sequence conservation.
title_sort snps occur in regions with less genomic sequence conservation
url http://europepmc.org/articles/PMC3108954?pdf=render
work_keys_str_mv AT johnccastle snpsoccurinregionswithlessgenomicsequenceconservation