Evolutionary rates and patterns for human transcription factor binding sites derived from repetitive DNA

<p>Abstract</p> <p>Background</p> <p>The majority of human non-protein-coding DNA is made up of repetitive sequences, mainly transposable elements (TEs). It is becoming increasingly apparent that many of these repetitive DNA sequence elements encode gene regulatory func...

Full description

Bibliographic Details
Main Authors: McDonald John F, Landsman David, Mariño-Ramírez Leonardo, Polavarapu Nalini, Jordan I King
Format: Article
Language:English
Published: BMC 2008-05-01
Series:BMC Genomics
Online Access:http://www.biomedcentral.com/1471-2164/9/226
_version_ 1811245651492601856
author McDonald John F
Landsman David
Mariño-Ramírez Leonardo
Polavarapu Nalini
Jordan I King
author_facet McDonald John F
Landsman David
Mariño-Ramírez Leonardo
Polavarapu Nalini
Jordan I King
author_sort McDonald John F
collection DOAJ
description <p>Abstract</p> <p>Background</p> <p>The majority of human non-protein-coding DNA is made up of repetitive sequences, mainly transposable elements (TEs). It is becoming increasingly apparent that many of these repetitive DNA sequence elements encode gene regulatory functions. This fact has important evolutionary implications, since repetitive DNA is the most dynamic part of the genome. We set out to assess the evolutionary rate and pattern of experimentally characterized human transcription factor binding sites (TFBS) that are derived from repetitive versus non-repetitive DNA to test whether repeat-derived TFBS are in fact rapidly evolving. We also evaluated the position-specific patterns of variation among TFBS to look for signs of functional constraint on TFBS derived from repetitive and non-repetitive DNA.</p> <p>Results</p> <p>We found numerous experimentally characterized TFBS in the human genome, 7–10% of all mapped sites, which are derived from repetitive DNA sequences including simple sequence repeats (SSRs) and TEs. TE-derived TFBS sequences are far less conserved between species than TFBS derived from SSRs and non-repetitive DNA. Despite their rapid evolution, several lines of evidence indicate that TE-derived TFBS are functionally constrained. First of all, ancient TE families, such as MIR and L2, are enriched for TFBS relative to younger families like Alu and L1. Secondly, functionally important positions in TE-derived TFBS, specifically those residues thought to physically interact with their cognate protein binding factors (TF), are more evolutionarily conserved than adjacent TFBS positions. Finally, TE-derived TFBS show position-specific patterns of sequence variation that are highly distinct from random patterns and similar to the variation seen for non-repeat derived sequences of the same TFBS.</p> <p>Conclusion</p> <p>The abundance of experimentally characterized human TFBS that are derived from repetitive DNA speaks to the substantial regulatory effects that this class of sequence has on the human genome. The unique evolutionary properties of repeat-derived TFBS are perhaps even more intriguing. TE-derived TFBS in particular, while clearly functionally constrained, evolve extremely rapidly relative to non-repeat derived sites. Such rapidly evolving TFBS are likely to confer species-specific regulatory phenotypes, <it>i.e</it>. divergent expression patterns, on the human evolutionary lineage. This result has practical implications with respect to the widespread use of evolutionary conservation as a surrogate for functionally relevant non-coding DNA. Most TE-derived TFBS would be missed using the kinds of sequence conservation-based screens, such as phylogenetic footprinting, that are used to help characterize non-coding DNA. Thus, the very TFBS that are most likely to yield human-specific characteristics will be neglected by the comparative genomic techniques that are currently <it>de rigeur </it>for the identification of novel regulatory sites.</p>
first_indexed 2024-04-12T14:42:36Z
format Article
id doaj.art-786fa36e76514cdaa326abadd401ed4a
institution Directory Open Access Journal
issn 1471-2164
language English
last_indexed 2024-04-12T14:42:36Z
publishDate 2008-05-01
publisher BMC
record_format Article
series BMC Genomics
spelling doaj.art-786fa36e76514cdaa326abadd401ed4a2022-12-22T03:28:46ZengBMCBMC Genomics1471-21642008-05-019122610.1186/1471-2164-9-226Evolutionary rates and patterns for human transcription factor binding sites derived from repetitive DNAMcDonald John FLandsman DavidMariño-Ramírez LeonardoPolavarapu NaliniJordan I King<p>Abstract</p> <p>Background</p> <p>The majority of human non-protein-coding DNA is made up of repetitive sequences, mainly transposable elements (TEs). It is becoming increasingly apparent that many of these repetitive DNA sequence elements encode gene regulatory functions. This fact has important evolutionary implications, since repetitive DNA is the most dynamic part of the genome. We set out to assess the evolutionary rate and pattern of experimentally characterized human transcription factor binding sites (TFBS) that are derived from repetitive versus non-repetitive DNA to test whether repeat-derived TFBS are in fact rapidly evolving. We also evaluated the position-specific patterns of variation among TFBS to look for signs of functional constraint on TFBS derived from repetitive and non-repetitive DNA.</p> <p>Results</p> <p>We found numerous experimentally characterized TFBS in the human genome, 7–10% of all mapped sites, which are derived from repetitive DNA sequences including simple sequence repeats (SSRs) and TEs. TE-derived TFBS sequences are far less conserved between species than TFBS derived from SSRs and non-repetitive DNA. Despite their rapid evolution, several lines of evidence indicate that TE-derived TFBS are functionally constrained. First of all, ancient TE families, such as MIR and L2, are enriched for TFBS relative to younger families like Alu and L1. Secondly, functionally important positions in TE-derived TFBS, specifically those residues thought to physically interact with their cognate protein binding factors (TF), are more evolutionarily conserved than adjacent TFBS positions. Finally, TE-derived TFBS show position-specific patterns of sequence variation that are highly distinct from random patterns and similar to the variation seen for non-repeat derived sequences of the same TFBS.</p> <p>Conclusion</p> <p>The abundance of experimentally characterized human TFBS that are derived from repetitive DNA speaks to the substantial regulatory effects that this class of sequence has on the human genome. The unique evolutionary properties of repeat-derived TFBS are perhaps even more intriguing. TE-derived TFBS in particular, while clearly functionally constrained, evolve extremely rapidly relative to non-repeat derived sites. Such rapidly evolving TFBS are likely to confer species-specific regulatory phenotypes, <it>i.e</it>. divergent expression patterns, on the human evolutionary lineage. This result has practical implications with respect to the widespread use of evolutionary conservation as a surrogate for functionally relevant non-coding DNA. Most TE-derived TFBS would be missed using the kinds of sequence conservation-based screens, such as phylogenetic footprinting, that are used to help characterize non-coding DNA. Thus, the very TFBS that are most likely to yield human-specific characteristics will be neglected by the comparative genomic techniques that are currently <it>de rigeur </it>for the identification of novel regulatory sites.</p>http://www.biomedcentral.com/1471-2164/9/226
spellingShingle McDonald John F
Landsman David
Mariño-Ramírez Leonardo
Polavarapu Nalini
Jordan I King
Evolutionary rates and patterns for human transcription factor binding sites derived from repetitive DNA
BMC Genomics
title Evolutionary rates and patterns for human transcription factor binding sites derived from repetitive DNA
title_full Evolutionary rates and patterns for human transcription factor binding sites derived from repetitive DNA
title_fullStr Evolutionary rates and patterns for human transcription factor binding sites derived from repetitive DNA
title_full_unstemmed Evolutionary rates and patterns for human transcription factor binding sites derived from repetitive DNA
title_short Evolutionary rates and patterns for human transcription factor binding sites derived from repetitive DNA
title_sort evolutionary rates and patterns for human transcription factor binding sites derived from repetitive dna
url http://www.biomedcentral.com/1471-2164/9/226
work_keys_str_mv AT mcdonaldjohnf evolutionaryratesandpatternsforhumantranscriptionfactorbindingsitesderivedfromrepetitivedna
AT landsmandavid evolutionaryratesandpatternsforhumantranscriptionfactorbindingsitesderivedfromrepetitivedna
AT marinoramirezleonardo evolutionaryratesandpatternsforhumantranscriptionfactorbindingsitesderivedfromrepetitivedna
AT polavarapunalini evolutionaryratesandpatternsforhumantranscriptionfactorbindingsitesderivedfromrepetitivedna
AT jordaniking evolutionaryratesandpatternsforhumantranscriptionfactorbindingsitesderivedfromrepetitivedna