Weird gene in a weird mammal: comparative and functional analysis of the highly divergent sand rat Pdx1 gene
<p>Deleterious mutations in highly conserved genes are expected to be removed by natural selection. However, when these genes are trapped in dynamic regions of the genome, for example mutational hotspots, it becomes difficult to predict how the conflict between nucleotide changes and natural s...
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis |
Language: | English |
Published: |
2020
|
Subjects: |
_version_ | 1797105011286605824 |
---|---|
author | Dai, Y |
author2 | Holland, P |
author_facet | Holland, P Dai, Y |
author_sort | Dai, Y |
collection | OXFORD |
description | <p>Deleterious mutations in highly conserved genes are expected to be removed by natural selection. However, when these genes are trapped in dynamic regions of the genome, for example mutational hotspots, it becomes difficult to predict how the conflict between nucleotide changes and natural selection will be resolved. In this thesis, I investigate highly divergent genes in the fat sand rat (Psammomys obesus) associated with GC-skew. To understand how the tug-of-war between GC-skew and natural selection is resolved, I focus on a highly representative example of an extremely conserved gene trapped in an extreme GCrich region of the sand rat genome: pancreatic duodenum homeobox 1 (Pdx1).</p>
<p>In Chapter 2, I hypothesize that the highly divergent sand rat PDX1 protein may be a result of adaptive selection, and that similar amino acid changes may be observed in other arid-living
rodents. Using genomic and transcriptomic data gathered from 33 rodent species, I show that there is near 100% conservation of the PDX1 homeodomain sequence in non-gerbil species,
including 15 rodent species that reside in arid or semi-arid habitats. The highly divergent Pdx1 gene observed in the sand rat is only found in three gerbil species, suggesting that adaptive or relaxed selection in an arid environment may not be the key factor affecting Pdx1 gene evolution in this lineage. Instead, other factors such as GC composition bias may be the main factor driving Pdx1 gene divergence in the gerbil subfamily.</p>
<p>In Chapter 3, I hypothesize that natural selection may have only managed to purge extremely deleterious mutations while tolerating the existence of mildly deleterious mutations in these
genes. Using mammalian cell transfection and pulse-chase labelling, I reveal loss of a conserved ubiquitination site and compensation from a unique, gerbil-specific ubiquitination
site. This provides evidence for loss of a conserved amino acid with important function associated with GC-skew and links the presence of a gerbil-unique lysine residue with a specific function.</p>
<p>In Chapter 4, I use qPCR to demonstrate significantly altered regulatory effects of the sand rat PDX1 protein on key target genes in rat pancreatic cells, Ins1 and Ins2. I also present
establishment of CRISPR/Cas9 edited knock-in cell lines with the aim of teasing apart whether these observed differences were caused by changes in sand rat PDX1 binding site recognition. I conclude that the highly divergent sand rat PDX1 has likely lost conserved functional residues and has significantly reduced ability to initiate proper biological response to glucose stimulation in a rat cell environment. At the same time, results in Chapter 3 show the presence of gerbil-specific residues with specific function, indicating possible compensation by natural selection.</p>
<p>I use genome wide comparison methods in Chapter 5 to demonstrate that extreme GC-skew has affected multiple highly conserved genes with assorted function. Pdx1 is not the only
aberrantly divergent gene in the gerbil lineage, and these aberrantly divergent genes are linked to GC-skew and high GC regions in the gerbil genome. Four divergent genes, Pdx1, Insr,
Medag and Spp1 are likely associated with obesity and abnormal metabolism observed in gerbils on a standard laboratory diet.</p>
<p>Overall, this research provides further characterization of GC-rich genome regions in the gerbil lineage and functional analysis of a highly divergent sand rat gene, Pdx1, that is affected by
GC-skew but also has significant functional importance. This work suggests that when extreme changes in genome nucleotide content occurs, natural selection may be incapable of removing some maladaptive mutations in highly conserved genes that are otherwise abolished in other lineages.</p> |
first_indexed | 2024-03-07T06:41:29Z |
format | Thesis |
id | oxford-uuid:f96af087-c350-4d86-a2f7-315b7ec655a5 |
institution | University of Oxford |
language | English |
last_indexed | 2024-03-07T06:41:29Z |
publishDate | 2020 |
record_format | dspace |
spelling | oxford-uuid:f96af087-c350-4d86-a2f7-315b7ec655a52022-03-27T12:57:55ZWeird gene in a weird mammal: comparative and functional analysis of the highly divergent sand rat Pdx1 geneThesishttp://purl.org/coar/resource_type/c_db06uuid:f96af087-c350-4d86-a2f7-315b7ec655a5Molecular biologyMolecular evolutionZoologyEnglishHyrax Deposit2020Dai, YHolland, P<p>Deleterious mutations in highly conserved genes are expected to be removed by natural selection. However, when these genes are trapped in dynamic regions of the genome, for example mutational hotspots, it becomes difficult to predict how the conflict between nucleotide changes and natural selection will be resolved. In this thesis, I investigate highly divergent genes in the fat sand rat (Psammomys obesus) associated with GC-skew. To understand how the tug-of-war between GC-skew and natural selection is resolved, I focus on a highly representative example of an extremely conserved gene trapped in an extreme GCrich region of the sand rat genome: pancreatic duodenum homeobox 1 (Pdx1).</p> <p>In Chapter 2, I hypothesize that the highly divergent sand rat PDX1 protein may be a result of adaptive selection, and that similar amino acid changes may be observed in other arid-living rodents. Using genomic and transcriptomic data gathered from 33 rodent species, I show that there is near 100% conservation of the PDX1 homeodomain sequence in non-gerbil species, including 15 rodent species that reside in arid or semi-arid habitats. The highly divergent Pdx1 gene observed in the sand rat is only found in three gerbil species, suggesting that adaptive or relaxed selection in an arid environment may not be the key factor affecting Pdx1 gene evolution in this lineage. Instead, other factors such as GC composition bias may be the main factor driving Pdx1 gene divergence in the gerbil subfamily.</p> <p>In Chapter 3, I hypothesize that natural selection may have only managed to purge extremely deleterious mutations while tolerating the existence of mildly deleterious mutations in these genes. Using mammalian cell transfection and pulse-chase labelling, I reveal loss of a conserved ubiquitination site and compensation from a unique, gerbil-specific ubiquitination site. This provides evidence for loss of a conserved amino acid with important function associated with GC-skew and links the presence of a gerbil-unique lysine residue with a specific function.</p> <p>In Chapter 4, I use qPCR to demonstrate significantly altered regulatory effects of the sand rat PDX1 protein on key target genes in rat pancreatic cells, Ins1 and Ins2. I also present establishment of CRISPR/Cas9 edited knock-in cell lines with the aim of teasing apart whether these observed differences were caused by changes in sand rat PDX1 binding site recognition. I conclude that the highly divergent sand rat PDX1 has likely lost conserved functional residues and has significantly reduced ability to initiate proper biological response to glucose stimulation in a rat cell environment. At the same time, results in Chapter 3 show the presence of gerbil-specific residues with specific function, indicating possible compensation by natural selection.</p> <p>I use genome wide comparison methods in Chapter 5 to demonstrate that extreme GC-skew has affected multiple highly conserved genes with assorted function. Pdx1 is not the only aberrantly divergent gene in the gerbil lineage, and these aberrantly divergent genes are linked to GC-skew and high GC regions in the gerbil genome. Four divergent genes, Pdx1, Insr, Medag and Spp1 are likely associated with obesity and abnormal metabolism observed in gerbils on a standard laboratory diet.</p> <p>Overall, this research provides further characterization of GC-rich genome regions in the gerbil lineage and functional analysis of a highly divergent sand rat gene, Pdx1, that is affected by GC-skew but also has significant functional importance. This work suggests that when extreme changes in genome nucleotide content occurs, natural selection may be incapable of removing some maladaptive mutations in highly conserved genes that are otherwise abolished in other lineages.</p> |
spellingShingle | Molecular biology Molecular evolution Zoology Dai, Y Weird gene in a weird mammal: comparative and functional analysis of the highly divergent sand rat Pdx1 gene |
title | Weird gene in a weird mammal: comparative and functional analysis of the highly divergent sand rat Pdx1 gene |
title_full | Weird gene in a weird mammal: comparative and functional analysis of the highly divergent sand rat Pdx1 gene |
title_fullStr | Weird gene in a weird mammal: comparative and functional analysis of the highly divergent sand rat Pdx1 gene |
title_full_unstemmed | Weird gene in a weird mammal: comparative and functional analysis of the highly divergent sand rat Pdx1 gene |
title_short | Weird gene in a weird mammal: comparative and functional analysis of the highly divergent sand rat Pdx1 gene |
title_sort | weird gene in a weird mammal comparative and functional analysis of the highly divergent sand rat pdx1 gene |
topic | Molecular biology Molecular evolution Zoology |
work_keys_str_mv | AT daiy weirdgeneinaweirdmammalcomparativeandfunctionalanalysisofthehighlydivergentsandratpdx1gene |