The Population Genomics of Sunflowers and Genomic Determinants of Protein Evolution Revealed by RNAseq

Few studies have investigated the causes of evolutionary rate variation among plant nuclear genes, especially in recently diverged species still capable of hybridizing in the wild. The recent advent of Next Generation Sequencing (NGS) permits investigation of genome wide rates of protein evolution a...

Full description

Bibliographic Details
Main Authors: Loren H. Rieseberg, Nolan C. Kane, Brook T. Moyers, Sébastien Renaut, Christopher J. Grassa
Format: Article
Language:English
Published: MDPI AG 2012-10-01
Series:Biology
Subjects:
Online Access:http://www.mdpi.com/2079-7737/1/3/575
_version_ 1827834597652561920
author Loren H. Rieseberg
Nolan C. Kane
Brook T. Moyers
Sébastien Renaut
Christopher J. Grassa
author_facet Loren H. Rieseberg
Nolan C. Kane
Brook T. Moyers
Sébastien Renaut
Christopher J. Grassa
author_sort Loren H. Rieseberg
collection DOAJ
description Few studies have investigated the causes of evolutionary rate variation among plant nuclear genes, especially in recently diverged species still capable of hybridizing in the wild. The recent advent of Next Generation Sequencing (NGS) permits investigation of genome wide rates of protein evolution and the role of selection in generating and maintaining divergence. Here, we use individual whole-transcriptome sequencing (RNAseq) to refine our understanding of the population genomics of wild species of sunflowers (Helianthus spp.) and the factors that affect rates of protein evolution. We aligned 35 GB of transcriptome sequencing data and identified 433,257 polymorphic sites (SNPs) in a reference transcriptome comprising 16,312 genes. Using SNP markers, we identified strong population clustering largely corresponding to the three species analyzed here (Helianthus annuus, H. petiolaris, H. debilis), with one distinct early generation hybrid. Then, we calculated the proportions of adaptive substitution fixed by selection (alpha) and identified gene ontology categories with elevated values of alpha. The “response to biotic stimulus” category had the highest mean alpha across the three interspecific comparisons, implying that natural selection imposed by other organisms plays an important role in driving protein evolution in wild sunflowers. Finally, we examined the relationship between protein evolution (dN/dS ratio) and several genomic factors predicted to co-vary with protein evolution (gene expression level, divergence and specificity, genetic divergence [FST], and nucleotide diversity pi). We find that variation in rates of protein divergence was correlated with gene expression level and specificity, consistent with results from a broad range of taxa and timescales. This would in turn imply that these factors govern protein evolution both at a microevolutionary and macroevolutionary timescale. Our results contribute to a general understanding of the determinants of rates of protein evolution and the impact of selection on patterns of polymorphism and divergence.
first_indexed 2024-03-12T05:53:10Z
format Article
id doaj.art-1175e957e59a47c6ab4198e7675e1ce6
institution Directory Open Access Journal
issn 2079-7737
language English
last_indexed 2024-03-12T05:53:10Z
publishDate 2012-10-01
publisher MDPI AG
record_format Article
series Biology
spelling doaj.art-1175e957e59a47c6ab4198e7675e1ce62023-09-03T04:51:16ZengMDPI AGBiology2079-77372012-10-011357559610.3390/biology1030575The Population Genomics of Sunflowers and Genomic Determinants of Protein Evolution Revealed by RNAseqLoren H. RiesebergNolan C. KaneBrook T. MoyersSébastien RenautChristopher J. GrassaFew studies have investigated the causes of evolutionary rate variation among plant nuclear genes, especially in recently diverged species still capable of hybridizing in the wild. The recent advent of Next Generation Sequencing (NGS) permits investigation of genome wide rates of protein evolution and the role of selection in generating and maintaining divergence. Here, we use individual whole-transcriptome sequencing (RNAseq) to refine our understanding of the population genomics of wild species of sunflowers (Helianthus spp.) and the factors that affect rates of protein evolution. We aligned 35 GB of transcriptome sequencing data and identified 433,257 polymorphic sites (SNPs) in a reference transcriptome comprising 16,312 genes. Using SNP markers, we identified strong population clustering largely corresponding to the three species analyzed here (Helianthus annuus, H. petiolaris, H. debilis), with one distinct early generation hybrid. Then, we calculated the proportions of adaptive substitution fixed by selection (alpha) and identified gene ontology categories with elevated values of alpha. The “response to biotic stimulus” category had the highest mean alpha across the three interspecific comparisons, implying that natural selection imposed by other organisms plays an important role in driving protein evolution in wild sunflowers. Finally, we examined the relationship between protein evolution (dN/dS ratio) and several genomic factors predicted to co-vary with protein evolution (gene expression level, divergence and specificity, genetic divergence [FST], and nucleotide diversity pi). We find that variation in rates of protein divergence was correlated with gene expression level and specificity, consistent with results from a broad range of taxa and timescales. This would in turn imply that these factors govern protein evolution both at a microevolutionary and macroevolutionary timescale. Our results contribute to a general understanding of the determinants of rates of protein evolution and the impact of selection on patterns of polymorphism and divergence.http://www.mdpi.com/2079-7737/1/3/575RNAseqgene expressionsequence divergenceHelianthusselectiontranscriptomeMcDonald-Kreitman test
spellingShingle Loren H. Rieseberg
Nolan C. Kane
Brook T. Moyers
Sébastien Renaut
Christopher J. Grassa
The Population Genomics of Sunflowers and Genomic Determinants of Protein Evolution Revealed by RNAseq
Biology
RNAseq
gene expression
sequence divergence
Helianthus
selection
transcriptome
McDonald-Kreitman test
title The Population Genomics of Sunflowers and Genomic Determinants of Protein Evolution Revealed by RNAseq
title_full The Population Genomics of Sunflowers and Genomic Determinants of Protein Evolution Revealed by RNAseq
title_fullStr The Population Genomics of Sunflowers and Genomic Determinants of Protein Evolution Revealed by RNAseq
title_full_unstemmed The Population Genomics of Sunflowers and Genomic Determinants of Protein Evolution Revealed by RNAseq
title_short The Population Genomics of Sunflowers and Genomic Determinants of Protein Evolution Revealed by RNAseq
title_sort population genomics of sunflowers and genomic determinants of protein evolution revealed by rnaseq
topic RNAseq
gene expression
sequence divergence
Helianthus
selection
transcriptome
McDonald-Kreitman test
url http://www.mdpi.com/2079-7737/1/3/575
work_keys_str_mv AT lorenhrieseberg thepopulationgenomicsofsunflowersandgenomicdeterminantsofproteinevolutionrevealedbyrnaseq
AT nolanckane thepopulationgenomicsofsunflowersandgenomicdeterminantsofproteinevolutionrevealedbyrnaseq
AT brooktmoyers thepopulationgenomicsofsunflowersandgenomicdeterminantsofproteinevolutionrevealedbyrnaseq
AT sebastienrenaut thepopulationgenomicsofsunflowersandgenomicdeterminantsofproteinevolutionrevealedbyrnaseq
AT christopherjgrassa thepopulationgenomicsofsunflowersandgenomicdeterminantsofproteinevolutionrevealedbyrnaseq
AT lorenhrieseberg populationgenomicsofsunflowersandgenomicdeterminantsofproteinevolutionrevealedbyrnaseq
AT nolanckane populationgenomicsofsunflowersandgenomicdeterminantsofproteinevolutionrevealedbyrnaseq
AT brooktmoyers populationgenomicsofsunflowersandgenomicdeterminantsofproteinevolutionrevealedbyrnaseq
AT sebastienrenaut populationgenomicsofsunflowersandgenomicdeterminantsofproteinevolutionrevealedbyrnaseq
AT christopherjgrassa populationgenomicsofsunflowersandgenomicdeterminantsofproteinevolutionrevealedbyrnaseq