Quantifying shifts in natural selection on codon usage between protein regions: a population genetics approach
Abstract Background Codon usage bias (CUB), the non-uniform usage of synonymous codons, occurs across all domains of life. Adaptive CUB is hypothesized to result from various selective pressures, including selection for efficient ribosome elongation, accurate translation, mRNA secondary structure, a...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2022-05-01
|
Series: | BMC Genomics |
Subjects: | |
Online Access: | https://doi.org/10.1186/s12864-022-08635-0 |
_version_ | 1818533741413269504 |
---|---|
author | Alexander L. Cope Michael A. Gilchrist |
author_facet | Alexander L. Cope Michael A. Gilchrist |
author_sort | Alexander L. Cope |
collection | DOAJ |
description | Abstract Background Codon usage bias (CUB), the non-uniform usage of synonymous codons, occurs across all domains of life. Adaptive CUB is hypothesized to result from various selective pressures, including selection for efficient ribosome elongation, accurate translation, mRNA secondary structure, and/or protein folding. Given the critical link between protein folding and protein function, numerous studies have analyzed the relationship between codon usage and protein structure. The results from these studies have often been contradictory, likely reflecting the differing methods used for measuring codon usage and the failure to appropriately control for confounding factors, such as differences in amino acid usage between protein structures and changes in the frequency of different structures with gene expression. Results Here we take an explicit population genetics approach to quantify codon-specific shifts in natural selection related to protein structure in S. cerevisiae and E. coli. Unlike other metrics of codon usage, our approach explicitly separates the effects of natural selection, scaled by gene expression, and mutation bias while naturally accounting for a region’s amino acid usage. Bayesian model comparisons suggest selection on codon usage varies only slightly between helix, sheet, and coil secondary structures and, similarly, between structured and intrinsically-disordered regions. Similarly, in contrast to prevous findings, we find selection on codon usage only varies slightly at the termini of helices in E. coli. Using simulated data, we show this previous work indicating “non-optimal” codons are enriched at the beginning of helices in S. cerevisiae was due to failure to control for various confounding factors (e.g. amino acid biases, gene expression, etc.), and rather than selection to modulate cotranslational folding. Conclusions Our results reveal a weak relationship between codon usage and protein structure, indicating that differences in selection on codon usage between structures are slight. In addition to the magnitude of differences in selection between protein structures being slight, the observed shifts appear to be idiosyncratic and largely codon-specific rather than systematic reversals in the nature of selection. Overall, our work demonstrates the statistical power and benefits of studying selective shifts on codon usage or other genomic features from an explicitly evolutionary approach. Limitations of this approach and future potential research avenues are discussed. |
first_indexed | 2024-12-11T18:02:44Z |
format | Article |
id | doaj.art-28b26c944ed04d97a1ea73f74890cbea |
institution | Directory Open Access Journal |
issn | 1471-2164 |
language | English |
last_indexed | 2024-12-11T18:02:44Z |
publishDate | 2022-05-01 |
publisher | BMC |
record_format | Article |
series | BMC Genomics |
spelling | doaj.art-28b26c944ed04d97a1ea73f74890cbea2022-12-22T00:55:49ZengBMCBMC Genomics1471-21642022-05-0123111910.1186/s12864-022-08635-0Quantifying shifts in natural selection on codon usage between protein regions: a population genetics approachAlexander L. Cope0Michael A. Gilchrist1Genome Science and Technology, University of TennesseeGenome Science and Technology, University of TennesseeAbstract Background Codon usage bias (CUB), the non-uniform usage of synonymous codons, occurs across all domains of life. Adaptive CUB is hypothesized to result from various selective pressures, including selection for efficient ribosome elongation, accurate translation, mRNA secondary structure, and/or protein folding. Given the critical link between protein folding and protein function, numerous studies have analyzed the relationship between codon usage and protein structure. The results from these studies have often been contradictory, likely reflecting the differing methods used for measuring codon usage and the failure to appropriately control for confounding factors, such as differences in amino acid usage between protein structures and changes in the frequency of different structures with gene expression. Results Here we take an explicit population genetics approach to quantify codon-specific shifts in natural selection related to protein structure in S. cerevisiae and E. coli. Unlike other metrics of codon usage, our approach explicitly separates the effects of natural selection, scaled by gene expression, and mutation bias while naturally accounting for a region’s amino acid usage. Bayesian model comparisons suggest selection on codon usage varies only slightly between helix, sheet, and coil secondary structures and, similarly, between structured and intrinsically-disordered regions. Similarly, in contrast to prevous findings, we find selection on codon usage only varies slightly at the termini of helices in E. coli. Using simulated data, we show this previous work indicating “non-optimal” codons are enriched at the beginning of helices in S. cerevisiae was due to failure to control for various confounding factors (e.g. amino acid biases, gene expression, etc.), and rather than selection to modulate cotranslational folding. Conclusions Our results reveal a weak relationship between codon usage and protein structure, indicating that differences in selection on codon usage between structures are slight. In addition to the magnitude of differences in selection between protein structures being slight, the observed shifts appear to be idiosyncratic and largely codon-specific rather than systematic reversals in the nature of selection. Overall, our work demonstrates the statistical power and benefits of studying selective shifts on codon usage or other genomic features from an explicitly evolutionary approach. Limitations of this approach and future potential research avenues are discussed.https://doi.org/10.1186/s12864-022-08635-0Codon usage biasProtein foldingProtein secondary structurePopulation genetics |
spellingShingle | Alexander L. Cope Michael A. Gilchrist Quantifying shifts in natural selection on codon usage between protein regions: a population genetics approach BMC Genomics Codon usage bias Protein folding Protein secondary structure Population genetics |
title | Quantifying shifts in natural selection on codon usage between protein regions: a population genetics approach |
title_full | Quantifying shifts in natural selection on codon usage between protein regions: a population genetics approach |
title_fullStr | Quantifying shifts in natural selection on codon usage between protein regions: a population genetics approach |
title_full_unstemmed | Quantifying shifts in natural selection on codon usage between protein regions: a population genetics approach |
title_short | Quantifying shifts in natural selection on codon usage between protein regions: a population genetics approach |
title_sort | quantifying shifts in natural selection on codon usage between protein regions a population genetics approach |
topic | Codon usage bias Protein folding Protein secondary structure Population genetics |
url | https://doi.org/10.1186/s12864-022-08635-0 |
work_keys_str_mv | AT alexanderlcope quantifyingshiftsinnaturalselectiononcodonusagebetweenproteinregionsapopulationgeneticsapproach AT michaelagilchrist quantifyingshiftsinnaturalselectiononcodonusagebetweenproteinregionsapopulationgeneticsapproach |