Quantifying shifts in natural selection on codon usage between protein regions: a population genetics approach

Abstract Background Codon usage bias (CUB), the non-uniform usage of synonymous codons, occurs across all domains of life. Adaptive CUB is hypothesized to result from various selective pressures, including selection for efficient ribosome elongation, accurate translation, mRNA secondary structure, a...

Full description

Bibliographic Details
Main Authors: Alexander L. Cope, Michael A. Gilchrist
Format: Article
Language:English
Published: BMC 2022-05-01
Series:BMC Genomics
Subjects:
Online Access:https://doi.org/10.1186/s12864-022-08635-0
_version_ 1818533741413269504
author Alexander L. Cope
Michael A. Gilchrist
author_facet Alexander L. Cope
Michael A. Gilchrist
author_sort Alexander L. Cope
collection DOAJ
description Abstract Background Codon usage bias (CUB), the non-uniform usage of synonymous codons, occurs across all domains of life. Adaptive CUB is hypothesized to result from various selective pressures, including selection for efficient ribosome elongation, accurate translation, mRNA secondary structure, and/or protein folding. Given the critical link between protein folding and protein function, numerous studies have analyzed the relationship between codon usage and protein structure. The results from these studies have often been contradictory, likely reflecting the differing methods used for measuring codon usage and the failure to appropriately control for confounding factors, such as differences in amino acid usage between protein structures and changes in the frequency of different structures with gene expression. Results Here we take an explicit population genetics approach to quantify codon-specific shifts in natural selection related to protein structure in S. cerevisiae and E. coli. Unlike other metrics of codon usage, our approach explicitly separates the effects of natural selection, scaled by gene expression, and mutation bias while naturally accounting for a region’s amino acid usage. Bayesian model comparisons suggest selection on codon usage varies only slightly between helix, sheet, and coil secondary structures and, similarly, between structured and intrinsically-disordered regions. Similarly, in contrast to prevous findings, we find selection on codon usage only varies slightly at the termini of helices in E. coli. Using simulated data, we show this previous work indicating “non-optimal” codons are enriched at the beginning of helices in S. cerevisiae was due to failure to control for various confounding factors (e.g. amino acid biases, gene expression, etc.), and rather than selection to modulate cotranslational folding. Conclusions Our results reveal a weak relationship between codon usage and protein structure, indicating that differences in selection on codon usage between structures are slight. In addition to the magnitude of differences in selection between protein structures being slight, the observed shifts appear to be idiosyncratic and largely codon-specific rather than systematic reversals in the nature of selection. Overall, our work demonstrates the statistical power and benefits of studying selective shifts on codon usage or other genomic features from an explicitly evolutionary approach. Limitations of this approach and future potential research avenues are discussed.
first_indexed 2024-12-11T18:02:44Z
format Article
id doaj.art-28b26c944ed04d97a1ea73f74890cbea
institution Directory Open Access Journal
issn 1471-2164
language English
last_indexed 2024-12-11T18:02:44Z
publishDate 2022-05-01
publisher BMC
record_format Article
series BMC Genomics
spelling doaj.art-28b26c944ed04d97a1ea73f74890cbea2022-12-22T00:55:49ZengBMCBMC Genomics1471-21642022-05-0123111910.1186/s12864-022-08635-0Quantifying shifts in natural selection on codon usage between protein regions: a population genetics approachAlexander L. Cope0Michael A. Gilchrist1Genome Science and Technology, University of TennesseeGenome Science and Technology, University of TennesseeAbstract Background Codon usage bias (CUB), the non-uniform usage of synonymous codons, occurs across all domains of life. Adaptive CUB is hypothesized to result from various selective pressures, including selection for efficient ribosome elongation, accurate translation, mRNA secondary structure, and/or protein folding. Given the critical link between protein folding and protein function, numerous studies have analyzed the relationship between codon usage and protein structure. The results from these studies have often been contradictory, likely reflecting the differing methods used for measuring codon usage and the failure to appropriately control for confounding factors, such as differences in amino acid usage between protein structures and changes in the frequency of different structures with gene expression. Results Here we take an explicit population genetics approach to quantify codon-specific shifts in natural selection related to protein structure in S. cerevisiae and E. coli. Unlike other metrics of codon usage, our approach explicitly separates the effects of natural selection, scaled by gene expression, and mutation bias while naturally accounting for a region’s amino acid usage. Bayesian model comparisons suggest selection on codon usage varies only slightly between helix, sheet, and coil secondary structures and, similarly, between structured and intrinsically-disordered regions. Similarly, in contrast to prevous findings, we find selection on codon usage only varies slightly at the termini of helices in E. coli. Using simulated data, we show this previous work indicating “non-optimal” codons are enriched at the beginning of helices in S. cerevisiae was due to failure to control for various confounding factors (e.g. amino acid biases, gene expression, etc.), and rather than selection to modulate cotranslational folding. Conclusions Our results reveal a weak relationship between codon usage and protein structure, indicating that differences in selection on codon usage between structures are slight. In addition to the magnitude of differences in selection between protein structures being slight, the observed shifts appear to be idiosyncratic and largely codon-specific rather than systematic reversals in the nature of selection. Overall, our work demonstrates the statistical power and benefits of studying selective shifts on codon usage or other genomic features from an explicitly evolutionary approach. Limitations of this approach and future potential research avenues are discussed.https://doi.org/10.1186/s12864-022-08635-0Codon usage biasProtein foldingProtein secondary structurePopulation genetics
spellingShingle Alexander L. Cope
Michael A. Gilchrist
Quantifying shifts in natural selection on codon usage between protein regions: a population genetics approach
BMC Genomics
Codon usage bias
Protein folding
Protein secondary structure
Population genetics
title Quantifying shifts in natural selection on codon usage between protein regions: a population genetics approach
title_full Quantifying shifts in natural selection on codon usage between protein regions: a population genetics approach
title_fullStr Quantifying shifts in natural selection on codon usage between protein regions: a population genetics approach
title_full_unstemmed Quantifying shifts in natural selection on codon usage between protein regions: a population genetics approach
title_short Quantifying shifts in natural selection on codon usage between protein regions: a population genetics approach
title_sort quantifying shifts in natural selection on codon usage between protein regions a population genetics approach
topic Codon usage bias
Protein folding
Protein secondary structure
Population genetics
url https://doi.org/10.1186/s12864-022-08635-0
work_keys_str_mv AT alexanderlcope quantifyingshiftsinnaturalselectiononcodonusagebetweenproteinregionsapopulationgeneticsapproach
AT michaelagilchrist quantifyingshiftsinnaturalselectiononcodonusagebetweenproteinregionsapopulationgeneticsapproach