Most amino acids can be encoded by more than one synonymous codon, but these are rarely used with equal frequency. In many coding sequences the usage patterns of rare versus common synonymous codons is nonrandom and under selection. Moreover, synonymous substitutions that alter these patterns can have a substantial impact on the folding efficiency of the encoded protein. This has ignited broad interest in exploring synonymous codon usage patterns. For many protein chemists, biophysicists and structural biologists, the primary motivation for codon analysis is identifying and preserving usage patterns most likely to impact high-yield production of functional proteins. Here we describe the core functions and new features of %MinMax, a codon usage calculator freely available as a web-based portal and downloadable script (http://www.codons.org). %MinMax evaluates the relative usage frequencies of the synonymous codons used to encode a protein sequence of interest and compares these results to a rigorous null model. Crucially, for analyzing codon usage in common host organisms %MinMax requires only the coding sequence as input; with a user-input codon frequency table, %MinMax can be used to evaluate synonymous codon usage patterns for any coding sequence from any fully sequenced genome. %MinMax makes no assumptions regarding the impact of transfer ribonucleic acid concentrations or other molecular-level interactions on translation rates, yet its output is sufficient to predict the effects of synonymous codon substitutions on cotranslational folding mechanisms. A simple calculation included within %MinMax can be used to harmonize codon usage frequencies for heterologous gene expression.
There is a growing appreciation that synonymous codon usage, although historically regarded as phenotypically silent, can instead alter a wide range of mechanisms related to functional protein production, a term we use here to describe the net effect of transcription (mRNA synthesis), mRNA half-life, translation (protein synthesis) and the probability of a protein folding correctly to its active, functional structure. In particular, recent discoveries have highlighted the important role that sub-optimal codons can play in modifying co-translational protein folding. These results have drawn increased attention to the patterns of synonymous codon usage within coding sequences, particularly in light of the discovery that these patterns can be conserved across evolution for homologous proteins. Because synonymous codon usage differs between organisms, for heterologous gene expression it can be desirable to make synonymous codon substitutions to match the codon usage pattern from the original organism in the heterologous expression host. Here we present CHARMING (for Codon HARMonizING), a robust and versatile algorithm to design mRNA sequences for heterologous gene expression and other related codon harmonization tasks. CHARMING can be run as a downloadable Python script or via a web portal at http://www.codons.org.
%MinMax, a model of intra-gene translational elongation rate, relies on codon usage frequencies. Historically, %MinMax has used tables that measure codon usage bias for all genes in an organism, such as those found at HIVE-CUT. In this paper, we provide evidence that codon usage bias based on all genes is insufficient to accurately measure absolute translation rate. We show that alternative ”High-φ” codon usage tables, generated by another model (ROC-SEMPPR), are a promising alternative. By creating a hybrid model, future codon usage analyses and their applications (e.g., codon harmonization) are likely to more accurately measure the ”tempo” of translation elongation. We also suggest a High- φ alternative to the Codon Adaptation Index (CAI), a classic metric of codon usage bias based on highly expressed genes. Significantly, our new alternative is equally well correlated with empirical data as traditional CAI without using experimentally determined expression counts as input.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.