As the SARS-CoV-2 pandemic is rapidly progressing, the need for the development of an effective vaccine is critical. A promising approach for vaccine development is to generate, through codon pair deoptimization, an attenuated virus. This approach carries the advantage that it only requires limited knowledge specific to the virus in question, other than its genome sequence. Therefore, it is well suited for emerging viruses, for which we may not have extensive data. We performed comprehensive in silico analyses of several features of SARS-CoV-2 genomic sequence (e.g., codon usage, codon pair usage, dinucleotide/junction dinucleotide usage, RNA structure around the frameshift region) in comparison with other members of the coronaviridae family of viruses, the overall human genome, and the transcriptome of specific human tissues such as lung, which are primarily targeted by the virus. Our analysis identified the spike (S) and nucleocapsid (N) proteins as promising targets for deoptimization and suggests a roadmap for SARS-CoV-2 vaccine development, which can be generalizable to other viruses.
Synonymous codons occur with different frequencies in different organisms, a phenomenon termed codon usage bias. Codon optimization, a common term for a variety of approaches used widely by the biopharmaceutical industry, involves synonymous substitutions to increase protein expression. It had long been presumed that synonymous variants, which, by definition, do not alter the primary amino acid sequence, have no effect on protein structure and function. However, a critical mass of reports suggests that synonymous codon variations may impact protein conformation. To investigate the impact of synonymous codons usage on protein expression and function, we designed an optimized coagulation factor IX (FIX) variant and used multiple methods to compare its properties to the wild-type FIX upon expression in HEK293T cells. We found that the two variants differ in their conformation, even when controlling for the difference in expression levels. Using ribosome profiling, we identified robust changes in the translational kinetics of the two variants and were able to identify a region in the gene that may have a role in altering the conformation of the protein. Our data have direct implications for codon optimization strategies, for production of recombinant proteins and gene therapies.
Synonymous variants within coding regions may influence protein expression and function. We have previously reported increased protein expression levels ex vivo (~120% in comparison to wild-type) from a synonymous polymorphism variant, c.354G>A [p.P118P], of the ADAMTS13 gene, encoding a plasma protease responsible for von Willebrand Factor (VWF) degradation. In the current study, we investigated the potential mechanism(s) behind the increased protein expression levels from this variant and its effect on ADAMTS13 physico-chemical properties. Cell-free assays showed enhanced translation of the c.354G>A variant and the analysis of codon usage characteristics suggested that introduction of the frequently used codon/codon pair(s) may have been potentially responsible for this effect. Limited proteolysis, however, showed no substantial influence of altered translation on protein conformation. Analysis of post-translational modifications also showed no notable differences but identified three previously unreported glycosylation markers. Despite these similarities, p.P118P variant unexpectedly showed higher specific activity. Structural analysis using modeled interactions indicated that subtle conformational changes arising from altered translation kinetics could affect interactions between an exosite of ADAMTS13 and VWF resulting in altered specific activity. This report highlights how a single synonymous nucleotide variation can impact cellular expression and specific activity in the absence of measurable impact on protein structure.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.