Structural Phylogenomics Retrodicts the Origin of the Genetic Code and Uncovers the Evolutionary Impact of Protein Flexibility

Caetano-Anollés, Gustavo; Wang, Minglei; Caetano-Anollés, Derek

doi:10.1371/journal.pone.0072225

Cited by 62 publications

(94 citation statements)

References 118 publications

(244 reference statements)

“…The most serious challenge to the various scenarios described in this review is the significant evidence from Koonin’s group [149,171,159–161] and others [53,172] that speciation of the Class I aaRS did not occur until relatively late in the generation of the proteome. Those studies are based on the most current thinking on phylogenetic reconstruction, yet they appear to be inconsistent with a fundamental role for objects like the bi-directional Protozyme gene products near the root of the proteome.…”

Section: Remaining Questions

mentioning

confidence: 99%

Coding of Class I and II Aminoacyl-tRNA Synthetases

Carter

2017

Advances in Experimental Medicine and Biology

101

SUMMARY The aminoacyl-tRNA synthetases and their cognate transfer RNAs translate the universal genetic code. The twenty canonical amino acids are sufficiently diverse to create a selective advantage for dividing amino acid activation between two distinct, apparently unrelated superfamilies of synthetases, Class I amino acids being generally larger and less polar, Class II amino acids smaller and more polar. Biochemical, bioinformatic, and protein engineering experiments support the hypothesis that the two Classes descended from opposite strands of the same ancestral gene. Parallel experimental deconstructions of Class I and II synthetases reveal parallel losses in catalytic proficiency at two novel modular levels—protozymes and Urzymes—associated with the evolution of catalytic activity. Bi-directional coding supports an important unification of the proteome; affords a genetic relatedness metric—middle base-pairing frequencies in sense/antisense alignments—that probes more deeply into the evolutionary history of translation than do single multiple sequence alignments; and has facilitated the analysis of hitherto unknown coding relationships in tRNA sequences. Reconstruction of native synthetases by modular thermodynamic cycles facilitated by domain engineering emphasizes the subtlety associated with achieving high specificity, shedding new light on allosteric relationships in contemporary synthetases. Synthetase Urzyme structural biology suggests that they are catalytically active molten globules, broadening the potential manifold of polypeptide catalysts accessible to primitive genetic coding and motivating revisions of the origins of catalysis. Finally, bi-directional genetic coding of some of the oldest genes in the proteome places major limitations on the likelihood that any RNA World preceded the origins of coded proteins.

“…Systematic protein structure classifications, SCOP (54) and CATH (55), fail to identify Urzymes of either aaRS class as ancestral forms (10). However, aaRS Urzymes represent plausible ancestors for a wide spectrum of contemporary proteins.…”

Section: Phylogenetics/genomics

mentioning

confidence: 99%

“…[bar]rier to accessing early developmental evolution; phylogenetic trees based on multiple sequence alignments representing essentially modern enzymes lose coherence at that stage and do not root in the invariant structural cores (10).…”

mentioning

confidence: 99%

Urzymology: Experimental Access to a Key Transition in the Appearance of Enzymes

Carter

2014

Journal of Biological Chemistry

Urzymes are catalysts derived from invariant cores of protein superfamilies. Urzymes from both aminoacyl-tRNA synthetase classes possess sophisticated catalytic mechanisms: pre-steady state bursts and significant transition-state stabilization of both amino acid activation and tRNA acylation. However, they have insufficient specificity to ensure a fully developed genetic code, suggesting that they participated in synthesizing statistical proteins. They represent a robust experimental platform from which to articulate and test hypotheses both about their own ancestors and about how they, in turn, evolved into modern enzymes. They help reshape numerous paradigms from the RNA World hypothesis to protein structure databases and allostery.

“…The catalytic domains of classes I and II aaRS enzymes (belonging to SCOP families c.26.1.1 and d.104.1.1, respectively) are the first to appear in the timeline ~3.7 Gy ago (Caetano-Anollés et al., 2013). These domains harbor pre-transfer and post-transfer editing and trans-editing activities.…”

Section: The Cloverleaf Structure of tRNA Unfolds Early in Evolution

mentioning

confidence: 99%