Storage and directed transfer of information is the key requirement for the development of life. Yet any information stored on our genes is useless without its correct interpretation. The genetic code defines the rule set to decode this information. Aminoacyl-tRNA synthetases are at the heart of this process. We extensively characterize how these enzymes distinguish all natural amino acids based on the computational analysis of crystallographic structure data. The results of this meta-analysis show that the correct read-out of genetic information is a delicate interplay between the composition of the binding site, non-covalent interactions, error correction mechanisms, and steric effects. One of the most profound open questions in biology is how the genetic code was established. While proteins are encoded by nucleic acid blueprints, decoding this information in turn requires proteins. The emergence of this self-referencing system poses a chicken-or-egg dilemma and its origin is still heavily debated 1,2. Aminoacyl-tRNA synthetases (aaRSs) implement the correct assignment of amino acids to their codons and are thus inherently connected to the emergence of genetic coding. These enzymes link tRNA molecules with their amino acid cargo and are consequently vital for protein biosynthesis. Beside the correct recognition of tRNA features 3 , highly specific non-covalent interactions in the binding sites of aaRSs are required to correctly detect the designated amino acid 4-7 and to prevent errors in biosynthesis 5,8. The minimization of such errors represents the utmost barrier for the development of biological complexity 9 and accurate specification of aaRS binding sites is proposed to be one of the major determinants for the closure of the genetic code 10. Beside binding side features, recognition fidelity is controlled by the ratio of concentrations of aaRSs and cognate tRNA molecules 11 and may involve spatial secondary structures motifs in addition to side chain configurations 12,13. Evolution. The evolutionary origin of aaRSs is hard to track. Phylogenetic analyses of aaRS sequences show that they do not follow the standard model of life 14 ; the development of aaRSs was nearly complete before the Last Universal Common Ancestor (LUCA) 15,16. Their complex evolutionary history included horizontal gene transfer, fusion, duplication, and recombination events 14,17-21. Sequence analyses 22 and subsequent structure investigations 23,24 revealed that aaRSs can be divided into two distinct classes (Class I and Class II) that share no similarities at sequence or structure level. Each of the classes is responsible for 10 of the 20 proteinogenic amino acids and can be further grouped into subclasses 15. One exception to this class separation rule is lysyl-tRNA synthetase (LysRS), where euryarchaeal genomes were shown to contain a Class I form 25 instead of the standard Class II form. Most eukaryotic genomes contain the complete set of 20 aaRSs. However, some species lack certain aaRS-encoding genes and compensate for this by ...