The broad spectrum of activities displayed by CRISPR-Cas systems has led to biotechnological innovations that are poised to transform human therapeutics. Therefore, the comprehensive characterization of distinct Cas proteins is highly desirable. Here we expand the repertoire of nucleases for mammalian genome editing using the archetypal Streptococcus thermophilus CRISPR1-Cas9 (St1Cas9). We define functional protospacer adjacent motif (PAM) sequences and variables required for robust and efficient editing in vitro. Expression of holoSt1Cas9 from a single adeno-associated viral (rAAV) vector in the neonatal liver rescued lethality and metabolic defects in a mouse model of hereditary tyrosinemia type I demonstrating effective cleavage activity in vivo. Furthermore, we identified potent anti-CRISPR proteins to regulate the activity of both St1Cas9 and the related type II-A Staphylococcus aureus Cas9 (SaCas9). This work expands the targeting range and versatility of CRISPR-associated enzymes and should encourage studies to determine its structure, genome-wide specificity profile and sgRNA design rules.
INTRODUCTIONClustered regularly interspaced short palindromic repeats (CRISPR) and CRISPR-associated (Cas) proteins form a prokaryotic adaptive immune system and some of its components have been harnessed for robust genome editing 1 . Type II-based editing tools rely on a large multidomain endonuclease, Cas9, guided to its DNA target by an engineered single-guide RNA (sgRNA) chimera 2 (See 3, 4 for a classification of CRISPR-Cas systems). The Cas9-sgRNA binary complex finds its target through recognition of a short sequence called the protospacer adjacent motif (PAM) and subsequent base pairing of the guide RNA with the DNA to generate a specific doublestrand break (DSB) 1,5 . While Streptococcus pyogenes (SpCas9) remains the most widely used Cas9 variant for genome engineering, the diversity of naturally occurring RNA-guided nucleases is astonishing 4 . Hence, Cas9 enzymes from different microbial species can contribute to the expansion of the CRISPR toolset by increasing targeting density, improving activity and specificity as well as easing delivery 1,6 .In principle, engineering complementary CRISPRCas systems from distinct bacterial species should be relatively straightforward, as they have been minimized to only two components. However, many such enzymes were found inactive in human cells despite being accurately reprogrammed for DNA binding and cleavage in vitro [7][8][9][10] . Nevertheless, the full potential of selected enzymes can be unleashed using machine learning to establish sgRNA design rules 11,12 .Perhaps the most striking example of the value of alternative Cas9 enzymes is the implementation of the type II-A Cas9 from Staphylococcus aureus (SaCas9) for in vivo editing using recombinant adeno-associated virus (rAAV) vectors 7,13, 14 . More recently, Campylobacter jejuni and Neisseria meningitidis Cas9s from the type II-C 15 CRISPRCas systems have been added to this repertoire 16,17 .In vivo genome edi...