CRISPR-Cas9-based genetic screens are a powerful new tool in biology. By simply altering the sequence of the single-guide RNA (sgRNA), Cas9 can be reprogrammed to target different sites in the genome with relative ease, but the on-target activity and off-target effects of individual sgRNAs can vary widely. Here, we use recently-devised sgRNA design rules to create human and mouse genome-wide libraries, perform positive and negative selection screens and observe that the use of these rules produced improved results. Additionally, we profile the off-target activity of thousands of sgRNAs and develop a metric to predict off-target sites. We incorporate these findings from large-scale, empirical data to improve our computational design rules and create optimized sgRNA libraries that maximize on-target activity and minimize off-target effects to enable more effective and efficient genetic screens and genome engineering.
Components of the prokaryotic clustered regularly interspersed
palindromic repeat (CRISPR) loci have recently been repurposed for use in
mammalian cells1–6. The Cas9 protein can be
programmed with a single guide RNA (sgRNA) to generate site-specific DNA breaks,
but there are few known rules governing on-target efficacy of this
system7,8. We created a pool of sgRNAs, tiling across all
possible target sites of a panel of six endogenous mouse and three endogenous
human genes and quantitatively assessed their ability to produce null alleles of
their target gene by antibody staining and flow cytometry. We discovered
sequence features that improved activity, including a further optimization of
the proto-spacer adjacent motif (PAM) of Streptococcus pyogenes
Cas9. The results from 1,841 sgRNAs were used to construct a predictive model of
sgRNA activity to improve sgRNA design for gene editing and genetic screens. We
provide an online tool for the design of highly active sgRNAs for any gene of
interest.
The creation of genome-wide libraries for CRISPR knockout (CRISPRko), interference (CRISPRi), and activation (CRISPRa) has enabled the systematic interrogation of gene function. Here, we show that our recently-described CRISPRko library (Brunello) is more effective than previously published libraries at distinguishing essential and non-essential genes, providing approximately the same perturbation-level performance improvement over GeCKO libraries as GeCKO provided over RNAi. Additionally, we present genome-wide libraries for CRISPRi (Dolcetto) and CRISPRa (Calabrese), and show in negative selection screens that Dolcetto, with fewer sgRNAs per gene, outperforms existing CRISPRi libraries and achieves comparable performance to CRISPRko in detecting essential genes. We also perform positive selection CRISPRa screens and demonstrate that Calabrese outperforms the SAM approach at identifying vemurafenib resistance genes. We further compare CRISPRa to genome-scale libraries of open reading frames (ORFs). Together, these libraries represent a suite of genome-wide tools to efficiently interrogate gene function with multiple modalities.
Combinatorial genetic screening using CRISPR-Cas9 is a useful approach to uncover redundant genes and to explore complex gene networks. However, current approaches suffer from interference between the single-guide RNAs (sgRNAs) and from limited gene targeting activity. To increase the efficiency of combinatorial screening, we employ orthogonal Cas9 enzymes from S. aureus and S. pyogenes. We used machine learning to establish S. aureus Cas9 sgRNA design rules and paired S. aureus Cas9 with S. pyogenes Cas9 to achieve dual targeting in a high fraction of cells. We also developed a lentiviral vector and cloning strategy to generate high-complexity pooled dual-knockout libraries to identify synthetic lethal and buffering gene pairs across multiple cell types, including MAPK pathway genes and apoptotic genes. Our orthologous approach enabled a screen combining gene knockouts with transcriptional activation, which revealed genetic interactions with TP53. The “Big Papi” (Paired aureus and pyogenes for intereactions) approach described here will be widely applicable for the study of combinatorial phenotypes.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.