Cys2His2 zinc fingers (C2H2-ZFs) comprise the largest class of metazoan DNA-binding domains. Despite this domain's well-defined DNA-recognition interface, and its successful use in the design of chimeric proteins capable of targeting genomic regions of interest, much remains unknown about its DNA-binding landscape. To help bridge this gap in fundamental knowledge and to provide a resource for design-oriented applications, we screened large synthetic protein libraries to select binding C2H2-ZF domains for each possible three base pair target. The resulting data consist of >160 000 unique domain–DNA interactions and comprise the most comprehensive investigation of C2H2-ZF DNA-binding interactions to date. An integrated analysis of these independent screens yielded DNA-binding profiles for tens of thousands of domains and led to the successful design and prediction of C2H2-ZF DNA-binding specificities. Computational analyses uncovered important aspects of C2H2-ZF domain–DNA interactions, including the roles of within-finger context and domain position on base recognition. We observed the existence of numerous distinct binding strategies for each possible three base pair target and an apparent balance between affinity and specificity of binding. In sum, our comprehensive data help elucidate the complex binding landscape of C2H2-ZF domains and provide a foundation for efforts to determine, predict and engineer their DNA-binding specificities.
The Cys2His2 zinc finger (ZF) is the most frequently found sequence-specific DNA-binding domain in eukaryotic proteins. The ZF’s modular protein–DNA interface has also served as a platform for genome engineering applications. Despite decades of intense study, a predictive understanding of the DNA-binding specificities of either natural or engineered ZF domains remains elusive. To help fill this gap, we developed an integrated experimental-computational approach to enrich and recover distinct groups of ZFs that bind common targets. To showcase the power of our approach, we built several large ZF libraries and demonstrated their excellent diversity. As proof of principle, we used one of these ZF libraries to select and recover thousands of ZFs that bind several 3-nt targets of interest. We were then able to computationally cluster these recovered ZFs to reveal several distinct classes of proteins, all recovered from a single selection, to bind the same target. Finally, for each target studied, we confirmed that one or more representative ZFs yield the desired specificity. In sum, the described approach enables comprehensive large-scale selection and characterization of ZF specificities and should be a great aid in furthering our understanding of the ZF domain.
Characterizing the molecular interactions of viruses in natural microbial populations offers insights into virus–host dynamics in complex ecosystems. We identify the resistance of Sulfolobus islandicus to Sulfolobus spindle‐shaped virus (SSV9) conferred by chromosomal deletions of pilin genes, pilA1 and pilA2 that are individually able to complement resistance. Mutants with deletions of both pilA1 and pilA2 or the prepilin peptidase, PibD, show the reduction in the number of pilins observed in TEM and reduced surface adherence but still adsorb SSV9. The proteinaceous outer S‐layer proteins, SlaA and SlaB, are not required for adsorption nor infection demonstrating that the S‐layer is not the primary receptor for SSV9 surface binding. Strains lacking both pilins are resistant to a broad panel of SSVs as well as a panel of unrelated S. islandicus rod‐shaped viruses (SIRVs). Unlike SSV9, we show that pilA1 or pilA2 is required for SIRV8 adsorption. In sequenced Sulfolobus strains from around the globe, one copy of each pilA1 and pilA2 is maintained and show codon‐level diversification, demonstrating their importance in nature. By characterizing the molecular interactions at the initiation of infection between S. islandicus and two different types of viruses we hope to increase the understanding of virus–host interactions in the archaeal domain.
Engineered nucleases have transformed biological research and offer great therapeutic potential by enabling the straightforward modification of desired genomic sequences. While many nuclease platforms have proven functional, all can produce unanticipated off-target lesions and have difficulty discriminating between homologous sequences, limiting their therapeutic application. Here we describe a multi-reporter selection system that allows the screening of large protein libraries to uncover variants able to discriminate between sequences with substantial homology. We have used this system to identify zinc-finger nucleases that exhibit high cleavage activity (up to 60% indels) at their targets within the CCR5 and HBB genes and strong discrimination against homologous sequences within CCR2 and HBD. An unbiased screen for off-target lesions using a novel set of CCR5-targeting nucleases confirms negligible CCR2 activity and demonstrates minimal off-target activity genome wide. This system offers a straightforward approach to generate nucleases that discriminate between similar targets and provide exceptional genome-wide specificity.
Gene TarGeTinG and Gene CorreCTion iii integration sites were detected at the TALEN and CRISPR target sites, we did not detect any off-target activity, indicating high specificity. Additionally, K562 cells were nucleofected with TALEN-expressing plasmids and a donor template containing a GFP expression cassette flanked by 800bp homologous sequences to the TRAC locus. Thereby we were able to verify targeted gene addition through HDR of TALEN-mediated DSB in about 10% of treated cells. TALEN and CRISPR that show high efficiency and specificity to their target sequence will be used for generating T-cells with high avidity TCR.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.