2013
DOI: 10.1093/nar/gkt890

De novo prediction of DNA-binding specificities for Cys2His2 zinc finger proteins

Abstract: Proteins with sequence-specific DNA binding function are important for a wide range of biological activities. De novo prediction of their DNA-binding specificities from sequence alone would be a great aid in inferring cellular networks. Here we introduce a method for predicting DNA-binding specificities for Cys2His2 zinc fingers (C2H2-ZFs), the largest family of DNA-binding proteins in metazoans. We develop a general approach, based on empirical calculations of pairwise amino acid–nucleotide interaction energi…

Cited by 195 publications (227 citation statements)
References 60 publications (74 reference statements)
“…These included the alleles for the mice used in our study. We generated a PWM of the putative binding site for each of these 74 alleles using the polynomial SVM and the method of Persikov et al (2009) and Persikov and Singh (2014) and then compared each motif to the set of all 74 binding sites using STAMP (Mahony and Benos 2007). We found that all motifs aligned to the binding site for their respective allele of PRDM9 with an E-value <0.005 (Supplemental Table S2).…”
Section: Identification Of Sequence Motifs
mentioning
confidence: 99%
“…The identities of these amino acids are the principle determinants of the DNA sequence recognized (Supplemental Fig. S2a,b; Gupta et al 2014; Persikov and Singh 2014).…”
mentioning
confidence: 99%
“…Most commonly, proceeding leftward in the amino acid sequence toward the N terminus, residues at positions −1, −4, and −7 (or −8) make base-specific contacts through their side chains; the identities of these amino acids are the principle determinants of the DNA sequence recognized (Supplemental Fig. S1A), although by no means the only ones (Gupta et al 2014; Persikov and Singh 2014).…”
mentioning
confidence: 99%