2022
DOI: 10.1101/gr.276606.122
|View full text |Cite
|
Sign up to set email alerts
|

Learning probabilistic protein–DNA recognition codes from DNA-binding specificities using structural mappings

Abstract: Knowledge of how proteins interact with DNA is essential for understanding gene regulation. Although DNA-binding specificities for thousands of transcription factors (TFs) have been determined, the specific amino acid–base interactions comprising their structural interfaces are largely unknown. This lack of resolution hampers attempts to leverage these data in order to predict specificities for uncharacterized TFs or TFs mutated in disease. Here we introduce recognition code learning via automated mapping of p… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
7
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 12 publications
(10 citation statements)
references
References 61 publications
0
7
0
Order By: Relevance
“…In this context, DeepPBS operates on these predicted complexes to yield the binding specificity of the system, which can guide improved modeling of protein-DNA complexes. DeepPBS, despite its generality, exhibits comparable performance to the recently described family-specific method rCLAMPS 28 .…”
Section: Discussionmentioning
confidence: 96%
See 2 more Smart Citations
“…In this context, DeepPBS operates on these predicted complexes to yield the binding specificity of the system, which can guide improved modeling of protein-DNA complexes. DeepPBS, despite its generality, exhibits comparable performance to the recently described family-specific method rCLAMPS 28 .…”
Section: Discussionmentioning
confidence: 96%
“…3e-g). We show that this pipeline is competitive with the recent family-specific model rCLAMPS 28 (Fig. 3h-i) while being more generalizable: specifically, DeepPBS is family-agnostic, can handle biological assemblies, and can predict flanking preferences.…”
Section: Introductionmentioning
confidence: 90%
See 1 more Smart Citation
“…In this work we have developed a structurebased approach to predict specific binding motifs of TFs, to identify cis-regulatory elements and to automatically model the structure of the transcription complex entailing the regulation. A similar structural-learning approach was recently developed and applied on homeodomain and C2H2-Zf families 63 that predicted the PWMs by mapping experimentally known PWMs in the contacts of the interface of the TF-DNA structure, which in principle (but not suggested) it can be applied to any other TF by providing the structure of the complex with DNA. Our approach has been implemented in a server for the scientific community named ModCRE.…”
Section: Discussionmentioning
confidence: 99%
“…Nevertheless, new tools and methodological approaches are developing to predict regulatory elements in different datasets. In the future, they might help overcome motif discovery limitations in non-model organisms (Ksouri et al, 2021;Novakovsky et al, 2021;Wetzel et al, 2022).…”
Section: Introductionmentioning
confidence: 99%