2023
DOI: 10.1101/2023.05.11.540401
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

De novodistillation of thermodynamic affinity from deep learning regulatory sequence models ofin vivoprotein-DNA binding

Abstract: Transcription factors (TF) are proteins that bind DNA in a sequence-specific manner to regulate gene transcription. Despite their unique intrinsic sequence preferences, in vivo genomic occupancy profiles of TFs differ across cellular contexts. Hence, deciphering the sequence determinants of TF binding, both intrinsic and context-specific, is essential to understand gene regulation and the impact of regulatory, non-coding genetic variation. Biophysical models trained on in vitro TF binding assays can estimate i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
12
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
2

Relationship

3
3

Authors

Journals

citations
Cited by 7 publications
(13 citation statements)
references
References 118 publications
1
12
0
Order By: Relevance
“…5A), BPNet accurately predicted log-transformed read counts for held-out data ( R 2 = 0.52), with binding profiles that reproduced those observed experimentally (Fig. 5A) ( 86 ). Returned contribution weight matrices (CWMs), which identify short subsequences most predictive of TF binding, revealed E-box–like motifs (CACGTG) that sometimes included a flanking preference for CG dinucleotides, consistent with in vitro preferences (Figs.…”
Section: Resultssupporting
confidence: 70%
See 3 more Smart Citations
“…5A), BPNet accurately predicted log-transformed read counts for held-out data ( R 2 = 0.52), with binding profiles that reproduced those observed experimentally (Fig. 5A) ( 86 ). Returned contribution weight matrices (CWMs), which identify short subsequences most predictive of TF binding, revealed E-box–like motifs (CACGTG) that sometimes included a flanking preference for CG dinucleotides, consistent with in vitro preferences (Figs.…”
Section: Resultssupporting
confidence: 70%
“…This case study of STRs further underscores the limitations of motif-based models in predicting TF occupancy from sequence ( 37 , 86 , 127 130 ), because STRs composed of overlapping instances of even low-affinity sites bearing little resemblance to the known motif can substantially alter binding. Binding of the same TF to dissimilar motifs has previously been reported and attributed to alternate binding modes driven by either entropic or enthalpic effects ( 131 133 ).…”
Section: Discussionmentioning
confidence: 75%
See 2 more Smart Citations
“…The models learn hierarchical layers of de novo sequence pattern detectors that can encode sequence motifs and their higher-order syntax. Interpretation of these models has revealed novel insights into the cisregulatory code of TF binding including sequence preferences and affinity landscapes of individual TFs (Avsec, Weilert, et al, 2021;Alexandari et al, 2023), soft motif syntax mediated TF cooperativity (Avsec, Weilert, et al, 2021;de Almeida et al, 2022), and effects of sequence variation and repeats (Horton et al, 2023;Alexandari et al, 2023;Avsec, Agarwal, et al, 2021;Chen et al, 2022). While neural networks have been used to dissect the sequence basis of chromatin accessibility of diverse cell types (Maslova et al, 2020;Kim et al, 2021;Janssens et al, 2022;Ameen et al, 2022), they have yet to be used to decipher cis-regulatory drivers of quantitative chromatin dynamics from single-cell profiling across continuous cell state transitions during reprogramming.…”
Section: Introductionmentioning
confidence: 99%