2018
DOI: 10.1101/316752
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

An information theoretic treatment of sequence-to-expression modeling

Abstract: Studying a gene's regulatory mechanisms is a tedious process that involves identification of candidate regulators by transcription factor (TF) knockout or over-expression experiments, delineation of enhancers by reporter assays, and demonstration of direct TF influence by site mutagenesis, among other approaches. Such experiments are often chosen based on the biologist's intuition, from several testable hypotheses. We pursue the goal of making this process systematic by using ideas from information theory to r… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
1
1

Relationship

2
0

Authors

Journals

citations
Cited by 2 publications
(4 citation statements)
references
References 55 publications
(71 reference statements)
0
4
0
Order By: Relevance
“…When testing three different modes of repression, we were surprised to note that a “direct repression” mechanism, where a bound repressor’s regulatory contribution does not depend on how far its binding site is from an activator’s site, has the same predictive ability as the two “short-range repression” mechanisms tested. In light of literature evidence for short-range repression (55, 56), this finding suggests that discerning true mechanisms from modeling of experimental results ex post facto may not be adequately powered, and that active learning approaches (57, 58) that suggest the most informative future experiments may be called for.…”
Section: Discussionmentioning
confidence: 98%
“…When testing three different modes of repression, we were surprised to note that a “direct repression” mechanism, where a bound repressor’s regulatory contribution does not depend on how far its binding site is from an activator’s site, has the same predictive ability as the two “short-range repression” mechanisms tested. In light of literature evidence for short-range repression (55, 56), this finding suggests that discerning true mechanisms from modeling of experimental results ex post facto may not be adequately powered, and that active learning approaches (57, 58) that suggest the most informative future experiments may be called for.…”
Section: Discussionmentioning
confidence: 98%
“…The model relies upon predetermined binding motifs (position weight matrices) of the TFs to identify and quantify binding site strengths in the enhancer. Here, we used an ensemble of models, trained on an enhancer of the gene intermediate neuroblasts defective (ind) in our previous work (9), and applied it to the wntD enhancer. Binding motifs for Dorsal, Zld, and Cic were obtained from FlyFactorSurvey (21).…”
Section: Methodsmentioning
confidence: 99%
“…We constructed a probability distribution over these models following the procedure in ref. 9 and computed the predicted effect of each singlenucleotide mutation in the enhancer as the average, over this distribution of the RMSE between predicted DV expression profiles with and without the mutation. An "A" to a "C" mutation at position −358 in the 500-bp enhancer, which targets a high-specificity position in a Cic-binding site and had among the largest predicted effects of any single mutation, was chosen for further study.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation