Enhancer architecture and chromatin accessibility constrain phenotypic space during development

Galupa, Rafael; Alvarez-Canales, Gilberto; Borst, Noa Ottilie; Fuqua, Timothy; Gándara, Lautaro; Misunou, Natalia; Richter, Kerstin; Alves, Mariana R. P.; Karumbi, Esther; Perkins, Melinda Liu; Kocijan, Tin; Rushlow, Christine; Crocker, Justin

doi:10.1101/2022.06.02.494376

Cited by 7 publications

(5 citation statements)

References 107 publications


“…Nonetheless, testing random sequences within the context of developing embryos could provide another source of data to understand how enhancers encode tissue-specific expression [90]. In the future, integration of genomic regions, synthetic designed, and random sequences will contribute to our understanding of enhancer grammar. Despite the complexity of studying enhancers in developing embryos, our study demonstrates that enhancer grammar is critical for encoding notochord activity and our observation of the same logics and grammar signatures in both Ciona and vertebrates hints at conservation of these grammatical constraints across chordates.…”

Section: Discussion (mentioning)

confidence: 99%

Diverse logics and grammar encode notochord enhancers

Song, Ragsac, Tellez et al. 2023, Cell Reports


“…Testing random sequences has proven to be an effective strategy for isolating functional nucleic acids [104] and proteins [105], learning 5'UTR [106,107], splicing [108,109], and polyadenylation logic [110]. Importantly, random sequences have developmental enhancer activity [111], and we recently showed that random DNA provides ideal data for deciphering cis-regulation [14,25,31]. For instance, our (de Boer) recent proof-of-principle study in yeast showed that random DNA had diverse cis-regulatory activity, and this enabled us to design an experiment for simultaneous quantification of the expression levels encoded by each of >30 million random 80 bp sequences (~2.4Gb of regulatory DNA; 184 yeast genome equivalents) [25].…”

Section: Synthetic DNA as a Potential Solution (mentioning)

confidence: 99%

“…While learning cis-regulation from measurements of random DNA activity may seem counter-intuitive, the reason why this works is simple: there are many TFs (~1,639 in humans [2]), and their binding sites occur frequently by chance [25,111,112], enabling one to select for binding sites for individual TFs or pairs of TFs from random DNA [37,113,114]. While a random sequence having a binding site for a specific TF is unlikely, it is almost certain to have binding sites for some TF, leading to high expression diversity among random sequences [25], and distinct expression programs across cell types [111,115]. This propensity for random sequences to frequently be active is also consistent with the soft regulatory syntax of the billboard model [23].…”

Section: Synthetic DNA as a Potential Solution (mentioning)

confidence: 99%

Hold out the genome: A roadmap to solving the cis-regulatory code

Boer, Taipale 2023, Preprint


Gene expression is regulated by transcription factors that work together to read cis-regulatory DNA sequences. The “cis-regulatory code” - the rules that cells use to determine when, where, and how much genes should be expressed - has proven to be exceedingly complex, but recent advances in the scale and resolution of functional genomics assays and Machine Learning have enabled significant progress towards deciphering this code. However, we will likely never solve the cis-regulatory code if we restrict ourselves to models trained only on genomic sequences; regions of homology can easily lead to overestimation of predictive performance, and there is insufficient sequence diversity in our genomes to learn all relevant parameters. Fortunately, randomly synthesized DNA sequences enable us to test a far larger sequence space than exists in our genomes in each experiment, and designed DNA sequences enable a targeted query of the sequence space to maximally improve the models. Since cells use the same biochemical principles to interpret DNA regardless of its source, models that are trained on these synthetic data can predict genomic activity, often better than genome-trained models. Here, we provide an outlook on the field, and propose a roadmap towards solving the cis-regulatory code by training models exclusively on non-genomic DNA sequences, and using genomic sequences solely for evaluating the resulting models.
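The citation statement quoted above argues that a binding site for one specific TF is unlikely in a random sequence, yet a site for some TF is almost certain. A rough back-of-the-envelope sketch of that argument follows. It is not the authors' calculation: it assumes uniform base composition, treats every TF as recognising one exact 8 bp motif (a hypothetical simplification, since real motifs are degenerate and vary in length), and treats positions and TFs as independent. The 80 bp length, the >30 million library size, and the ~1,639 TF count are taken from the quoted text.

```python
def p_site(seq_len: int, motif_len: int, both_strands: bool = True) -> float:
    """Approximate probability that a uniformly random sequence contains
    at least one exact match to a single fixed motif.

    Assumption: every window is treated as an independent trial.
    """
    windows = max(seq_len - motif_len + 1, 0)
    trials = windows * (2 if both_strands else 1)
    p_match = 0.25 ** motif_len  # chance one window matches exactly
    return 1.0 - (1.0 - p_match) ** trials


def p_any_site(seq_len: int, motif_len: int, n_tfs: int) -> float:
    """Approximate probability that at least one of n_tfs distinct motifs
    occurs, assuming the TFs are independent of one another."""
    return 1.0 - (1.0 - p_site(seq_len, motif_len)) ** n_tfs


# Values from the quoted statements; the 8 bp motif length is hypothetical.
SEQ_LEN = 80
N_TFS = 1639
MOTIF_LEN = 8

one_tf = p_site(SEQ_LEN, MOTIF_LEN)
some_tf = p_any_site(SEQ_LEN, MOTIF_LEN, N_TFS)
print(f"P(site for one specific TF) ~ {one_tf:.4f}")
print(f"P(site for at least one TF) ~ {some_tf:.3f}")

# Library size quoted above: 30 million sequences of 80 bp each.
total_bp = 30_000_000 * SEQ_LEN
print(f"Total regulatory DNA: {total_bp / 1e9:.1f} Gb")
```

Under these simplifying assumptions the per-TF probability is well below 1%, while the probability of matching at least one of the TFs is above 90%, which is the qualitative contrast the quote draws. Degenerate motifs would raise both numbers.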


“…Drosophila embryos were fixed and stained following standard protocols [73]. In particular, stage-5 embryos were acquired from a 5-hr egg-laying window at room temperature.…”

Section: Embryo Fixation Antibody Staining and Fluorescent In Situ Hy... (mentioning)

confidence: 99%

Rapid response of fly populations to gene dosage across development and generations

Li, Gándara, Ekelof et al. 2022, Preprint (self-citation)


It remains unknown how developmental systems evolve in response to variable genetic and environmental conditions. Here, we have examined the evolvability of the classic bicoid network in Drosophila, which is essential for anterior-posterior patterning in the early embryo. This network can be synthetically perturbed by increasing the dosage of bicoid, which causes a posterior shift of the network's regulatory outputs and a decrease in fitness. To directly monitor network evolution across populations with extra copies of bicoid, we performed genome-wide EMS mutagenesis, followed by experimental evolution. After only 8-15 generations, evolved populations have normalized patterns of gene expression and increased survival. Using a phenomics approach, we find that populations normalized through rapid increases in embryo size driven by maternal changes in metabolism and ovariole development. We extend our results to wild populations of flies, demonstrating strong predictability. Together, our results necessitate a broader view of regulatory network evolution at the systems level. This study highlights the power of synthetic evolution using animal systems, a generalizable platform for the dissection of gene regulation and complex genomes.

