SignificanceWe demonstrate a high-accuracy method for predicting enhancers genome-wide with >85% precision as validated by transgenic reporter assays in Drosophila embryos. This accuracy in a metazoan system enables us to predict with high confidence 1,640 enhancers genome-wide that participate in body segmentation during early development. The predicted enhancers are demarcated by heterogeneous collections of epigenetic marks; many strong enhancers are free from classic indicators of activity, including H3K27ac, but are bound by key transcription factors.
Inorganic fullerene-like WS 2 and MoS 2 nanoparticles have been synthesized using exclusively solid precursors, by reaction of the corresponding metal oxide nanopowder, sulfur and a hydrogen-releasing agent (NaBH 4 or LiAlH 4 ), achieved either by conventional furnace heating up to ~900 °C or by photothermal ablation at far higher temperatures driven by highly concentrated white light. In contrast to the established syntheses that require toxic and hazardous gases, working solely with solid precursors permits relatively safer reactor conditions conducive to industrial scale-up.
Identifying functional enhancers elements in metazoan systems is a major challenge.For example, large-scale validation of enhancers predicted by ENCODE reveal false positive rates of at least 70%. Here we use the pregrastrula patterning network of Drosophila melanogaster to demonstrate that loss in accuracy in held out data results from heterogeneity of functional signatures in enhancer elements. We show that two classes of enhancer are active during early Drosophila embryogenesis and that by focusing on a single, relatively homogeneous class of elements, over 98% prediction accuracy can be achieved in a balanced, completely held-out test set. The class of well predicted elements is composed predominantly of enhancers driving multi-stage, segmentation patterns, which we designate segmentation driving enhancers (SDE). Prediction is driven by the DNA occupancy of early developmental transcription factors, with almost no additional power derived from histone modifications. We further show that improved accuracy is not a property of a particular prediction method: after conditioning on the SDE set, naïve Bayes and logistic regression perform as well as more sophisticated tools. Applying this method to a genome-wide scan, we predict 1,640 SDEs that cover 1.6% of the genome, 916 of which are novel. An analysis of 32 novel SDEs using wholemount embryonic imaging of stably integrated reporter constructs chosen throughout our prediction rank-list showed >90% drove expression patterns. We achieved 86.7% precision on a genome-wide scan, with an estimated recall of at least 98%, indicating high accuracy and completeness in annotating this class of functional elements.. CC-BY-ND 4.0 International license It is made available under a (which was not peer-reviewed) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity.The copyright holder for this preprint . http://dx.doi.org/10.1101/250241 doi: bioRxiv preprint first posted online Jan. 18, 2018; 3 Significance StatementWe demonstrate a high accuracy method for predicting enhancers genome wide with > 85% precision as validated by transgenic reporter assays in Drosophila embryos. This is the first time such accuracy has been achieved in a metazoan system, allowing us to predict with highconfidence 1640 enhancers, 916 of which are novel. The predicted enhancers are demarcated by heterogeneous collections of epigenetic marks; many strong enhancers are free from classical indicators of activity, including H3K27ac, but are bound by key transcription factors.H3K27ac, often used as a one-dimensional predictor of enhancer activity, is an uninformative parameter in our data.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.