6Identifying the molecular mechanisms that control differential gene expression (DE) is a major goal of basic and disease 7 biology. Combining the strengths of systems biology and deep learning in a model called DEcode, we are able to predict 8 DE more accurately than traditional sequence-based methods, which do not utilize systems biology data. To determine 9 the biological origins of this accuracy, we identify the most predictive regulators and types of regulatory interactions in 10 DEcode, contrasting their roles across many human tissues. Diverse systems biology, ontological and disease-related 11 assessments all point to the predominant influence of post-translational RNA-binding factors on DE. Through the 12 combinatorial gene regulation that is captured in DEcode, it is even possible to predict relatively subtle person-to-person 13 variation in gene expression. We demonstrate the broad applicability of these clinically-relevant predictions by predicting 14 drivers of aging throughout the human lifespan, gene coexpression relationships on a genome-wide scale, and frequent 15 DE in diverse conditions. Researchers can freely access DEcode to utilize genomic big data in identifying influential 16 molecular mechanisms for any human expression data -www.differentialexpression.org. 17 18 While all human cells share DNA sequences, gene regulation differs among cell types and developmental stages, and in 19 response to environmental cues and stimuli. Accordingly, when gene expression is not properly regulated, cellular 20 homeostasis can be perturbed, often affecting cell function and leading to disease 1 . These distinctions between cell 21 states are observed as differential expression (DE) of gene transcripts. DE have been cataloged for tens of thousands of 22 gene expression datasets, in the context of distinctions between species, organs, and conditions. Despite the important 23These particularly challenging predictions are validated on a genome-wide scale, as we identify key drivers of 49 coexpression, and also drivers for phenotype-associated differential expression. These tests and applications indicate 50 DEcode can combine multiple recent data sources, to extract regulators for arbitrary human DE signatures. 51 3 Results 52Promoter and RNA features predict differential expression across human tissues 53The overarching goal of this study is to accurately predict gene expression as a function of molecular interactions. These 54 results should be tissue-specific, but also highlight major regulatory principles across tissues, and ideally have sufficient 55 accuracy to predict the relatively small expression changes observed between individual humans. To accomplish this, we 56 utilized deep convolutional neural networks in a system called DEcode that can predict inter-tissue variations and inter-57 person variations in gene expression levels from promoter and mRNA features (Figure 1). The promoter features 58 included: the genomic locations of binding sites of 762 TFs and the mRNA features encompassed the loca...
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.