Contact: arhatzig@inf.uth.gr Supplementary Fig. 1: Concordance between multimapping loci and corresponding biotypes. Proportion of multimapping reads with associated annotation of the same or different RNA biotype for each of the multimapping loci. A substantial number of reads with two and three multimapping positions associate with annotations of dissimilar small RNA types.
Cell-fate decisions during development are controlled by densely interconnected gene regulatory networks (GRNs) consisting of many genes. Inferring and predictively modeling these GRNs is crucial for understanding development and other physiological processes. Gene circuits, coupled differential equations that represent gene product synthesis with a switch-like function, provide a biologically realistic framework for modeling the time evolution of gene expression. However, their use has been limited to smaller networks due to the computational expense of inferring model parameters from gene expression data using global non-linear optimization. Here we show that the switch-like nature of gene regulation can be exploited to break the gene circuit inference problem into two simpler optimization problems that are amenable to computationally efficient supervised learning techniques. We present FIGR (Fast Inference of Gene Regulation), a novel classification-based inference approach to determining gene circuit parameters. We demonstrate FIGR’s effectiveness on synthetic data generated from random gene circuits of up to 50 genes as well as experimental data from the gap gene system of Drosophila melanogaster, a benchmark for inferring dynamical GRN models. FIGR is faster than global non-linear optimization by a factor of 600 and its computational complexity scales much better with GRN size. On a practical level, FIGR can accurately infer the biologically realistic gap gene network in under a minute on desktop-class hardware instead of requiring hours of parallel computing. We anticipate that FIGR would enable the inference of much larger biologically realistic GRNs than was possible before.
Cellular differentiation during hematopoiesis is guided by gene regulatory networks (GRNs) comprising transcription factors (TFs) and the effectors of cytokine signaling. Based largely on analyses conducted at steady state, these GRNs are thought to be organized as a hierarchy of bistable switches, with antagonism between Gata1 and PU.1 driving red- and white-blood cell differentiation. Here, we utilize transient gene expression patterns to infer the genetic architecture—the type and strength of regulatory interconnections—and dynamics of a twelve-gene GRN including key TFs and cytokine receptors. We trained gene circuits, dynamical models that learn genetic architecture, on high temporal-resolution gene-expression data from the differentiation of an inducible cell line into erythrocytes and neutrophils. The model is able to predict the consequences of gene knockout, knockdown, and overexpression experiments and the inferred interconnections are largely consistent with prior empirical evidence. The inferred genetic architecture is densely interconnected rather than hierarchical, featuring extensive cross-antagonism between genes from alternative lineages and positive feedback from cytokine receptors. The analysis of the dynamics of gene regulation in the model reveals that PU.1 is one of the last genes to be upregulated in neutrophil conditions and that the upregulation of PU.1 and other neutrophil genes is driven by Cebpa and Gfi1 instead. This model inference is confirmed in an independent single-cell RNA-Seq dataset from mouse bone marrow in which Cebpa and Gfi1 expression precedes the neutrophil-specific upregulation of PU.1 during differentiation. These results demonstrate that full PU.1 upregulation during neutrophil development involves regulatory influences extrinsic to the Gata1-PU.1 bistable switch. Furthermore, although there is extensive cross-antagonism between erythroid and neutrophil genes, it does not have a hierarchical structure. More generally, we show that the combination of high-resolution time series data and data-driven dynamical modeling can uncover the dynamics and causality of developmental events that might otherwise be obscured.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.