Recent advances in single-cell sequencing technologies have provided unprecedented opportunities to measure the gene expression profile and RNA velocity of individual cells. However, modeling transcriptional dynamics is computationally challenging because of the high-dimensional, sparse nature of the single-cell gene expression measurements and the nonlinear regulatory relationships. Here, we present
DeepVelo
, a neural network–based ordinary differential equation that can model complex transcriptome dynamics by describing continuous-time gene expression changes within individual cells. We apply
DeepVelo
to public datasets from different sequencing platforms to (i) formulate transcriptome dynamics on different time scales, (ii) measure the instability of cell states, and (iii) identify developmental driver genes via perturbation analysis. Benchmarking against the state-of-the-art methods shows that
DeepVelo
can learn a more accurate representation of the velocity field. Furthermore, our perturbation studies reveal that single-cell dynamical systems could exhibit chaotic properties. In summary,
DeepVelo
allows data-driven discoveries of differential equations that delineate single-cell transcriptome dynamics.
A common challenge in neuroscience is how to decompose noisy, multi-source signals measured in experiments into biophysically interpretable components. Analysis of cortical surface electrical potentials (CSEPs) measured using electrocorticography arrays (ECoG) typifies this problem. We hypothesized that high frequency (70-1,000 Hz) CSEPs are composed of broadband (i.e., power-law) and bandlimited components with potentially differing biophysical origins. In particular, the high-gamma band (70-150 Hz) has been shown to be highly predictive for encoding and decoding behaviors and stimuli. Despite its demonstrated importance, whether high-gamma is composed of a bandlimited signal is poorly understood. To address this gap, we recorded CSEPs from rat auditory cortex and demonstrate that the evoked CSEPs are composed of multiple distinct frequency components, including high-gamma. We then show, using a novel robust regression method, that at fast timescales and on single trials during speech production, human high-gamma amplitude cannot be explained by a modulating power-law component; thus, high-gamma is band-limited. Furthermore, we show that the power-law component is less predictive of produced speech compared to the raw high-gamma amplitude. Finally, we show that the largest variance component of human ECoG signals is low-frequency and band-limited, not broadband. Together these results demonstrate that there are multiple, band-limited components of high frequency power in cortical surface electrical potentials, including the high-gamma band, which may have different biophysical origins.
Mapping chromatin insulator loops is crucial to investigating genome evolution, elucidating critical biological functions, and ultimately quantifying variant impact in diseases. However, chromatin conformation profiling assays are usually expensive, time-consuming, and may report fuzzy insulator annotations with low resolution. Therefore, we propose a weakly supervised deep learning method, InsuLock, to address these challenges. Specifically, InsuLock first utilizes a Siamese neural network to predict the existence of insulators within a given region (up to 2000 bp). Then, it uses an object detection module for precise insulator boundary localization via gradient-weighted class activation mapping (~40 bp resolution). Finally, it quantifies variant impacts by comparing the insulator score differences between the wild-type and mutant alleles. We applied InsuLock on various bulk and single-cell datasets for performance testing and benchmarking. We showed that it outperformed existing methods with an AUROC of ~0.96 and condensed insulator annotations to ~2.5% of their original size while still demonstrating higher conservation scores and better motif enrichments. Finally, we utilized InsuLock to make cell-type-specific variant impacts from brain scATAC-seq data and identified a schizophrenia GWAS variant disrupting an insulator loop proximal to a known risk gene, indicating a possible new mechanism of action for the disease.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.