Key challenges for human genetics, precision medicine and evolutionary biology include deciphering the regulatory code of gene expression and understanding the transcriptional effects of genome variation. However, this is extremely difficult because of the enormous scale of the noncoding mutation space. We developed a deep learning-based framework, ExPecto, that can accurately predict, ab initio from a DNA sequence, the tissue-specific transcriptional effects of mutations, including those that are rare or that have not been observed. We prioritized causal variants within disease- or trait-associated loci from all publicly available genome-wide association studies and experimentally validated predictions for four immune-related diseases. By exploiting the scalability of ExPecto, we characterized the regulatory mutation space for human RNA polymerase II-transcribed genes by in silico saturation mutagenesis and profiled > 140 million promoter-proximal mutations. This enables probing of evolutionary constraints on gene expression and ab initio prediction of mutation disease effects, making ExPecto an end-to-end computational framework for the in silico prediction of expression and disease risk.
We address the challenge of detecting the contribution of noncoding mutations to disease with a deep-learning-based framework that predicts specific regulatory effects and the deleterious impact of genetic variants. Applying this framework to 1,790 Autism Spectrum Disorder (ASD) simplex families reveals disease causality of noncoding mutations: ASD probands harbor both transcriptional and post-transcriptional regulation-disrupting de novo mutations of significantly higher functional impact than unaffected siblings. Further analysis suggests involvement of noncoding mutations in synaptic transmission and neuronal development, and taken together with prior studies reveal a convergent genetic landscape of coding and noncoding mutations in ASD. We demonstrate that sequences carrying prioritized proband mutations possess allele-specific regulatory activity, and highlight a link between noncoding mutations and IQ heterogeneity in ASD probands. Our predictive genomics framework illuminates the role of noncoding mutations in ASD, prioritizes high impact mutations for further study, and is broadly applicable to complex human diseases.
This paper evaluates the performance of tools for the extraction of metadata from scientific articles. Accurate metadata extraction is an important task for automating the management of digital libraries. This comparative study is a guide for developers looking to integrate the most suitable and effective metadata extraction tool into their software. We shed light on the strengths and weaknesses of seven tools in common use. In our evaluation using papers from the arXiv collection, GROBID delivered the best results, followed by Mendeley Desktop. SciPlore Xtract, PDFMeat, and SVMHeaderParse also delivered good results depending on the metadata type to be extracted.
We report that heating chemical vapor deposition grown monolayer MoS2 in air at temperatures as low as 285 °C for 2 h results in rapid degradation of the monolayer within 2.5 weeks of ambient air exposure after heating. We find that the rapid degradation proceeds via the growth of dendrites on the basal plane that have a fractal dimension close to that of diffusion-limited aggregation. We also observe dendrites in unheated samples that have been in ambient air for a year. We explain the rapid degradation after heating to an increase in MoO3. We propose that the mechanism for dendrite growth involves the diffusion of H2O to oxide sites. This results in the liquefication of the oxides. The liquefied oxides do not protect the surface from further oxidation. Putting heated samples in a dry box for 2 weeks immediately after heating prevents the rapid degradation from occurring.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.