State-of-the-art machine reading methods extract, in hours, hundreds of thousands of events from the biomedical literature. However, many of the extracted biomolecular interactions are incorrect or not relevant for computational modeling of a system of interest. Therefore, rapid, automated methods are required to filter and select accurate and useful information. The FiLter for Understanding True Events (FLUTE) tool uses public protein interaction databases to filter interactions that have been extracted by machines from databases such as PubMed and score them for accuracy. Confidence in the interactions allows for rapid and accurate model assembly. As our results show, FLUTE can reliably determine the confidence in the biomolecular interactions extracted by fast machine readers and at the same time provide a speedup in interaction filtering by three orders of magnitude. Database URL: https://bitbucket.org/biodesignlab/flute.
The large amount of knowledge contained in the scientific literature can be mined using natural language processing and utilized to automatically construct models of complex networks in order to obtain a greater understanding of complex systems. In this paper, we describe the Dynamic System Explanation (DySE) framework, which configures hybrid models and executes simulations over time, relying on a granular computing approach and a range of different element update functions. A standardized tabular format assembles the collected knowledge into networks for parameterization. The Discrete Stochastic Heterogeneous (DiSH) simulator outputs trajectories of state changes for all model elements, thus providing a means for running thousands of in silico scenarios in seconds. Trajectories are also analyzed using statistical model checking to verify against known or desired system properties, determined from text or numerical data. DySE can automatically extend models when additional knowledge is available, and model extension is integrated with model checking to test the validity of additional interactions and dynamics, and
The amount of biomedical literature has vastly increased over the past few decades. As a result, the sheer quantity of accessible information is overwhelming, and complicates manual information retrieval. Automated methods seek to speed up information retrieval from biomedical literature. However, such automated methods are still too time-intensive to survey all existing biomedical literature. We present a methodology for automatically generating literature queries that select relevant papers based on biological data. By using differentially expressed genes to inform our literature searches, we focus information extraction on mechanistic signaling details that are crucial for the disease or context of interest.
Signaling network models are usually assembled from information in literature and expert knowledge or inferred from data. The goal of modeling is to gain mechanistic understanding of key signaling pathways and provide predictions on how perturbations affect large-scale processes such as disease progression. For glioblastoma multiforme (GBM), this task is critical, given the lack of effective treatments and pace of disease progression. Both manual and automated assembly of signaling networks from data or literature have drawbacks. Existing GBM networks, as well as networks assembled using state-of-the-art machine reading, fall short when judged by the quality and quantity of information, as well as certain attributes of the overall network structure. The contributions of this work are two-fold. First, we propose an automated methodology for verification of signaling networks. Next, we discuss automation of network assembly and extension that relies on methods and resources used for network verification, thus, implicitly including verification in these processes. In addition to these methods, we also present, and verify a comprehensive GBM network assembled with a hybrid of manual and automated methods. Finally, we demonstrate that, while an automated network assembly is fast, such networks still lack precision and realistic network topology.
Glioblastomas and glioblastoma stem cells are heterogeneous with respect to mutations, gene expression, and response to drugs. To make predictive responses of individual GBM stem cell lines to drugs, we have constructed a causal model of glioblastoma stem cell signaling. The core model was built starting from pathways identified from TCGA mutation data with the addition of the Jak/STAT, Hedgehog, and Notch pathways. Elements and relations between them were validated and extended using the PCNet interaction database and the INDRA database which includes machine read extractions from the biomedical literature. The result is a high confidence executable model consisting of 209 element and 370 rules of interaction between the elements. Stochastic simulations of the model provide dynamic (quantile) changes in time and responses to perturbations. The output provides activity of individual nodes as well as a cellular output state of cell cycle progression, apoptosis, or differentiation. To simulate the responses of individual cell lines to kinase inhibitors, the model was initialized using DNA sequencing data, RNA-seq, and reverse phase protein array (RPPA) data from each cell line. Comparing the results of the simulations to the drug responses of 11 different kinase targets, the model was 88% accurate in predicting effects on growth and survival. The model was further tested by comparing the effects of Mek inhibition of each of the cell lines in model to the results observed in the RPPA data which overlap by 127 elements. In this case, there was 62% concordance between the model and data when binned into quintiles. Discrepancies between the model predictions and the data are being investigated to determine whether the model logic or extent needs to be revised to improve the model. This modeling approach is a step toward developing algorithms for personalized therapeutics for GBM based on multi-omics data.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.