Mining chemical-induced disease relations embedded in the vast biomedical literature could facilitate a wide range of computational biomedical applications, such as pharmacovigilance. The BioCreative V organized a Chemical Disease Relation (CDR) Track regarding chemical-induced disease relation extraction from biomedical literature in 2015. We participated in all subtasks of this challenge. In this article, we present our participation system Chemical Disease Relation Extraction SysTem (CD-REST), an end-to-end system for extracting chemical-induced disease relations in biomedical literature. CD-REST consists of two main components: (1) a chemical and disease named entity recognition and normalization module, which employs the Conditional Random Fields algorithm for entity recognition and a Vector Space Model-based approach for normalization; and (2) a relation extraction module that classifies both sentence-level and document-level candidate drug–disease pairs by support vector machines. Our system achieved the best performance on the chemical-induced disease relation extraction subtask in the BioCreative V CDR Track, demonstrating the effectiveness of our proposed machine learning-based approaches for automatic extraction of chemical-induced disease relations in biomedical literature. The CD-REST system provides web services using HTTP POST request. The web services can be accessed from http://clinicalnlptool.com/cdr. The online CD-REST demonstration system is available at http://clinicalnlptool.com/cdr/cdr.html.Database URL: http://clinicalnlptool.com/cdr; http://clinicalnlptool.com/cdr/cdr.html
Highlights► Evolutionary and structural analyses of all SFTSV gene sequences plus a new isolate. ► Discovery of evidence for homologous recombination in SFTSV. ► Reconstruction of SFTSV epidemic history; extant lineages originated 50–150 years ago. ► Structural conservation between SFTSV and RVFV nucleocapsid residues.
BackgroundSevere fever with thrombocytopenia syndrome (SFTS) is an emerging infectious disease discovered in rural areas of Central China in 2009, caused by a novel bunyavirus, SFTS virus (SFTSV). The disease usually presents as fever, thrombocytopenia, and leukocytopenia, with case-fatality rates ranging from 2.5% to 30%. Haemaphysalis longicornis was suspected to be the most likely vector of SFTSV. By the end of 2012, the disease had expanded to 13 provinces of China. SFTS patients have been reported in Japan and South Korea, and a disease similar to SFTS has been reported in the United States.Methodology/Principal FindingsWe characterized the epidemiologic features of 504 confirmed SFTS cases in Xinyang Region, the most severely SFTS-afflicted region in China from 2011 to 2012, and assessed the environmental risk factors. All cases occurred during March to November, with the epidemic peaking from May to July. The patients' ages ranged from 7 to 87 years (median 61 years), and the annual incidence increased with age (χ2 test for trend, P<0.001). The female-to-male ratio of cases was 1.58, and 97.0% of the cases were farmers who resided in the southern and western parts of the region. The Poisson regression analysis revealed that the spatial variations of SFTS incidence were significantly associated with the shrub, forest, and rain-fed cropland areas.ConclusionsThe distribution of SFTS showed highly significant temporal and spatial heterogeneity in Xinyang Region, with the majority of SFTS cases being elderly farmers who resided in the southern and western parts of the region, mostly acquiring infection between May and July when H. longicornis is highly active. The shrub, rain-fed, and rain-fed cropland areas were associated with high risk for this disease.
BackgroundSuicide has been one of the leading causes of deaths in the United States. One major cause of suicide is psychiatric stressors. The detection of psychiatric stressors in an at risk population will facilitate the early prevention of suicidal behaviors and suicide. In recent years, the widespread popularity and real-time information sharing flow of social media allow potential early intervention in a large-scale population. However, few automated approaches have been proposed to extract psychiatric stressors from Twitter. The goal of this study was to investigate techniques for recognizing suicide related psychiatric stressors from Twitter using deep learning based methods and transfer learning strategy which leverages an existing annotation dataset from clinical text.MethodsFirst, a dataset of suicide-related tweets was collected from Twitter streaming data with a multiple-step pipeline including keyword-based retrieving, filtering and further refining using an automated binary classifier. Specifically, a convolutional neural networks (CNN) based algorithm was used to build the binary classifier. Next, psychiatric stressors were annotated in the suicide-related tweets. The stressor recognition problem is conceptualized as a typical named entity recognition (NER) task and tackled using recurrent neural networks (RNN) based methods. Moreover, to reduce the annotation cost and improve the performance, transfer learning strategy was adopted by leveraging existing annotation from clinical text.Results & conclusionsTo our best knowledge, this is the first effort to extract psychiatric stressors from Twitter data using deep learning based approaches. Comparison to traditional machine learning algorithms shows the superiority of deep learning based approaches. CNN is leading the performance at identifying suicide-related tweets with a precision of 78% and an F-1 measure of 83%, outperforming Support Vector Machine (SVM), Extra Trees (ET), etc. RNN based psychiatric stressors recognition obtains the best F-1 measure of 53.25% by exact match and 67.94% by inexact match, outperforming Conditional Random Fields (CRF). Moreover, transfer learning from clinical notes for the Twitter corpus outperforms the training with Twitter corpus only with an F-1 measure of 54.9% by exact match. The results indicate the advantages of deep learning based methods for the automated stressors recognition from social media.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.