Named entity recognition is a challenging task that has traditionally required large amounts of knowledge in the form of feature engineering and lexicons to achieve high performance.In this paper, we present a novel neural network architecture that automatically detects word-and character-level features using a hybrid bidirectional LSTM and CNN architecture, eliminating the need for most feature engineering. We also propose a novel method of encoding partial lexicon matches in neural networks and compare it to existing approaches. Extensive evaluation shows that, given only tokenized text and publicly available word embeddings, our system is competitive on the CoNLL-2003 dataset and surpasses the previously reported state of the art performance on the OntoNotes 5.0 dataset by 2.13 F1 points. By using two lexicons constructed from publicly-available sources, we establish new state of the art performance with an F1 score of 91.62 on CoNLL-2003 and 86.28 on OntoNotes, surpassing systems that employ heavy feature engineering, proprietary lexicons, and rich entity linking information.
This shared task focuses on identifying unusual, previously-unseen entities in the context of emerging discussions. Named entities form the basis of many modern approaches to other tasks (like event clustering and summarization), but recall on them is a real problem in noisy text -even among annotators. This drop tends to be due to novel entities and surface forms. Take for example the tweet "so.. kktny in 30 mins?!" -even human experts find the entity kktny hard to detect and resolve. The goal of this task is to provide a definition of emerging and of rare entities, and based on that, also datasets for detecting these entities. The task as described in this paper evaluated the ability of participating entries to detect and classify novel and emerging named entities in noisy text.
Promiscuous plasmids replicate in a wide range of bacteria and therefore play a key role in the dissemination of various host-beneficial traits, including antibiotic resistance. Despite the medical relevance, little is known about the evolutionary dynamics through which drug resistance plasmids adapt to new hosts and thereby persist in the absence of antibiotics. We previously showed that the incompatibility group P-1 (IncP-1) minireplicon pMS0506 drastically improved its stability in novel host Shewanella oneidensis MR-1 after 1,000 generations under antibiotic selection for the plasmid. The only mutations found were those affecting the N terminus of the plasmid replication initiation protein TrfA1. Our aim in this study was to gain insight into the dynamics of plasmid evolution. Changes in stability and genotype frequencies of pMS0506 were monitored in evolving populations of MR-1 (pMS0506). Genotypes were determined by sequencing trfA1 amplicons from individual clones and by 454 pyrosequencing of whole plasmids from entire populations. Stability of pMS0506 drastically improved by generation 200. Many evolved plasmid genotypes with point mutations as well as in-frame and frameshift deletions and duplications in trfA1 were observed in all lineages with both sequencing methods. Strikingly, multiple genotypes were simultaneously present at high frequencies (>10%) in each population. Their relative abundances changed over time, but after 1,000 generations only one or two genotypes dominated the populations. This suggests that hosts with different plasmid genotypes were competing with each other, thus affecting the evolutionary trajectory. Plasmids can thus rapidly improve their stability, and clonal interference plays a significant role in plasmid-host adaptation dynamics.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.