Recent advances in deep neural models allow us to build reliable named entity recognition (NER) systems without handcrafting features. However, such methods require large amounts of manually-labeled training data. There have been efforts on replacing human annotations with distant supervision (in conjunction with external dictionaries), but the generated noisy labels pose significant challenges on learning effective neural models. Here we propose two neural models to suit noisy distant supervision from the dictionary. First, under the traditional sequence labeling framework, we propose a revised fuzzy CRF layer to handle tokens with multiple possible labels. After identifying the nature of noisy labels in distant supervision, we go beyond the traditional framework and propose a novel, more effective neural model AutoNER with a new Tie or Break scheme. In addition, we discuss how to refine distant supervision for better NER performance. Extensive experiments on three benchmark datasets demonstrate that AutoNER achieves the best performance when only using dictionaries with no additional human effort, and delivers competitive results with state-of-the-art supervised benchmarks.
Neuronal ensembles that hold specific memory (memory engrams) have been identified in the hippocampus, amygdala, or cortex. However, it has been hypothesized that engrams of a specific memory are distributed among multiple brain regions that are functionally connected, referred to as a unified engram complex. Here, we report a partial map of the engram complex for contextual fear conditioning memory by characterizing encoding activated neuronal ensembles in 247 regions using tissue phenotyping in mice. The mapping was aided by an engram index, which identified 117 cFos+ brain regions holding engrams with high probability, and brain-wide reactivation of these neuronal ensembles by recall. Optogenetic manipulation experiments revealed engram ensembles, many of which were functionally connected to hippocampal or amygdala engrams. Simultaneous chemogenetic reactivation of multiple engram ensembles conferred a greater level of memory recall than reactivation of a single engram ensemble, reflecting the natural memory recall process. Overall, our study supports the unified engram complex hypothesis for memory storage.
We present a novel end-to-end reinforcement learning approach to automatic taxonomy induction from a set of terms. While prior methods treat the problem as a two-phase task (i.e., detecting hypernymy pairs followed by organizing these pairs into a tree-structured hierarchy), we argue that such two-phase methods may suffer from error propagation, and cannot effectively optimize metrics that capture the holistic structure of a taxonomy. In our approach, the representations of term pairs are learned using multiple sources of information and used to determine which term to select and where to place it on the taxonomy via a policy network. All components are trained in an end-to-end manner with cumulative rewards, measured by a holistic tree metric over the training taxonomies. Experiments on two public datasets of different domains show that our approach outperforms prior state-ofthe-art taxonomy induction methods up to 19.6% on ancestor F1. 1
A new feature selection method based on kernelized fuzzy rough sets (KFRS) and the memetic algorithm (MA) is proposed for transient stability assessment of power systems. Considering the possible real-time information provided by wide-area measurement systems, a group of system-level classification features are extracted from the power system operation parameters to build the original feature set. By defining a KFRS-based generalized classification function as the separability criterion, the memetic algorithm based on binary differential evolution (BDE) and Tabu search (TS) is employed to obtain the optimal feature subsets with the maximized classification capability. The proposed method may avoid the information loss caused by the feature discretization process of the rough-set based attribute selection, and comprehensively utilize the advantages of BDE and TS to improve the solution quality and search efficiency. The effectiveness of the proposed method is validated by the application results on the New England 39-bus power system and the southern power system of Hebei province.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.