2016
DOI: 10.1162/tacl_a_00104
|View full text |Cite
|
Sign up to set email alerts
|

Named Entity Recognition with Bidirectional LSTM-CNNs

Abstract: Named entity recognition is a challenging task that has traditionally required large amounts of knowledge in the form of feature engineering and lexicons to achieve high performance.In this paper, we present a novel neural network architecture that automatically detects word-and character-level features using a hybrid bidirectional LSTM and CNN architecture, eliminating the need for most feature engineering. We also propose a novel method of encoding partial lexicon matches in neural networks and compare it to… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

6
712
1
13

Year Published

2017
2017
2021
2021

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 1,478 publications
(732 citation statements)
references
References 23 publications
6
712
1
13
Order By: Relevance
“…Similar to [18, 19], we introduce distributed representations for features and extend BI-LSTM by adding a hidden layer after the LSTM layer, which concatenates the word representations generated by LSTM layer and the feature representations, as shown in Figure 3. The features used in BI-LSTM-FEA are: sentence information, section information, general NER information and dictionary features, which are the same as features mentioned in section “CRF-based Method”.…”
Section: Methodsmentioning
confidence: 99%
See 4 more Smart Citations
“…Similar to [18, 19], we introduce distributed representations for features and extend BI-LSTM by adding a hidden layer after the LSTM layer, which concatenates the word representations generated by LSTM layer and the feature representations, as shown in Figure 3. The features used in BI-LSTM-FEA are: sentence information, section information, general NER information and dictionary features, which are the same as features mentioned in section “CRF-based Method”.…”
Section: Methodsmentioning
confidence: 99%
“…In our system, an ensemble classifier is deployed to combine the outputs of three individual machine learning-based subsystems, and a rule-based subsystem is used to identify some formulaic PHI instances. The three machine learning-based subsystems are a CRF-based system with a large number of hand-crafted features [12], a bidirectional LSTM-based system without any hand-crafted features [16, 17], and a variant of bidirectional LSTM-based system with a small quantity of hand-crafted features [18, 19]. Moreover, we also evaluate our system on the 2014 i2b2 challenge corpus and compare it with other state-of-the-art systems.…”
Section: Introductionmentioning
confidence: 99%
See 3 more Smart Citations