Joint entity recognition and relation extraction as a multi-head selection problem

Bekoulis, Giannis; Deleu, Johannes; Demeester, Thomas; Develder, Chris

doi:10.1016/j.eswa.2018.07.032

Cited by 363 publications

(221 citation statements)

References 26 publications

Supporting

Mentioning

220

Contrasting

Order By: Relevance

“…To be able to compare our results with previous works (Bekoulis et al 2018;Miwa and Bansal 2016;Katiyar and Cardie 2017) on EMD, we identify the head of the entity mention rather than the whole mention.…”

Section: Entity Mention Detection (Emd)mentioning

confidence: 99%

“…Unlike (Hashimoto et al 2017) and other previous work (Katiyar and Cardie 2017;Bekoulis et al 2018;Augenstein, Ruder, and Søgaard 2018), we do not learn label embeddings, meaning that the (supervised) output/prediction of a layer is not directly fed to the following layer through an embedding learned during training. Nonetheless, sharing embeddings and stacking hierarchical encoders allows us to share the supervision from each task along the full structure of our model and achieve state-of-the-art performance.…”

Section: Related Workmentioning

confidence: 99%

See 1 more Smart Citation

A Hierarchical Multi-Task Approach for Learning Embeddings from Semantic Tasks

Sanh

Wolf

Ruder

2019

AAAI

186

170

View full text Add to dashboard Cite

Much effort has been devoted to evaluate whether multi-task learning can be leveraged to learn rich representations that can be used in various Natural Language Processing (NLP) down-stream applications. However, there is still a lack of understanding of the settings in which multi-task learning has a significant effect. In this work, we introduce a hierarchical model trained in a multi-task learning setup on a set of carefully selected semantic tasks. The model is trained in a hierarchical fashion to introduce an inductive bias by supervising a set of low level tasks at the bottom layers of the model and more complex tasks at the top layers of the model. This model achieves state-of-the-art results on a number of tasks, namely Named Entity Recognition, Entity Mention Detection and Relation Extraction without hand-engineered features or external NLP tools like syntactic parsers. The hierarchical training supervision induces a set of shared semantic representations at lower layers of the model. We show that as we move from the bottom to the top layers of the model, the hidden states of the layers tend to represent more complex semantic information.

show abstract

Section: Entity Mention Detection (Emd)mentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

A Hierarchical Multi-Task Approach for Learning Embeddings from Semantic Tasks

Sanh

Wolf

Ruder

2019

AAAI

186

170

View full text Add to dashboard Cite

show abstract

“…End-to-end results: The first six rows in Table 1 compare our results with previous state-of-the-art published results on the same test set. In particular, our model obtains 2+% absolute higher NER and RC scores (Setup 1) than the BiLSTM-CRF-based multihead selection model [5]. We also obtain 7+% higher EC and RC scores (Setup 2) than Adel and Schütze (2017) [1].…”

Section: Resultsmentioning

confidence: 69%

“…Dataset: We use the benchmark "entity and relation recognition" dataset CoNLL04 from [27]. Following [4,5], we use the 64%/16%/20% training/development/test presplit available from Adel and Schütze (2017) [1], in which the test set was previously also used by Gupta et al (2016) [9]. Implementation: Our model is implemented using DYNET v2.0 [21].…”

Section: Methodsmentioning

confidence: 99%

End-to-End Neural Relation Extraction Using Deep Biaffine Attention

Nguyen

Verspoor

2019

Lecture Notes in Computer Science

View full text Add to dashboard Cite

We propose a neural network model for joint extraction of named entities and relations between them, without any hand-crafted features. The key contribution of our model is to extend a BiLSTM-CRF-based entity recognition model with a deep biaffine attention layer to model second-order interactions between latent features for relation classification, specifically attending to the role of an entity in a directional relationship. On the benchmark "relation and entity recognition" dataset CoNLL04, experimental results show that our model outperforms previous models, producing new state-of-the-art performances.

show abstract

“…We model the relation extraction task as a multi-label head selection problem (Bekoulis et al, 2018b;. In our model, each word w i can be involved in multiple relations with other words.…”

Section: Joint Learning As Head Selectionmentioning

confidence: 99%

Adversarial training for multi-context joint entity and relation extraction

Bekoulis¹,

Deleu²,

Demeester³

et al. 2018

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing

Self Cite

163

View full text Add to dashboard Cite

Adversarial training (AT) is a regularization method that can be used to improve the robustness of neural network methods by adding small perturbations in the training data. We show how to use AT for the tasks of entity recognition and relation extraction. In particular, we demonstrate that applying AT to a general purpose baseline model for jointly extracting entities and relations, allows improving the state-of-the-art effectiveness on several datasets in different contexts (i.e., news, biomedical, and real estate data) and for different languages (English and Dutch).

show abstract

Joint entity recognition and relation extraction as a multi-head selection problem

Cited by 363 publications

References 26 publications

A Hierarchical Multi-Task Approach for Learning Embeddings from Semantic Tasks

A Hierarchical Multi-Task Approach for Learning Embeddings from Semantic Tasks

End-to-End Neural Relation Extraction Using Deep Biaffine Attention

Adversarial training for multi-context joint entity and relation extraction

Contact Info

Product

Resources

About