2022
DOI: 10.48550/arxiv.2203.00545
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

DAMO-NLP at SemEval-2022 Task 11: A Knowledge-based System for Multilingual Named Entity Recognition

Abstract: The MultiCoNER shared task aims at detecting semantically ambiguous and complex named entities in short and low-context settings for multiple languages. The lack of contexts makes the recognition of ambiguous named entities challenging. To alleviate this issue, our team DAMO-NLP proposes a knowledge-based system, where we build a multilingual knowledge base based on Wikipedia to provide related context information to the named entity recognition (NER) model. Given an input sentence, our system effectively retr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
2
1
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 17 publications
0
4
0
Order By: Relevance
“…In the future, we would like to experiment with other model architectures for more complex NER tasks such as those presented at SemEval-2022 (Malmasi et al, 2022), of particular interest is the work from Wang et al (2022). We would like to include more native Quechua speaking annotators in order to improve the data set even more.…”
Section: Discussionmentioning
confidence: 99%
“…In the future, we would like to experiment with other model architectures for more complex NER tasks such as those presented at SemEval-2022 (Malmasi et al, 2022), of particular interest is the work from Wang et al (2022). We would like to include more native Quechua speaking annotators in order to improve the data set even more.…”
Section: Discussionmentioning
confidence: 99%
“…In last year's MultiCoNER shared task, the two winning systems employed different strategies. [17] used a large-scale retrieval approach to gather relevant paragraphs related to the target sentence, which were concatenated and used as input to a transformer-CRF system. The aim was to build a multilingual knowledge base relying on Wikipedia.…”
Section: Related Workmentioning
confidence: 99%
“…Wang et al [22] proposed a knowledge-based system for multilingual NER using a multi-stage fine-tuning approach for the MultiCoNER SemEval 2022 task 11 . Inspired by their multi-stage fine-tuning, we also adapt their approach for our final system.…”
Section: Multi-stage Fine-tuningmentioning
confidence: 99%
“…We also experimented with the knowledge-based system for multilingual NER that was proposed by Wang et al [22]. We used their implementation to enrich the original AjMC datasets with a knowledge base and implemented their context approach in the Flair library.…”
Section: Challengesmentioning
confidence: 99%