Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) 2018
DOI: 10.18653/v1/p18-1186
|View full text |Cite
|
Sign up to set email alerts
|

Multimodal Named Entity Disambiguation for Noisy Social Media Posts

Abstract: We introduce the new Multimodal Named Entity Disambiguation (MNED) task for multimodal social media posts such as Snapchat or Instagram captions, which are composed of short captions with accompanying images. Social media posts bring significant challenges for disambiguation tasks because 1) ambiguity not only comes from polysemous entities, but also from inconsistent or incomplete notations, 2) very limited context is provided with surrounding words, and 3) there are many emerging entities often unseen during… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
58
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 68 publications
(70 citation statements)
references
References 15 publications
0
58
0
Order By: Relevance
“…To our knowledge, [36] is the only other work that leverages multimodal information for an entity disambiguation task in a social media context. However, they use a dataset of 12K annotated image-caption pairs from Snapchat (which is not made available), whereas our work relies on a much larger base (85K samples for the benchmark and 2M to build the KB) and is more in line with the state-of-the-art which mostly uses Twitter as case study for EL on social media.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…To our knowledge, [36] is the only other work that leverages multimodal information for an entity disambiguation task in a social media context. However, they use a dataset of 12K annotated image-caption pairs from Snapchat (which is not made available), whereas our work relies on a much larger base (85K samples for the benchmark and 2M to build the KB) and is more in line with the state-of-the-art which mostly uses Twitter as case study for EL on social media.…”
Section: Related Workmentioning
confidence: 99%
“…In our case, due to the nature of the dataset, we only use a simple inclusion (i.e. the mention words are present in the entity name) but a more complex lexical distance that takes into account more variations could be considered [36]. The disambiguation of m i is then formalized as finding the best multimodal similarity measure between the tweet containing the mention and the timeline of the correct entity, both containing text and images:…”
Section: Problem Formulationmentioning
confidence: 99%
See 1 more Smart Citation
“…The data set is obtained from the entity linking task for Chinese short text from the China Conference on Knowledge Graph and Semantic Computing (CCKS 2019) 1 . There are 90,000 pieces of data in the data set and 399,252 pieces of data in the external knowledge base, all of which are short Chinese texts.…”
Section: F Algorithm Descriptionmentioning
confidence: 99%
“…Entity disambiguation is an important task in knowledge graphs and a key technology in the field of information extraction and integration that affects the accuracy of many downstream tasks in natural language processing, such as information retrieval and intelligent question answering. Recently, the research on entity disambiguation based on knowledge graphs has gradually increased, for example, Seungwhan Moon et al [1], Prabal Agarwal et al [2]. Causes of entity ambiguity can be classified into diversity and ambiguity, namely, synonymy and polysemy [3].…”
mentioning
confidence: 99%