Proceedings of the 7th ACM International Conference on Web Search and Data Mining 2014
DOI: 10.1145/2556195.2556230
|View full text |Cite
|
Sign up to set email alerts
|

Entity linking at the tail

Abstract: Web search is seeing a paradigm shift from keyword based search to an entity-centric organization of web data. To support web search with this deeper level of understanding, a web-scale entity linking system must have 3 key properties: First, its feature extraction must be robust to the diversity of web documents and their varied writing styles and content structures. Second, it must maintain high-precision linking for "tail" (unpopular) entities that is robust to the existence of confounding entities outside … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2015
2015
2024
2024

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 17 publications
(3 citation statements)
references
References 30 publications
0
3
0
Order By: Relevance
“…This could be accomplished by adding a document-level representation or by leveraging other mentions in the document. English-focused work on rare entity performance (Orr et al, 2020;Jin et al, 2014) may provide additional direction.…”
Section: Discussionmentioning
confidence: 99%
“…This could be accomplished by adding a document-level representation or by leveraging other mentions in the document. English-focused work on rare entity performance (Orr et al, 2020;Jin et al, 2014) may provide additional direction.…”
Section: Discussionmentioning
confidence: 99%
“…To serve this navigational request ⁵¹A process known as named-entity extraction , Etzioni et al, 2005, Hoffart et al, 2013 and disambiguation [Jin et al, 2014]. To serve this navigational request ⁵¹A process known as named-entity extraction , Etzioni et al, 2005, Hoffart et al, 2013 and disambiguation [Jin et al, 2014].…”
Section: Example 12mentioning
confidence: 99%
“…Recent studies show that NER models face challenges in predicting entities that are not encountered during training [11,25,34], encompassing both Out-of-Vocabulary (OOV) and Out-of-Domain (OOD) entities. These challenges frequently arise in web social media and the medical domain [7,18]. Consequently, NER models' ability to detect and classify decreases when they encounter these entities, which we attribute to the "lack of knowledge" of fine-tuned small models.…”
Section: Introductionmentioning
confidence: 96%