Proceedings of the 9th International Conference on Semantic Systems 2013
DOI: 10.1145/2506182.2506198
|View full text |Cite
|
Sign up to set email alerts
|

Improving efficiency and accuracy in multilingual entity extraction

Abstract: There has recently been an increased interest in named entity recognition and disambiguation systems at major conferences such as WWW, SIGIR, ACL, KDD, etc. However, most work has focused on algorithms and evaluations, leaving little space for implementation details. In this paper, we discuss some implementation and data processing challenges we encountered while developing a new multilingual version of DBpedia Spotlight that is faster, more accurate and easier to configure. We compare our solution to the prev… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
314
0
3

Year Published

2016
2016
2019
2019

Publication Types

Select...
5
3
2

Relationship

0
10

Authors

Journals

citations
Cited by 431 publications
(318 citation statements)
references
References 3 publications
1
314
0
3
Order By: Relevance
“…Several approaches and APIs have been proposed for extracting named entities from text documents and linking them to LOD. One of the most used APIs is DBpedia Spotlight [54,55], which allows for automatically annotating text documents with DBpedia URIs. This tool is used in several LOD enabled data mining approaches, e.g., [56][57][58][59].…”
Section: Using Lod To Interpret Unstructured Datamentioning
confidence: 99%
“…Several approaches and APIs have been proposed for extracting named entities from text documents and linking them to LOD. One of the most used APIs is DBpedia Spotlight [54,55], which allows for automatically annotating text documents with DBpedia URIs. This tool is used in several LOD enabled data mining approaches, e.g., [56][57][58][59].…”
Section: Using Lod To Interpret Unstructured Datamentioning
confidence: 99%
“…Textual concepts: The textual concepts used in evaluation of the multimedia retrieval task are extracted using the DBpedia Spotlight annotation tool, which is an open source project for automatic annotation of DBpedia entities in natural language text [6].…”
Section: Feature Extractionmentioning
confidence: 99%
“…With the development of multilingual Wikipedia, researchers have been employing it in many multilingual applications [3,16,17,20,23,24]. Similar to the English-only contexts, each dimension in a multilingual context representation vector represented the relatedness of the target entity with a set of entities/words in the corresponding language.…”
Section: Related Workmentioning
confidence: 99%