2015
DOI: 10.1007/978-3-319-24309-2_27
|View full text |Cite
|
Sign up to set email alerts
|

Automatic Identification and Disambiguation of Concepts and Named Entities in the Multilingual Wikipedia

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
9
0

Year Published

2016
2016
2023
2023

Publication Types

Select...
4
3
1

Relationship

2
6

Authors

Journals

citations
Cited by 9 publications
(9 citation statements)
references
References 18 publications
0
9
0
Order By: Relevance
“…To learn our embeddings, we adapt the freely available sample of 500k articles of Babelfied English Wikipedia (Scozzafava et al, 2015). To our knowledge, this is one of the largest published and evaluated sense-annotated corpora, containing over 500 million words, of which over 100 million are annotated with Babel synsets, with an estimated synset annotation accuracy of 77.8%.…”
Section: Building Supersense Embeddingsmentioning
confidence: 99%
See 1 more Smart Citation
“…To learn our embeddings, we adapt the freely available sample of 500k articles of Babelfied English Wikipedia (Scozzafava et al, 2015). To our knowledge, this is one of the largest published and evaluated sense-annotated corpora, containing over 500 million words, of which over 100 million are annotated with Babel synsets, with an estimated synset annotation accuracy of 77.8%.…”
Section: Building Supersense Embeddingsmentioning
confidence: 99%
“…Similarly to , they find that SemCor may not be a sufficient resource for supersense tagging adaption to different domains. Therefore, in our work, we explore the potential of using an automatically annotated Babelfied Wikipedia corpus (Scozzafava et al, 2015) for this task.…”
Section: Supersense Taggingmentioning
confidence: 99%
“…5 Babel-Net is a large multilingual encyclopedic dictionary and semantic network, comprising approximately 16 million entries for concepts and named entities linked by semantic relations. As training corpus we used the English portion of BabelWiki, 6 a multilingual corpus comprising the English Wikipedia (Scozzafava et al, 2015). The corpus was automatically annotated with named entities and concepts using Babelfy (Moro et al, 2014), a state-ofthe-art disambiguation and entity linking system, based on the BabelNet semantic network.…”
Section: Implementation Detailsmentioning
confidence: 99%
“…However, hand-labeled sense annotations are notoriously difficult to obtain on a large scale, and manually curated corpora (Miller et al, 1993;Passonneau et al, 2012) have a limited size. Given that scaling the manual annotation process becomes practically unfeasible when both lexicographic and encyclopedic knowledge is addressed (Schubert, 2006), recent years have witnessed efforts to produce larger sense-annotated corpora automatically (Moro et al, 2014a;Taghipour and Ng, 2015a;Scozzafava et al, 2015;Raganato et al, 2016). Even though these automatic approaches produce noisier corpora, it has been shown that training on them leads to better supervised and semi-supervised models (Taghipour and Ng, 2015b;Raganato et al, 2016;Yuan et al, 2016;Raganato et al, 2017), as well as to effective embedded representations for senses (Iacobacci et al, 2015;Flekova and Gurevych, 2016).…”
Section: Introductionmentioning
confidence: 99%