2017
DOI: 10.1007/s10579-017-9395-6
|View full text |Cite
|
Sign up to set email alerts
|

Creation and evaluation of large keyphrase extraction collections with multiple opinions

Abstract: While several Automatic Keyphrase Extraction (AKE) techniques have been developed and analyzed, there is little consensus on the definition of the task and a lack of overview of the effectiveness of different techniques. Proper evaluation of keyphrase extraction requires large test collections with multiple opinions, currently not available for research. In this paper, we (i) present a set of test collections derived from various sources with multiple annotations (which we also refer to as opinions in the rema… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 11 publications
(8 citation statements)
references
References 25 publications
0
8
0
Order By: Relevance
“…To sum up, the problem of subjectivity could be partially addressed using multiple annotators or treating supervised keyphrase extraction as Positive Unlabeled Learning (Chuang, Manning, & Heer, ; Sterckx et al, ). In this direction, Sterckx, Caragea, Demeester, and Develder () propose as solution the creation of new collections for the evaluation of keyphrase extraction methods, derived from various sources with multiple annotations. The golden set of keyphrases, which is utilized for evaluation reasons, also incorporates subjectivity issues.…”
Section: Supervised Methodsmentioning
confidence: 99%
“…To sum up, the problem of subjectivity could be partially addressed using multiple annotators or treating supervised keyphrase extraction as Positive Unlabeled Learning (Chuang, Manning, & Heer, ; Sterckx et al, ). In this direction, Sterckx, Caragea, Demeester, and Develder () propose as solution the creation of new collections for the evaluation of keyphrase extraction methods, derived from various sources with multiple annotations. The golden set of keyphrases, which is utilized for evaluation reasons, also incorporates subjectivity issues.…”
Section: Supervised Methodsmentioning
confidence: 99%
“…( 2020 ), Dutch Sterckx et al. ( 2018 ), Polish (Pak2018), Spanish Aquino and Lanzarini ( 2015 ), Portuguese Marujo et al. ( 2013 ), and French Bougouin et al.…”
Section: Background Informationmentioning
confidence: 99%
“…The classifications discussed in this article are limited to the GKET. Language Independence Academic researchers are working on AKE from texual documents of different languages, for instance, Telugu Naidu et al (2018), Chinese Chen et al (2020a), Li et al (2017), English (KPTimes and JPTimes), German Kölbl et al (2020), Dutch Sterckx et al (2018), Polish (Pak2018), Spanish Aquino and Lanzarini (2015), Portuguese , and French Bougouin et al (2013). The structure of the word distribution in Chinese and English documents is well studied using GoW evolved from these documents and different patterns are observed.…”
Section: Classification Of the Graphical Keyword Extraction Techniquesmentioning
confidence: 99%
“…In this direction, Sterckx et al (2018) propose as solution the creation of new collections for the evaluation of keyphrase extraction methods, derived from various sources with multiple annotations. This, however, comes at the expense of additional annotation effort.…”
Section: Subjectivity and Class Imbalancementioning
confidence: 99%
“…Furthermore, several approaches to the creation of such datasets have been proposed, in the past (Medelyan et al, 2009;Sterckx et al, 2018)…”
Section: Datasetsmentioning
confidence: 99%