2018
DOI: 10.1016/j.ipm.2018.06.004
|View full text |Cite
|
Sign up to set email alerts
|

Local word vectors guiding keyphrase extraction

Abstract: Automated keyphrase extraction is a fundamental textual information processing task concerned with the selection of representative phrases from a document that summarize its content. This work presents a novel unsupervised method for keyphrase extraction, whose main innovation is the use of local word embeddings (in particular GloVe vectors), i.e., embeddings trained from the single document under consideration. We argue that such local representation of words and keyphrases are able to accurately capture thei… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
43
0
2

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
4

Relationship

1
7

Authors

Journals

citations
Cited by 77 publications
(45 citation statements)
references
References 29 publications
0
43
0
2
Order By: Relevance
“…This method can clearly distinguish the target document from others. Papagiannopoulou and Tsoumaka proposed RVA [24], a local word vectors guiding keyphrase extraction model, which uses the average of all the candidate phrases' embeddings trained on individual files with GloVe as the reference vector, and then the similarity between the embeddings of candidate keyphrase and the reference vector is calculated and used as the score to rank. Bennani-Smires et al proposed EmbedRank [25], which uses the cosine similarity between the embeddings of candidate keyphrase and the sentence embeddings of the document.…”
Section: Embedding-based Keyphrase Extractionmentioning
confidence: 99%
See 1 more Smart Citation
“…This method can clearly distinguish the target document from others. Papagiannopoulou and Tsoumaka proposed RVA [24], a local word vectors guiding keyphrase extraction model, which uses the average of all the candidate phrases' embeddings trained on individual files with GloVe as the reference vector, and then the similarity between the embeddings of candidate keyphrase and the reference vector is calculated and used as the score to rank. Bennani-Smires et al proposed EmbedRank [25], which uses the cosine similarity between the embeddings of candidate keyphrase and the sentence embeddings of the document.…”
Section: Embedding-based Keyphrase Extractionmentioning
confidence: 99%
“…The graph-based models 2 are TextRank [3], SingleRank [10], TopicRank [11], Position-Rank [13] and Multipartite [12]. The embedding-based models are RVA [24] and EmbedRank [25].…”
Section: B Compare With Other Baselinesmentioning
confidence: 99%
“…Furthermore, RVA's performance witnesses the effectiveness of keyphrase extraction from scientific publications using only their titles and abstracts. Papagiannopoulou and Tsoumakas () conducted experiments using the same evaluation approach (Rousseau & Vazirgiannis, ) and two versions of each unsupervised keyphrase extraction method, one with the title/abstract and one with the full‐text of each article. The abstract version of both RVA and the graph‐based methods is better than the fulltext version according to the partial match evaluation approach, possibly due to the redundancy which is included in the fulltexts.…”
Section: Empirical Evaluation Studymentioning
confidence: 99%
“…Topics are defined as single and multiword with significant similarity. With the extensive use of deep learning in various fields [36], many new models have also been proposed in the field of keywords extraction [17], [37]- [39], and achieved the best results in many datasets. However, many keywords extraction methods based on deep learning are generally supervised, and the model training will take a long time, so it will not be compared with the methods proposed in this article.…”
Section: Related Workmentioning
confidence: 99%