RANLP 2017 - Recent Advances in Natural Language Processing Meet Deep Learning 2017
DOI: 10.26615/978-954-452-049-6_098
|View full text |Cite
|
Sign up to set email alerts
|

A Graph-based Text Similarity Measure That Employs Named Entity Information

Abstract: Text comparison is an interesting though hard task, with many applications in Natural Language Processing. This work introduces a new text-similarity measure, which employs named-entities' information extracted from the texts and the ngram graphs' model for representing documents. Using OpenCalais as a namedentity recognition service and the JIN-SECT toolkit for constructing and managing n-gram graphs, the text similarity measure is embedded in a text clustering algorithm (k-Means). The evaluation of the produ… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
3
3

Relationship

1
5

Authors

Journals

citations
Cited by 7 publications
(1 citation statement)
references
References 16 publications
0
1
0
Order By: Relevance
“…POS tags) and combine them with improved preprocessing, to avoid possible noise in the related features. Concerning NGGs in Hate Speech detection, we want to apply the findings from the work of [27] on NGG variations, to represent short texts with only the important n-grams of the text (e.g. through a TF-IDF filtering process and/or a named entity recognizer).…”
Section: Discussionmentioning
confidence: 99%
“…POS tags) and combine them with improved preprocessing, to avoid possible noise in the related features. Concerning NGGs in Hate Speech detection, we want to apply the findings from the work of [27] on NGG variations, to represent short texts with only the important n-grams of the text (e.g. through a TF-IDF filtering process and/or a named entity recognizer).…”
Section: Discussionmentioning
confidence: 99%