2012
DOI: 10.1007/978-3-642-27443-5_80
|View full text |Cite
|
Sign up to set email alerts
|

Analysis of Similarity Measures with WordNet Based Text Document Clustering

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
16
0
1

Year Published

2012
2012
2022
2022

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 20 publications
(17 citation statements)
references
References 7 publications
0
16
0
1
Order By: Relevance
“…We use the Pearson distance (a transform of the Pearson correlation [21]), as opposed to the many other distance measures [1,2,20,21], as the Pearson distance often produces a clustering that is closer to the true clustering (i.e., a closer match to the manually assigned clusters) [25,40]. We find that the Pearson distance performs well when clustering system signatures (see Section 5).…”
Section: Distance Calculationmentioning
confidence: 95%
“…We use the Pearson distance (a transform of the Pearson correlation [21]), as opposed to the many other distance measures [1,2,20,21], as the Pearson distance often produces a clustering that is closer to the true clustering (i.e., a closer match to the manually assigned clusters) [25,40]. We find that the Pearson distance performs well when clustering system signatures (see Section 5).…”
Section: Distance Calculationmentioning
confidence: 95%
“…as an intersection of an object that is divided by union of the object (Sandhya, et al, 2008;Leskovec, Rajaraman, & Ullman, 2011). Jaccard Similarity of the set S and T are JS (S, T), with the following equation…”
Section: Association Rule Miningmentioning
confidence: 99%
“…To calculate the value of similarity, this research used Jaccard Similarity because euclidean distance is a measure of the distance, whereas the cosine similarity, the Jaccard coefficient and the Pearson coefficient is a measure of similarity (Sandhya, et al, 2008) and also because Jaccard Similarity is suitable for a wide range of applications, such as textual similarity or similarity of the buying habits of customers (Leskovec, Rajaraman, & Ullman, 2011). The end result of this detection application is the trustines value of reviewers, the honesty value of reviews, and the reliability value of a product.…”
Section: Introductionmentioning
confidence: 99%
“…The Euclidean distance has been used in a variety of studies [5], [11] as a basis for comparison as well as simple metric.…”
Section: A Euclidean Distancementioning
confidence: 99%
“…The most notable of these are the Euclidean distance, cosine similarity, Pearson correlation coefficient, Kullback-Leibler divergence and extended Jaccard coefficient, which have all been analysed in comparative studies [5], [11], [12].…”
Section: Introductionmentioning
confidence: 99%