2015
DOI: 10.1371/journal.pone.0144059
|View full text |Cite
|
Sign up to set email alerts
|

A Comparison Study on Similarity and Dissimilarity Measures in Clustering Continuous Data

Abstract: Similarity or distance measures are core components used by distance-based clustering algorithms to cluster similar data points into the same clusters, while dissimilar or distant data points are placed into different clusters. The performance of similarity measures is mostly addressed in two or three-dimensional spaces, beyond which, to the best of our knowledge, there is no empirical study that has revealed the behavior of similarity measures when dealing with high-dimensional datasets. To fill this gap, a t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
148
0
4

Year Published

2017
2017
2023
2023

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 323 publications
(152 citation statements)
references
References 34 publications
0
148
0
4
Order By: Relevance
“…The Manhattan distance observes the absolute distance between two data items and dimensions. This distance measure is known as a city block distance, absolute value distance, rectilinear distance, taxicab distance or L1 distance family and gives cluster shape as hyper-rectangular [36]. Manhattan distance formulation shows as Eq.2 based upon the Eq.1.…”
Section: Distance Measures Taxonomymentioning
confidence: 99%
“…The Manhattan distance observes the absolute distance between two data items and dimensions. This distance measure is known as a city block distance, absolute value distance, rectilinear distance, taxicab distance or L1 distance family and gives cluster shape as hyper-rectangular [36]. Manhattan distance formulation shows as Eq.2 based upon the Eq.1.…”
Section: Distance Measures Taxonomymentioning
confidence: 99%
“…If the value is <50 it is dissimilar or difference (F 1 ). [7] Stability studies and report Stability studies were performed using stability chamber (Eye Instruments Pvt. Ltd., Ahmedabad) on D-3 batches tablets for 6 months.…”
Section: Similarity and Dissimilarity Studymentioning
confidence: 99%
“…If these conditions are provided, clustering can be successful, otherwise, it is unsuccessful. Clustering, which is an important method in information inference with all these features, has several application fields such as pattern recognition, bio-information, text mining and machine learning [7,8].…”
Section: Introductionmentioning
confidence: 99%