2008 IEEE Symposium on Computers and Communications 2008
DOI: 10.1109/iscc.2008.4625693
|View full text |Cite
|
Sign up to set email alerts
|

Document clustering based on diffusion maps and a comparison of the k-means performances in various spaces

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2017
2017
2019
2019

Publication Types

Select...
3
1
1

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(7 citation statements)
references
References 10 publications
0
7
0
Order By: Relevance
“…The angular component is based on the cosine similarity and expressed as: (3) One can notice that, if x or y are equal to 0 , then   ,0…”
Section: Distance/dissimilarity Components In Sodamentioning
confidence: 99%
See 2 more Smart Citations
“…The angular component is based on the cosine similarity and expressed as: (3) One can notice that, if x or y are equal to 0 , then   ,0…”
Section: Distance/dissimilarity Components In Sodamentioning
confidence: 99%
“…Data partitioning and clustering techniques have been widely used in different areas of the economy and society [3], [16], [35] However, despite being considered to be an unsupervised form of machine learning, traditional clustering techniques require prior knowledge and handcrafting to operate. Users need to define a number of parameters and make assumptions in advance, i.e.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…For example, the k-means [7], [10], mean-shift clustering [14], k nearest neighbour classification [15] algorithms may use the newly introduced direction-aware distance to enhance the ability in dealing with high dimensional data.…”
Section: The Application Of the Proposed Distance To Traditional Clusmentioning
confidence: 99%
“…Therefore, it is more often used in the natural language processing (NLP) problems [7]- [11]. In NLP problems, machine learning algorithms, for example, k-means [7], [10], mean shift [11], etc., are used to cluster very high dimensional vectors representing the documents together based on the cosine similarity. Nonetheless, the cosine similarity is a pseudo metric because it does not obey the triangle inequality (it obeys the Cauchy-Schwarz inequality [12]).…”
Section: Introductionmentioning
confidence: 99%