2019
DOI: 10.1007/978-3-030-29349-9_5
|View full text |Cite
|
Sign up to set email alerts
|

Approximating Spectral Clustering via Sampling: A Review

Abstract: Spectral clustering refers to a family of unsupervised learning algorithms that compute a spectral embedding of the original data based on the eigenvectors of a similarity graph. This non-linear transformation of the data is both the key of these algorithms' success and their Achilles heel: forming a graph and computing its dominant eigenvectors can indeed be computationally prohibitive when dealing with more that a few tens of thousands of points. In this paper, we review the principal research efforts aiming… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
28
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
7
2
1

Relationship

0
10

Authors

Journals

citations
Cited by 38 publications
(28 citation statements)
references
References 128 publications
0
28
0
Order By: Relevance
“…The cluster assignments are disregarded in this case because k typically exceeds the actual number of classes or unique labels. Solving this optimization problem is NP-hard and the Lloyd-Max algorithm is a popular iterative technique for approximating the optimal solution [32]. However, this algorithm requires the entire data set X to be stored in the main memory and incurs a high computational cost, taking O(npm) time per iteration.…”
Section: Preliminaries and Landmark Selection Techniquesmentioning
confidence: 99%
“…The cluster assignments are disregarded in this case because k typically exceeds the actual number of classes or unique labels. Solving this optimization problem is NP-hard and the Lloyd-Max algorithm is a popular iterative technique for approximating the optimal solution [32]. However, this algorithm requires the entire data set X to be stored in the main memory and incurs a high computational cost, taking O(npm) time per iteration.…”
Section: Preliminaries and Landmark Selection Techniquesmentioning
confidence: 99%
“…Various methods have been proposed to accelerate spectral clustering by computing an approximate spectral embedding of the original data. Recent work [23] presented an excellent review of the literature on this topic for interested readers. In this paper, we divide the related work into two main categories: (1) methods that circumvent the computation of the full kernel matrix, and (2) techniques that consider the similarity graph as one of the inputs to spectral clustering and, thus, ignore the cost associated with step 2 of Alg.…”
Section: A Related Work On Accelerating Spectral Clusteringmentioning
confidence: 99%
“…Beyond SC, many other methods have been proposed, such as Maximum Likelihood or variational approaches, which are consistent for the SBM and DSBM [6,30,29], Bayesian approaches [49], learning-based approaches [2], or neural networks [5]. Many variants of the SC itself exist, often to accelerate computation [41]. We focus here on the traditional SC.…”
Section: Introductionmentioning
confidence: 99%