2021
DOI: 10.48550/arxiv.2108.05525
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Clustering with UMAP: Why and How Connectivity Matters

Abstract: Topology based dimensionality reduction methods such as t-SNE and UMAP have seen increasing success and popularity in highdimensional data. These methods have strong mathematical foundations and are based on the intuition that the topology in low dimensions should be close to that of high dimensions. Given that the initial topological structure is a precursor to the success of the algorithm, this naturally raises the question: What makes a "good" topological structure for dimensionality reduction? In this pape… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 19 publications
0
1
0
Order By: Relevance
“…Secondly, the UMAP algorithm drove the 10-NN graph from this matrix in Euclidean space with pynndescent optimization. Thirdly, a 20-NN graph and distance matrix was used to build a 20-mutual-NN graph using the minimal tree path-connectivity model (functions: min_spanning_tree(), create_connected_graph(), nd_new_nn(), mutual_nn_nearest() in microclusters_to_groups_cv_feature_selection_svc.ipynb) 127 . Fourthly, we derived a fuzzy simplical set for the ensuing graph using the UMAP algorithm 136 .…”
Section: Area-speci C Hypothalamic Astrocytes Revealed Astrotrapmentioning
confidence: 99%
“…Secondly, the UMAP algorithm drove the 10-NN graph from this matrix in Euclidean space with pynndescent optimization. Thirdly, a 20-NN graph and distance matrix was used to build a 20-mutual-NN graph using the minimal tree path-connectivity model (functions: min_spanning_tree(), create_connected_graph(), nd_new_nn(), mutual_nn_nearest() in microclusters_to_groups_cv_feature_selection_svc.ipynb) 127 . Fourthly, we derived a fuzzy simplical set for the ensuing graph using the UMAP algorithm 136 .…”
Section: Area-speci C Hypothalamic Astrocytes Revealed Astrotrapmentioning
confidence: 99%