2018
DOI: 10.21105/joss.00861
|View full text |Cite
|
Sign up to set email alerts
|

UMAP: Uniform Manifold Approximation and Projection

Abstract: UMAP (Uniform Manifold Approximation and Projection) is a novel manifold learning technique for dimension reduction. UMAP is constructed from a theoretical framework based in Riemannian geometry and algebraic topology. e result is a practical scalable algorithm that applies to real world data. e UMAP algorithm is competitive with t-SNE for visualization quality, and arguably preserves more of the global structure with superior run time performance. Furthermore, UMAP has no computational restrictions on embeddi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

42
6,972
1
17

Year Published

2018
2018
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 8,730 publications
(7,612 citation statements)
references
References 20 publications
42
6,972
1
17
Order By: Relevance
“…Principal components showing significant overrepresentation of genes linked to the GO annotations “cell proliferation” and “cell cycle” in the top or bottom 1% quantile of loadings were removed in a fashion similar to the CCcorrect function in RaceID3. Cells were clustered using the Louvain algorithm (Blondel et al, 2008), and visualized using UMAP (McInnes and Healy, 2018). …”
Section: Star Methodsmentioning
confidence: 99%
“…Principal components showing significant overrepresentation of genes linked to the GO annotations “cell proliferation” and “cell cycle” in the top or bottom 1% quantile of loadings were removed in a fashion similar to the CCcorrect function in RaceID3. Cells were clustered using the Louvain algorithm (Blondel et al, 2008), and visualized using UMAP (McInnes and Healy, 2018). …”
Section: Star Methodsmentioning
confidence: 99%
“…However, t-SNE often fails to accurately capture global structure in the data, such as distances between clusters, making interpreting higher order features of t- SNE plots difficult. While a recent method, UMAP, addresses the issue of capturing global structure in discrete datasets, it seems to still distort single cell gene expression trajectories (McInnes and Healy, 2018). …”
Section: Introductionmentioning
confidence: 99%
“…In this context, we consider matrices that have rows for every spot and columns for every feature (which could be genes or spatial factor activities). Utilizing t-distributed stochastic neighbor embedding (t-SNE) [14] or similar methods, such as principal component analysis (PCA) or uniform manifold approximation and projection (UMAP) [15], the number of columns of the matrices is reduced to three. The data are then rescaled into the unit cube and the rows used as coordinates in color space to colorize the spots in a spatial plot.…”
Section: Visual Summarization and Clusteringmentioning
confidence: 99%
“…However, they do not faithfully reflect the discrete count nature of RNA-Seq data. Alternatives are discrete count expression models, such as models based on the Poisson [13] or the negative binomial [14,15] distributions, with the latter being better suited for modeling over-dispersed gene expression data.ZINB-WaVE [14] offers a zero-inflated negative binomial (ZINB) regression framework for unknown covariate discovery, including gene-level covariates. Embedded in a hierarchical probabilistic ZINB model, scVI [15] utilizes deep neural networks to model gene-level responses based on both a latent space and known sample-level covariates.…”
mentioning
confidence: 99%
See 1 more Smart Citation