2019
DOI: 10.12688/f1000research.21642.1
|View full text |Cite
|
Sign up to set email alerts
|

Generalized EmbedSOM on quadtree-structured self-organizing maps

Abstract: EmbedSOM is a simple and fast dimensionality reduction algorithm, originally developed for its applications in single-cell cytometry data analysis. We present an updated version of EmbedSOM, viewed as an algorithm for landmark-based embedding enrichment, and demonstrate that it works well even with manifold-learning techniques other than the self-organizing maps. Using this generalization, we introduce an inwards-growing variant of self-organizing maps that is designed to mitigate some earlier identified defic… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
11
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
1
1

Relationship

4
3

Authors

Journals

citations
Cited by 9 publications
(11 citation statements)
references
References 12 publications
0
11
0
Order By: Relevance
“…The task of the SOM training is to assign values to the neurons so that the training dataset is covered by neighborhoods of the neurons, and, at the same time, that the topology of the neurons is preserved in the trained network. A 2D grid is one of the most commonly used topologies because it simplifies interpretation of the results as neuron values positioned in the 2D space, and related visualization purposes (e.g., EmbedSOM [ 15 ]). At the same time, the trained network can serve as a simple clustering of the input dataset, classifying each data point to its nearest neuron.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…The task of the SOM training is to assign values to the neurons so that the training dataset is covered by neighborhoods of the neurons, and, at the same time, that the topology of the neurons is preserved in the trained network. A 2D grid is one of the most commonly used topologies because it simplifies interpretation of the results as neuron values positioned in the 2D space, and related visualization purposes (e.g., EmbedSOM [ 15 ]). At the same time, the trained network can serve as a simple clustering of the input dataset, classifying each data point to its nearest neuron.…”
Section: Methodsmentioning
confidence: 99%
“…To simplify visualization of the results, GigaSOM.jl includes a parallel reimplementation of the EmbedSOM algorithm in Julia [ 15 ], which quickly provides interpretable visualizations of the cell distribution within the datasets. EmbedSOM computes an embedding of the cells to 2D space, similarly as the popular t-SNE or UMAP algorithms [ 31 , 32 ].…”
Section: Methodsmentioning
confidence: 99%
“…A 2-dimensional grid is one of the most commonly used topologies, because it simpli es interpretation of the results as neuron values positioned in the 2-dimensional space, and related visualization purposes (e.g. EmbedSOM [14]). At the same time, the trained network can serve as a simple clustering of the input dataset, classifying each data point to its nearest neuron.…”
Section: Methodsmentioning
confidence: 99%
“…EmbedSOM is a visualization-oriented method of non-linear dimensionality reduction that works by describing a high-dimensional point by its location relative to landmarks equipped with a topology, and reproducing the point in a lowdimensional space using an explicit low-dimensional projection of the landmarks with the same topology [15]. The ability to e ectively work with a simpli ed model of the data di erentiates it from other dimensionality reduction methods; in turn it o ers superior performance by reducing the amount of necessary computation as well as by opening parallelization potential, since the computations of the projections of many individual points are independent.…”
Section: Landmark-directed Dimensionality Reductionmentioning
confidence: 99%
“…The most popular methods, typically based on neighborhood embedding computed by stochastic descent, force-based layouting or neural autoencoders, reach applicability limits when the dataset size is too large. To tackle the limitations, we have previously developed EmbedSOM [15], a dimensionality reduction and visualization algorithm based on self-organizing maps (SOMs) [13]. EmbedSOM provided an order-of-magnitude speedup on datasets typical for the single-cell cytometry data visualization while retaining competitive quality of the results.…”
mentioning
confidence: 99%