Distill 2016
DOI: 10.23915/distill.00002
|View full text |Cite
|
Sign up to set email alerts
|

How to Use t-SNE Effectively

Abstract: We present a new technique called "t-SNE" that visualizes high-dimensional data by giving each datapoint a location in a two or three-dimensional map. The technique is a variation of Stochastic Neighbor Embedding (Hinton and Roweis, 2002) that is much easier to optimize, and produces significantly better visualizations by reducing the tendency to crowd points together in the center of the map. t-SNE is better than existing techniques at creating a single map that reveals structure at many different scales. Thi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

3
526
0
3

Year Published

2017
2017
2024
2024

Publication Types

Select...
9

Relationship

0
9

Authors

Journals

citations
Cited by 704 publications
(532 citation statements)
references
References 22 publications
3
526
0
3
Order By: Relevance
“…Thus, these visualizations may exaggerate differences between cell populations and overlook potential connections between these populations. A further difficulty is the choice of its perplexity parameter, as t‐SNE graphs may show strongly different numbers of clusters depending on its value (Wattenberg et al , ). Common alternatives to t‐SNE are the Uniform Approximation and Projection method (UMAP; preprint: McInnes & Healy, ) or graph‐based tools such as SPRING (Weinreb et al , ).…”
Section: Introductionmentioning
confidence: 99%
“…Thus, these visualizations may exaggerate differences between cell populations and overlook potential connections between these populations. A further difficulty is the choice of its perplexity parameter, as t‐SNE graphs may show strongly different numbers of clusters depending on its value (Wattenberg et al , ). Common alternatives to t‐SNE are the Uniform Approximation and Projection method (UMAP; preprint: McInnes & Healy, ) or graph‐based tools such as SPRING (Weinreb et al , ).…”
Section: Introductionmentioning
confidence: 99%
“…(c) t-SNE (126) provides more insight, revealing local similarities as well as overall variation in the data set. However, t-SNE can be more difficult to apply, as it requires setting a manually adjustable parameter (perplexity) (127). (d ) Diffusion maps (128,129) model relationships between points in the data set as a diffusion process that is then reduced to a lower-dimensional map.…”
Section: Visualization Challengesmentioning
confidence: 99%
“…Results from such methods are directly interpreted by researchers in the context of biological questions being targeted. Although detailed technical knowledge of a method used is usually not required, it is important to understand the consequence of assumptions and parameters used 21 , 22 …”
Section: Brief Historical Outlookmentioning
confidence: 99%
“…This includes choosing a particular method, as many methods often can be used to address the same problem. Moreover, not all parameters can be automatically ‘learned’ from data, and the choice of parameters can dramatically affect results 21 , 22 …”
Section: Perspectives and Conclusionmentioning
confidence: 99%