2022
DOI: 10.1007/s13042-022-01635-2
|View full text |Cite
|
Sign up to set email alerts
|

Confidence estimation for t-SNE embeddings using random forest

Abstract: Dimensionality reduction algorithms are commonly used for reducing the dimension of multi-dimensional data to visualize them on a standard display. Although many dimensionality reduction algorithms such as the t-distributed Stochastic Neighborhood Embedding aim to preserve close neighborhoods in low-dimensional space, they might not accomplish that for every sample of the data and eventually produce erroneous representations. In this study, we developed a supervised confidence estimation algorithm for detectin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

2
10
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
1
1

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(12 citation statements)
references
References 42 publications
2
10
0
Order By: Relevance
“…In this table, the total number of erroneous samples for each dataset was also provided since the ratio of the correctly detected erroneous samples could be useful for interpreting the performance differences between the datasets. In most cases, our predictions generally showed higher concordance to the ground truth erroneous samples in comparison to Yigin et al 8 and the NPR score. It is noteworthy that the success of detecting erroneous samples has increased at a much higher rate than 8 particularly in the lowest 10 RF-scored samples, even occasionally a success of 10 out of 10 had been attained.…”
Section: Resultssupporting
confidence: 54%
See 4 more Smart Citations
“…In this table, the total number of erroneous samples for each dataset was also provided since the ratio of the correctly detected erroneous samples could be useful for interpreting the performance differences between the datasets. In most cases, our predictions generally showed higher concordance to the ground truth erroneous samples in comparison to Yigin et al 8 and the NPR score. It is noteworthy that the success of detecting erroneous samples has increased at a much higher rate than 8 particularly in the lowest 10 RF-scored samples, even occasionally a success of 10 out of 10 had been attained.…”
Section: Resultssupporting
confidence: 54%
“…The distance measures are presented in Table 1 . In accordance with 8 , we trained an RF regressor on the training sets of the AMB18 and Baron Human datasets by using each distance measure individually as a feature. Next, we evaluate the performance of each distance measure with intra-dataset experiments.…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations