Proceedings of the 2019 3rd International Conference on Big Data Research, 2019
DOI: 10.1145/3372454.3372481

Online Embedding and Clustering of Data Streams

Cited by 2 publications (2 citation statements)
References 10 publications
“…We adjust these algorithms to apply them on data streams and we perform clustering on the embedded data. In this particular case, when compared against t-SNE, UMAP shows better performance in terms of execution time and silhouette score of the embedded data, and therefore, UMAP is more suitable for data streams [4]. This finding is also supported by Bahri et al [5] where they survey dimensionality reduction techniques and empirically compare five of them, as applied on data streams.…”
Section: Introduction (mentioning)
confidence: 54%
“…Notably, while Distance Consistency (DSC) [59] was designed for DR visual quality evaluation [19,56,58], it can also be viewed as a CVM since it considers only the separation of class labels in the embeddings. EVM-based evaluation: Given Z, δ, P_L, and a clustering technique C providing a partition P_C = C(Z, δ) of the embedded data, m_E(P_C, P_L) represents CLM between P_L and Z. K-Means and the adjusted rand index are commonly used for C and m_E, respectively [31,71,74].…”
Section: Using CVM to Evaluate CLM (mentioning)
confidence: 99%