2016
DOI: 10.48550/arxiv.1608.06861
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Parallel K-Medoids++ Spatial Clustering Algorithm Based on MapReduce

et al.
Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 3 publications
0
2
0
Order By: Relevance
“…We are, to the best of our knowledge, the first team to scale a semi-metric K-medoids algorithm to 1 billion points (with 6 attributes) (see BillionOne dataset appendix F.2.1). PAMAE [37] does run on a dataset of around 4 billion, but the data is restricted to Euclidean space, other distributed semi-metric K-medoids algorithms [19,22,28,40,41] are not run at this scale. To compare to other non-Euclidean algorithms, HPDBSCAN [16] demonstrates runs upto 82 million points with 4 attributes.…”
Section: Results and Comparisionsmentioning
confidence: 99%
“…We are, to the best of our knowledge, the first team to scale a semi-metric K-medoids algorithm to 1 billion points (with 6 attributes) (see BillionOne dataset appendix F.2.1). PAMAE [37] does run on a dataset of around 4 billion, but the data is restricted to Euclidean space, other distributed semi-metric K-medoids algorithms [19,22,28,40,41] are not run at this scale. To compare to other non-Euclidean algorithms, HPDBSCAN [16] demonstrates runs upto 82 million points with 4 attributes.…”
Section: Results and Comparisionsmentioning
confidence: 99%
“…As the number of distances collected rises, its efficiency decreases. The work presented in [18] suggests a MapReduce-based k-medoids++ spatial clustering algorithm for large geographical data. Initialization and MapReduce decrease iterations.…”
Section: Related Workmentioning
confidence: 99%