2012 IEEE Fifth International Conference on Cloud Computing 2012
DOI: 10.1109/cloud.2012.42
|View full text |Cite
|
Sign up to set email alerts
|

Efficient Map/Reduce-Based DBSCAN Algorithm with Optimized Data Partition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
44
0
4

Year Published

2014
2014
2020
2020

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 75 publications
(48 citation statements)
references
References 12 publications
0
44
0
4
Order By: Relevance
“…This overlap is needed because of the possibility that the data points of a cluster are spread across different partitions, so there should be an overlap (a joint or boundary region) between adjacent partitions. Selecting a 2ε-wide boundary region ensures sufficient information for the merge phase [9]. As an example, consider a twodimensional space in which each of the dimensions is normalized between 0 and 1, and the interval of each dimension is divided into two portions.…”
Section: Create a Grid For The Data Spacementioning
confidence: 99%
See 4 more Smart Citations
“…This overlap is needed because of the possibility that the data points of a cluster are spread across different partitions, so there should be an overlap (a joint or boundary region) between adjacent partitions. Selecting a 2ε-wide boundary region ensures sufficient information for the merge phase [9]. As an example, consider a twodimensional space in which each of the dimensions is normalized between 0 and 1, and the interval of each dimension is divided into two portions.…”
Section: Create a Grid For The Data Spacementioning
confidence: 99%
“…In addition, a core point reduction approach is proposed and implemented for more efficient and yet accurate data summarization and regeneration. Note that MapReduce-based implementation of DBSCAN algorithm has been studied in [8,9,10] that are relying on different partitioning strategies. In this thesis, a grid-based partitioning approach is used for an implementation of DBSCAN algorithm that is compatible with GMM summarization, which are both completely developed in the MapReduce framework.…”
Section: Contributionsmentioning
confidence: 99%
See 3 more Smart Citations