“…Gotz et al [43] present HPDBSCAN, an algorithm for both shared-memory and distributed-memory based on partitioning the data among processors, running DBSCAN locally on each partition, and then merging the clusters together. Exact and approximate distributed DBSCAN algorithms have been designed using the MapReduce [7,34,39,51,53,63,90,92] and Spark [32,49,54,68,69,82] paradigms. RP-DBSCAN [82], which is an approximate DBSCAN algorithm, has been shown to be the state-of-the-art for MapReduce and Spark.…”