2020
DOI: 10.1088/1742-6596/1617/1/012088
|View full text |Cite
|
Sign up to set email alerts
|

Application of DBSCAN Algorithm in Data Sampling

Abstract: Data sampling is to sample hidden, previously unknown knowledge and rules that have potential value for decision-making from massive data. Cluster analysis is an important research topic in the field of data sampling. How to extract the information and knowledge that people care about, unknown, and help to analyze the decision-making process from massive data is a problem that people urgently need to solve. In this paper, after using the genetic algorithm-based method to obtain a better initial clustering cent… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 9 publications
(4 citation statements)
references
References 5 publications
0
4
0
Order By: Relevance
“…This algorithm involves two parameters: the first one is 'minpts' (minimum number of points), determining the density required for a region to be considered, and the second one utilizes 'minpts' for clustering. Another is eps (ϵ), this parameter works for a distance measure that is used to locate the points (Deng, 2020).…”
Section: Dbscan (Density-based Spatial Clustering Of Application With...mentioning
confidence: 99%
See 1 more Smart Citation
“…This algorithm involves two parameters: the first one is 'minpts' (minimum number of points), determining the density required for a region to be considered, and the second one utilizes 'minpts' for clustering. Another is eps (ϵ), this parameter works for a distance measure that is used to locate the points (Deng, 2020).…”
Section: Dbscan (Density-based Spatial Clustering Of Application With...mentioning
confidence: 99%
“…The DBSCAN clustering algorithm is useful with unsupervised learning of the data. This clustering algorithm can discover outliers (Deng, 2020). In fuzzy C-means clustering, each cluster is assigned a membership matrix, which determines the extent to which each sample is associated with the cluster (Li et al, 2009).…”
Section: Introductionmentioning
confidence: 99%
“…Considering the slow search speed and strong dependence on initial population selection of the genetic algorithm, Sherar and Zulkernine [29] implemented a parallel clustering algorithm with particle swarm optimization in the Apache Spark framework to improve the accuracy of data partitioning. Deng [30] proposed a parallel density clustering algorithm under MapReduce, which partitions data by using the particle swarm optimization algorithm and uses the k-dist graph to fnd local clustering parameters. In addition, Ashish et al [31] proposed a parallel clustering method using the bat algorithm, which achieves fast and efcient work by dividing large data sets into small blocks and then clustering these smaller blocks in parallel.…”
Section: State Of the Artmentioning
confidence: 99%
“…Hu et al proposed a parallel DBSCAN [17] based on genetic algorithm, which uses genetic algorithm to calculate the optimal value of neighborhood and .Wang Jun developed a parameter adaptive density clustering algorithm based on MapReduce [18] that combines the PSO method [19] to optimize the parameters of the DBSCAN algorithm in local clustering. Deng [20] proposed a parallel density clustering algorithm DPDPSO based on particle swarm optimization [21] and k-dist graph, which uses particle swarm optimization algorithm to obtain the best initial clustering center for partitioning, and uses k-dist graph to find local clustering parameters after partitioning. However, these algorithms also have a limitation that it is easy to fall into local optimization in the process of parameter optimization, so the parameter optimization ability of the algorithm needs to be further improved.…”
Section: Introductionmentioning
confidence: 99%