2022
DOI: 10.1109/access.2021.3134704
|View full text |Cite
|
Sign up to set email alerts
|

A Systematic Review of Density Grid-Based Clustering for Data Streams

Abstract: Various applications, such as electronic business, satellite remote sensing, intrusion discovery, and network traffic monitoring, generate large unbounded data stream sequences at a rapid pace. The clustering of data streams has attracted considerable interest due to the increasing usage of evolving data streams. In particular, evolving data streams affect clustering because they introduce numerous challenges, such as time and memory limits and one-pass clustering. Furthermore, researchers need to be able to d… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
9

Relationship

0
9

Authors

Journals

citations
Cited by 14 publications
(7 citation statements)
references
References 112 publications
(206 reference statements)
0
7
0
Order By: Relevance
“…There is a lack of quantization stage of the raw data before dealing with it in the creation of the core mini clusters. Other algorithms have adopted the grid concept for this quantization, such as MuDi-Stream [26]; however, they used the static grid in time and spatial attributes.…”
Section: Literature Reviewmentioning
confidence: 99%
“…There is a lack of quantization stage of the raw data before dealing with it in the creation of the core mini clusters. Other algorithms have adopted the grid concept for this quantization, such as MuDi-Stream [26]; however, they used the static grid in time and spatial attributes.…”
Section: Literature Reviewmentioning
confidence: 99%
“…Although the container does interconnect between the physical database and instance, if any instance fails, the workload manager automatically routes the survivors ensuring availability and reliability of the whole solution model. From the performance perspective, the relevant model is also sharded database introduced in 2017 [25], providing linear scalability, fault tolerance, and geographic data distribution by using horizontal fragmentation across multiple regions [23,26,27]. It is shown in Fig.…”
Section: Database Infrastructurementioning
confidence: 99%
“…Using the parallel option, the table block set is divided into several sub-parts, each assigned to one process run in a parallel mode. Therefore, an index can be created rapidly sooner, depending on the number of CPUs, physical data definition, disc storage allocation, configuration, distribution, and partitioning [26,46,47]. The following code highlights the description of parallelism.…”
Section: Parallel and Nologging Clausementioning
confidence: 99%
“…Bahri et al [6] thoroughly investigated the basic algorithms for various data stream mining tasks, including clustering. Tareq et al [7] conducted a systematic literature review on density and grid-based algorithms for clustering data streams by identifying their strengths and weaknesses and investigating how these algorithms address the problems of evolving data streams. Al-Khamees et al [8] classified data stream clustering algorithms into five categories: Partition-based, Hierarchical-based, Gridbased, Density-based, and Model-based.…”
Section: Introductionmentioning
confidence: 99%