2018
DOI: 10.3390/bdcc2040032
|View full text |Cite
|
Sign up to set email alerts
|

Data Stream Clustering Techniques, Applications, and Models: Comparative Analysis and Discussion

Abstract: Data growth in today’s world is exponential, many applications generate huge amount of data streams at very high speed such as smart grids, sensor networks, video surveillance, financial systems, medical science data, web click streams, network data, etc. In the case of traditional data mining, the data set is generally static in nature and available many times for processing and analysis. However, data stream mining has to satisfy constraints related to real-time response, bounded and limited memory, single-p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
29
0
6

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
4
1

Relationship

0
9

Authors

Journals

citations
Cited by 54 publications
(35 citation statements)
references
References 62 publications
(48 reference statements)
0
29
0
6
Order By: Relevance
“…It also supports data aggregation by removing data redundancy before transmitting them to the sink to gain energy conservation. Recently, in [6,8,29,30], the authors have presented a comprehensive overview of various data clustering techniques for in-network data reduction in periodic WSNs. Generally, these techniques can be classified into three different categories as presented in Figure 1, which are feature extraction-based, model-based and feature selection-based data clustering.…”
Section: Background Workmentioning
confidence: 99%
“…It also supports data aggregation by removing data redundancy before transmitting them to the sink to gain energy conservation. Recently, in [6,8,29,30], the authors have presented a comprehensive overview of various data clustering techniques for in-network data reduction in periodic WSNs. Generally, these techniques can be classified into three different categories as presented in Figure 1, which are feature extraction-based, model-based and feature selection-based data clustering.…”
Section: Background Workmentioning
confidence: 99%
“…(1) The first approach is to cluster the available data at different values of K and afterward compute validity measures to evaluate the quality of the attained clusters. There are many scalar validity measures in the literature [38] (2) The second approach is to work with an appropriately large number of K, then repeatedly decrease this number by combining similar clusters based on some predefined measures…”
Section: Fuzzy Clustering Many Clustering Algorithms Have Beenmentioning
confidence: 99%
“…Although the literature is replete with examples of data clustering strategies based on DE, WOA and BAT for the static domain, as, e.g., those presented in [13][14][15][16], little has been done for the dynamic environment due to the difficulties in handling data streams. The current state of dynamic clustering is therefore unsatisfactory as it mainly relies on algorithms based on techniques such as density microclustering and density grid based clustering, which require the tuning of several parameters to work effectively [17].…”
Section: Introductionmentioning
confidence: 99%
“…In terms of online microclustering, most algorithms in the literature are distance based [17,22,26], whereby new observations are either merged with existing microclusters or form new microclusters based on a distance threshold. The earliest form of distance based clustering strategy was the process of extracting information about a cluster into the form of a Clustering Feature (CF) vector.…”
Section: Introductionmentioning
confidence: 99%