2014
DOI: 10.5430/air.v3n1p38
|View full text |Cite
|
Sign up to set email alerts
|

A statistical approach for clustering in streaming data

Abstract: Recently data stream has been extensively explored due to its emergence in large deal of applications such as sensor networks, web click streams and network flows. Vast majority of researches in the context of data stream mining are devoted to supervise learning, whereas, in real word human practice label of data are rarely available to the learning algorithms. Hence, clustering as the most important unsupervised learning has been in the gravity of focus of quite a lot number of the researchers in data stream … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
4
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 47 publications
0
4
0
Order By: Relevance
“…Examples of this strategy have been reported previously. 3,5,[12][13][14][15] We refer to this second approach as online or incremental clustering. Algorithms for online clustering can themselves be divided into two subcategories.…”
Section: Background and Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Examples of this strategy have been reported previously. 3,5,[12][13][14][15] We refer to this second approach as online or incremental clustering. Algorithms for online clustering can themselves be divided into two subcategories.…”
Section: Background and Related Workmentioning
confidence: 99%
“…This strategy is espoused, for example, in [9], [10], [11]; (2) using incremental learning techniques to find clusters in the evolving data stream. Examples of this strategy include [3], [5], [12], [13], [14], [15]. We refer to this second approach as online or incremental clustering.…”
Section: Background and Related Workmentioning
confidence: 99%
“…They maintain cluster "footprints" that summarize the clusters discovered and have a mechanism of incrementally updating those footprints as new vectors arrive. In [9]- [13], underlying probabilistic models are used and the footprint contains probability distribution parameters. Many of the density based streaming models also contain footprint entries that allow calculation of basic summary statistics [14]- [18].…”
Section: Introductionmentioning
confidence: 99%
“…[50][17][51][52], underlying probabilistic models are used and the footprint contains probability distribution parameters. Many of the density-based streaming models also contain footprint entries that allow calculation of basic summary statistics[53][54][55][56][57].…”
mentioning
confidence: 99%