Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 2015
DOI: 10.1145/2783258.2783359
|View full text |Cite
|
Sign up to set email alerts
|

A PCA-Based Change Detection Framework for Multidimensional Data Streams

Abstract: Detecting changes in multidimensional data streams is an important and challenging task. In unsupervised change detection, changes are usually detected by comparing the distribution in a current (test) window with a reference window. It is thus essential to design divergence metrics and density estimators for comparing the data distributions, which are mostly done for univariate data. Detecting changes in multidimensional data streams brings difficulties to the density estimation and comparisons. In this paper… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
58
0

Year Published

2016
2016
2021
2021

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 99 publications
(66 citation statements)
references
References 22 publications
0
58
0
Order By: Relevance
“…Novelty detection/ clustering methods OLINDDA (Spinosa et al, 2007), MINAS (Faria et al, 2013), Woo (Ryu et al, 2012), DETECTNOD (Hayat and Hashemi, 2010), ECSMiner (Masud et al, 2011), GC3 (Sethi et al, 2016b) Multivariate distribution monitoring CoC (Lee and Magoules, 2012), HDDDM (Ditzler and Polikar, 2011), PCA-detect (Kuncheva and Faithfull, 2014;Qahtan et al, 2015) Model dependent monitoring A-distance (Dredze et al, 2010), CDBD (Lindstrom et al, 2013), Margin (Dries and Rückert, 2009) when it matters. Implicit drift detectors rely on properties of the unlabeled data's feature values, to signal deviations.…”
Section: Review Of Research On Concept Drift Detectionmentioning
confidence: 99%
See 1 more Smart Citation
“…Novelty detection/ clustering methods OLINDDA (Spinosa et al, 2007), MINAS (Faria et al, 2013), Woo (Ryu et al, 2012), DETECTNOD (Hayat and Hashemi, 2010), ECSMiner (Masud et al, 2011), GC3 (Sethi et al, 2016b) Multivariate distribution monitoring CoC (Lee and Magoules, 2012), HDDDM (Ditzler and Polikar, 2011), PCA-detect (Kuncheva and Faithfull, 2014;Qahtan et al, 2015) Model dependent monitoring A-distance (Dredze et al, 2010), CDBD (Lindstrom et al, 2013), Margin (Dries and Rückert, 2009) when it matters. Implicit drift detectors rely on properties of the unlabeled data's feature values, to signal deviations.…”
Section: Review Of Research On Concept Drift Detectionmentioning
confidence: 99%
“…This has motivated the development of unlabeled drift detection techniques (Ditzler and Polikar, 2011;da Costa et al, 2016), which monitor changes to the feature distribution, as an early indicator of drift. However, existing methods using unlabeled data are essentially change detection techniques that detect any change to the data distribution, irrespective of its effects on the classification process (Lee and Magoules, 2012;Ditzler and Polikar, 2011;Kuncheva and Faithfull, 2014;Qahtan et al, 2015;da Costa et al, 2016). For the task of classification, change is relevant only when it causes model performance to degrade.…”
Section: Introductionmentioning
confidence: 99%
“…Intuitively, if the selectivity distribution is unchanged, the loading vectors 3 of the principal components are relatively stable, otherwise there can be certain rotation to the loading vectors [30]. By this means, we are able to detect the stream variation by monitoring the principal components of evaluation order vectors.…”
Section: Selectivity Monitoringmentioning
confidence: 99%
“…so different value range will affect the filters' significance in query execution. In this study, we tune the popularity range as [10,20], [20,30], [30,40] and [40, 50], i.e., 10% to 30% sharing degree. For each range, we generate the problem instances, create corresponding data streams, and run all strategies on the streams with simulated changing selectivity.…”
Section: Effect Of Popularity Rangementioning
confidence: 99%
See 1 more Smart Citation