2019
DOI: 10.1007/s11222-019-09858-0
|View full text |Cite
|
Sign up to set email alerts
|

Multiple changepoint detection in categorical data streams

Abstract: The need for efficient tools is pressing in the era of big data, particularly in streaming data applications. As data streams are ubiquitous, the ability to accurately detect multiple changepoints, without affecting the continuous flow of data, is an important issue. Change detection for categorical data streams is understudied, and existing work commonly introduces fixed control parameters while providing little insight into how they may be chosen. This is ill-suited to the streaming paradigm, motivating the … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
4
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
4
1

Relationship

0
10

Authors

Journals

citations
Cited by 23 publications
(5 citation statements)
references
References 41 publications
0
4
0
Order By: Relevance
“…Höhle (2010) presents methods to detect changes in categorical time series using multi-categorical regression models where they monitor the category probabilities seen within the data. Recently, Plasse and Adams (2019) propose to detect changepoints in multiple categorical data streams that monitor the category probabilities of a multinomial distribution. Generally these approaches focus on estimating changes in the probabilities of sensors activating within a single day but will not detect changes in the order of the events.…”
Section: Motivation and Structurementioning
confidence: 99%
“…Höhle (2010) presents methods to detect changes in categorical time series using multi-categorical regression models where they monitor the category probabilities seen within the data. Recently, Plasse and Adams (2019) propose to detect changepoints in multiple categorical data streams that monitor the category probabilities of a multinomial distribution. Generally these approaches focus on estimating changes in the probabilities of sensors activating within a single day but will not detect changes in the order of the events.…”
Section: Motivation and Structurementioning
confidence: 99%
“…In Ref. [26] the authors approximate the multinomial distribution of a stream of values of a categorical feature by using a relative frequencies histogram. Their change detection is based on the Kullback Leibler (KL) divergence between static and adaptive estimates of the multinomial densities.…”
Section: Related Workmentioning
confidence: 99%
“…Muhammad et al [15] proposed a Monte Carlo tree search (MTCS) method for retrieving features from very high dimensional data. Multiple change detection methods have been proposed for categorical data stream by Joshua Plasse and Niali Adams [16]. Kyosuke Nishida and Koichiro Yamauchi [17] developed a STEPD algorithm which uses a statistical test for equal proportion for online environment.…”
Section: Related Workmentioning
confidence: 99%