2017
DOI: 10.1007/s10115-017-1022-8
|View full text |Cite
|
Sign up to set email alerts
|

Prequential AUC: properties of the area under the ROC curve for data streams with concept drift

Abstract: Modern data-driven systems often require classifiers capable of dealing with streaming imbalanced data and concept changes. The assessment of learning algorithms in such scenarios is still a challenge, as existing online evaluation measures focus on efficiency, but are susceptible to class ratio changes over time. In case of static data, the area under the receiver operating characteristics curve, or simply AUC, is a popular measure for evaluating classifiers both on balanced and imbalanced class distributions… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
58
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
4
4

Relationship

1
7

Authors

Journals

citations
Cited by 108 publications
(68 citation statements)
references
References 38 publications
(93 reference statements)
1
58
0
Order By: Relevance
“…A quite recent study [30] introduces an efficient algorithm for calculating Prequential AUC , suitable for assessing classifiers on evolving data streams. Its statistical properties and comparison against simpler point measures, such as G-mean or Kappa statistics, has been examined in [33] .…”
Section: Evaluation Measuresmentioning
confidence: 99%
“…A quite recent study [30] introduces an efficient algorithm for calculating Prequential AUC , suitable for assessing classifiers on evolving data streams. Its statistical properties and comparison against simpler point measures, such as G-mean or Kappa statistics, has been examined in [33] .…”
Section: Evaluation Measuresmentioning
confidence: 99%
“…The microclusters with the highest number of points N is then moved to an empty set C to initialise a new cluster. After calculating its centre c, with Equation (20), and radius r, with Equation (21), the -neighbourhood method is again used to find density reachable microclusters. Among them, a process is undertaken to detect the so-called border microclusters [35] inside C, which obviously are not present during the first iteration as C initially contains only one microcluster.…”
Section: Detecting and Forming New Clustersmentioning
confidence: 99%
“…There are two fundamentals aspects to take into consideration in data stream clustering, namely concept drift and concept evolution. The first aspect refers to the phenomenon when the data in the stream undergo changes in the statistical properties of the clusters with respect to the time [19,20] while the second to the event when there is an unseen novel cluster appearing in the stream [5,21].…”
Section: Introductionmentioning
confidence: 99%
“…However, AUC is only suitable for the offline learning situation. To improve the traditional AUC for online learning conditions, Brzezinski et al [29] modified the AUC and proposed the prequential area under the curve (PAUC). The value of the indicator is continually updated in the online learning situation.…”
Section: Evaluation Metricsmentioning
confidence: 99%