Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 2001
DOI: 10.1145/502512.502568
|View full text |Cite
|
Sign up to set email alerts
|

A streaming ensemble algorithm (SEA) for large-scale classification

Abstract: Ensemble methods have recently garnered a great deal of attention in the machine learning community. Techniques such as Boosting and Bagging have proven to be highly effective but require repeated resampling of the training data, making them inappropriate in a data mining context. The methods presented in this paper take advantage of plentiful data, building separate classifiers on sequential chunks of training points. These classifiers are combined into a fixedsize ensemble using a heuristic replacement strat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

3
657
0
4

Year Published

2004
2004
2022
2022

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 958 publications
(664 citation statements)
references
References 19 publications
3
657
0
4
Order By: Relevance
“…To overcome this drawback, an evaluation of the change detection algorithms was performed using the SEA concepts data set [38], a benchmark problem for concept drift. Figure 14 shows the error rate (computed using a naive-Bayes classifier), which presents three drifts.…”
Section: Experiments On a Public Data Setmentioning
confidence: 99%
“…To overcome this drawback, an evaluation of the change detection algorithms was performed using the SEA concepts data set [38], a benchmark problem for concept drift. Figure 14 shows the error rate (computed using a naive-Bayes classifier), which presents three drifts.…”
Section: Experiments On a Public Data Setmentioning
confidence: 99%
“…This artificial dataset contains abrupt concept drift, first introduced in [23]. It is generated using three attributes, where only the two first attributes are relevant.…”
Section: Sea Conceptsmentioning
confidence: 99%
“…A substantial amount of recent work has focused on continuous mining of data streams [4,10,11,15,16]. Typical applications include network traffic monitoring, credit card fraud detection and sensor network management systems.…”
Section: Introductionmentioning
confidence: 99%
“…Previous ensemble methods for drifting data streams have primarily relied on bagging-style techniques [15,16]. Street et al [15] gave an ensemble algorithm that builds one classifier per data block independently.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation