2014 22nd International Conference on Pattern Recognition 2014
DOI: 10.1109/icpr.2014.344
|View full text |Cite
|
Sign up to set email alerts
|

Hellinger Distance Trees for Imbalanced Streams

Abstract: Abstract-Classifiers trained on data sets possessing an imbalanced class distribution are known to exhibit poor generalisation performance. This is known as the imbalanced learning problem. The problem becomes particularly acute when we consider incremental classifiers operating on imbalanced data streams, especially when the learning objective is rare class identification. As accuracy may provide a misleading impression of performance on imbalanced data, existing stream classifiers based on accuracy can suffe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

1
31
0

Year Published

2016
2016
2021
2021

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 27 publications
(32 citation statements)
references
References 24 publications
1
31
0
Order By: Relevance
“…Data streams are quasi-infinite sequences of information, which are temporally ordered and indeterminable in size (Gaber et al 2005;Lyon et al 2013Lyon et al , 2014. Data streams are produced by many modern computer systems (Gaber et al 2005) and are likely to arise from the increasing volumes of data output by modern radio telescopes, especially the SKA.…”
Section: Stream Classificationmentioning
confidence: 99%
See 4 more Smart Citations
“…Data streams are quasi-infinite sequences of information, which are temporally ordered and indeterminable in size (Gaber et al 2005;Lyon et al 2013Lyon et al , 2014. Data streams are produced by many modern computer systems (Gaber et al 2005) and are likely to arise from the increasing volumes of data output by modern radio telescopes, especially the SKA.…”
Section: Stream Classificationmentioning
confidence: 99%
“…Data streams are produced by many modern computer systems (Gaber et al 2005) and are likely to arise from the increasing volumes of data output by modern radio telescopes, especially the SKA. However many of the effective supervised machine learning techniques used for candidate selection do not work with streams (Lyon et al 2014). Adapting existing methods for use with streams is challenging, it remains an active goal of data mining research (Yang & Wu 2006;Gaber et al 2007).…”
Section: Stream Classificationmentioning
confidence: 99%
See 3 more Smart Citations