Cost-Sensitive Boosting

Masnadi-Shirazi, Hamed; Vasconcelos, Nuno

doi:10.1109/tpami.2010.71

Cited by 158 publications

(144 citation statements)

References 48 publications

Supporting

Mentioning

142

Contrasting

Unclassified

Order By: Relevance

“…As for the other variants, they are all methods that modify the training algorithm. CSB0 and CSB1 [17] do not use confidence rated predictions and based on the results of comparative studies [9,10,15], the two variants are typically dominated by CSB2. Asymmetric-Adaboost [18] was excluded from said studies as being similar to CSB2.…”

Section: Discussionmentioning

confidence: 99%

“…AdaCost [2,17] is also outperformed by AdaC2 and CSB2 and so is AdaC3 [16]. CS-AdaBoost [9,10], despite being the only method other than AdaC2 with a solid theoretical basis, has been characterized as 'time-consuming and imprecise' [19], as it lacks a closed form solution for α t and the optimization of its parameters is therefore computationally intensive.…”

Section: Discussionmentioning

confidence: 99%

“…To our knowledge, the only previous attempt at directly comparing asymmetric AdaBoost variants to calibrated AdaBoost was by Masnadi-Shirazi and Vasconselos [10]. The comparison was performed on imbalanced data, it included AdaC2 and CSB2 and the performance of calibrated AdaBoost was found to be slightly inferior to theirs.…”

Section: Discussionmentioning

confidence: 99%

“…However it is often regarded as skew-insensitive [15,17], meaning it is unable to handle asymmetric tasks. There exist many skew-sensitive AdaBoost variants, including AdaCost [2,17], CSB0, CSB1, CSB2 [17], Asymmetric-Adaboost [18], RareBoost [6], AdaC1, AdaC2, AdaC3 [16], CS-AdaBoost [9,10]. However, most of them are heuristic and as a result they lack the theoretical guarantees of the original AdaBoost [7].…”

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Calibrating AdaBoost for Asymmetric Learning

Νικολάου

Brown

2015

Multiple Classifier Systems

View full text Add to dashboard Cite

Abstract. Asymmetric classification problems are characterized by class imbalance or unequal costs for different types of misclassifications. One of the main cited weaknesses of AdaBoost is its perceived inability to handle asymmetric problems. As a result, a multitude of asymmetric versions of AdaBoost have been proposed, mainly as heuristic modifications to the original algorithm. In this paper we challenge this approach and propose instead handling asymmetric tasks by properly calibrating the scores of the original AdaBoost so that they correspond to probability estimates. We then account for the asymmetry using classic decision theoretic approaches. Empirical comparisons of this approach against the most representative asymmetric Adaboost variants show that it compares favorably. Moreover, it retains the theoretical guarantees of the original AdaBoost and it can easily be adjusted to account for changes in class imbalance or costs without need for retraining.

show abstract

Section: Discussionmentioning

confidence: 99%

Section: Discussionmentioning

confidence: 99%

Section: Discussionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Calibrating AdaBoost for Asymmetric Learning

Νικολάου

Brown

2015

Multiple Classifier Systems

View full text Add to dashboard Cite

show abstract

“…Bagging [10] and boosting [30] are two popular methods for building ensembles of classifiers with a rich history of extensions [17,31,39,61,74,78]. In this section we outline various approaches which have been taken to make bagging and boosting methods overcome concept drift.…”

Section: Bagging and Boosting Based Methodsmentioning

confidence: 99%

Learning from streaming data with concept drift and imbalance: an overview

2012

View full text Add to dashboard Cite

The primary focus of machine learning has traditionally been on learning from data assumed to be sufficient and representative of the underlying fixed, yet unknown, distribution. Such restrictions on the problem domain paved the way for development of elegant algorithms with theoretically provable performance guarantees. As is often the case, however, real-world problems rarely fit neatly into such restricted models. For instance class distributions are often skewed, resulting in the "class imbalance" problem. Data drawn from non-stationary distributions is also common in real-world applications, resulting in the "concept drift" or "non-stationary learning" problem which is often associated with streaming data scenarios. Recently, these problems have independently experienced increased research attention, however, the combined problem of addressing all of the above mentioned issues has enjoyed relatively little research. If the ultimate goal of intelligent machine learning algorithms is to be able to address a wide spectrum of real-world scenarios, then the need for a general framework for learning from, and adapting to, a non-stationary environment that may introduce imbalanced data can be hardly overstated. In this paper, we first present an overview of each of these challenging areas, followed by a comprehensive review of recent research for developing such a general framework.

show abstract

Model ensembles

Cichosz

2015

Data Mining Algorithms

View full text Add to dashboard Cite

Cost-Sensitive Boosting

Cited by 158 publications

References 48 publications

Calibrating AdaBoost for Asymmetric Learning

Calibrating AdaBoost for Asymmetric Learning

Learning from streaming data with concept drift and imbalance: an overview

Model ensembles

Contact Info

Product

Resources

About