Deep neural networks (DNNs) have achieved impressive results on a variety of supervised learning tasks, owing to the availability of large-scale, well-labeled training data. However, as recent studies have pointed out, the generalization performance of DNNs is likely to deteriorate sharply when the training data contain label noise. To address this problem, a novel loss function is proposed that guides DNNs to pay more attention to clean samples by adaptively weighting the traditional cross-entropy loss. Under the guidance of this loss function, a cross-training strategy is designed that leverages two synergistic DNN models, each of which both updates its own parameters and generates curriculums for the other. In addition, this paper further proposes an online data filtration mechanism and integrates it into the final cross-training framework, which simultaneously optimizes the DNN models and filters out noisy samples. The proposed approach is evaluated through extensive experiments on several benchmark datasets with synthetic or real-world label noise, and the results demonstrate its robustness to different noise types and noise levels.

INDEX TERMS Deep neural networks, label noise, cross-training, loss function, data filtration.
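To make the idea of adaptively weighting the cross-entropy loss concrete, the sketch below shows one plausible instantiation in PyTorch. The function name `adaptive_weighted_ce`, the confidence-based weighting scheme, and the exponent `gamma` are illustrative assumptions, not the paper's exact formulation: each sample's cross-entropy term is scaled by the network's confidence in the given label, so likely-clean samples dominate the gradient while suspected noisy samples are down-weighted.

```python
import torch
import torch.nn.functional as F

def adaptive_weighted_ce(logits, targets, gamma=1.0):
    """Hypothetical sketch: per-sample cross-entropy scaled by a
    confidence-based weight. Samples on which the network assigns high
    probability to the given label (likely clean) receive larger weights;
    `gamma` controls how strongly low-confidence samples are suppressed."""
    ce = F.cross_entropy(logits, targets, reduction="none")            # per-sample loss
    with torch.no_grad():
        probs = F.softmax(logits, dim=1)
        p_label = probs.gather(1, targets.unsqueeze(1)).squeeze(1)     # prob. of the given label
        weights = p_label.pow(gamma)                                   # adaptive weight in [0, 1]
    return (weights * ce).mean()

# Usage in a training step (model, images, labels assumed defined):
# loss = adaptive_weighted_ce(model(images), labels, gamma=2.0)
# loss.backward()
```

Note that the weights are computed under `torch.no_grad()`, so in this sketch they act purely as a curriculum signal and do not themselves receive gradients.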
I. INTRODUCTION

Recently, deep neural networks (DNNs) have achieved remarkable success on supervised machine learning tasks such as image classification, object detection, and semantic analysis. The excellent performance of DNNs is mainly attributed to the availability of massive, well-labeled data samples. However, manually annotating large-scale datasets is prohibitively costly. Crowdsourcing [1] and search engines [2] are alternative ways of obtaining labeled data, but they are likely to introduce label noise, i.e., mislabeled samples. Although Rolnick et al. [3] have noted that DNNs are able to generalize well after training on noisy data, this requires a sufficiently large number of clean samples. Unfortunately, when only a limited number of correct samples are mixed with label-corrupted ones, the generalization performance of DNNs degrades dramatically [4]-[8]. Taking the popular deep learning model Wide-ResNet [9] as an example, Fig. 1 illustrates the negative effect on its test performance when different levels of label noise are introduced into the benchmark image datasets CIFAR-10 and