Proceedings of the Canadian Conference on Artificial Intelligence 2021
DOI: 10.21428/594757db.da1a3d44
Class-wise Calibration: A Case Study on COVID-19 Hate Speech

Abstract: Proper calibration of deep-learning models is critical for many high-stakes problems. In this paper, we show that existing calibration metrics fail to pay attention to miscalibration on individual classes, hence overlooking minority classes and causing significant issues on imbalanced classification problems. Using a COVID-19 hate-speech dataset, we first discover that in imbalanced datasets, miscalibration error on an individual class varies greatly, and error on minority classes can be an order of magnitude worse …
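The per-class miscalibration the abstract describes can be measured with a class-wise expected calibration error (ECE): for each class, bin that class's predicted probabilities and compare mean confidence against empirical frequency in each bin. A minimal NumPy sketch of that idea follows; this is our own illustration, and the paper's exact metric may differ in binning or weighting.

```python
import numpy as np

def classwise_ece(probs, labels, n_bins=10):
    """Per-class expected calibration error.

    For each class c, bin the predicted probabilities for c and sum
    the bin-weighted gaps between mean confidence and the empirical
    frequency of class c in that bin.
    """
    probs = np.asarray(probs, dtype=float)
    labels = np.asarray(labels)
    n, k = probs.shape
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    eces = np.zeros(k)
    for c in range(k):
        conf = probs[:, c]                    # confidence assigned to class c
        hit = (labels == c).astype(float)     # 1 where class c is the true label
        for b in range(n_bins):
            lo, hi = edges[b], edges[b + 1]
            # left-closed first bin so probability 0.0 is not dropped
            mask = (conf >= lo) & (conf <= hi) if b == 0 else (conf > lo) & (conf <= hi)
            if mask.any():
                gap = abs(conf[mask].mean() - hit[mask].mean())
                eces[c] += mask.mean() * gap  # weight gap by bin occupancy
    return eces                               # one ECE value per class
```

On an imbalanced dataset, inspecting this vector (rather than a single pooled ECE) is what exposes the minority-class errors the paper highlights.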

Cited by 3 publications (5 citation statements)
References 20 publications
“…Unlike vector and matrix scaling, RD-TS cannot change the relative ranking of logits, and therefore model accuracy is retained (in single-label settings). One line of future work could be to apply RD-TS on top of weighted temperature scaling, a method known to decrease variance in calibration error among classes (Obadinma et al, 2021). Another line of work would be to investigate whether improved certainty estimates can increase model accuracy (in multi-label settings where predictions are applied by meeting a certainty threshold), especially in out-of-domain problems.…”
Section: Discussion
“…Weighted temperature scaling (WTS): TS using a class-weighted NLL loss during convergence (Obadinma et al, 2021).…”
Section: Matrix Scaling (MS)
“…To prevent the model trained on a long-tailed dataset from being skewed to high-frequency classes, Islam et al. proposed a class distribution-aware temperature scaling, which uses class-frequency information to set the temperature value. Similarly, weighted temperature scaling, proposed by Obadinma et al. [149], re-weights the loss function by the inverse class count to tune the scaling parameter.…”
Section: E. Parametric Methods
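The weighted temperature scaling described in these excerpts can be sketched as ordinary temperature scaling whose NLL objective weights each held-out sample by the inverse count of its true class, so minority classes contribute equally to the fit. The sketch below is our own: function and parameter names are invented, and a coarse grid search stands in for a gradient-based optimizer.

```python
import numpy as np

def weighted_temperature(logits, labels, n_classes, t_grid=None):
    """Fit a single temperature T by minimizing a class-weighted NLL.

    Each sample is weighted by the inverse frequency of its true class
    before averaging, which is the re-weighting idea attributed to
    weighted temperature scaling (Obadinma et al., 2021).
    """
    logits = np.asarray(logits, dtype=float)
    labels = np.asarray(labels)
    counts = np.bincount(labels, minlength=n_classes).astype(float)
    w = 1.0 / np.maximum(counts, 1.0)       # inverse class frequency
    sample_w = w[labels]                    # per-sample weight
    if t_grid is None:
        t_grid = np.linspace(0.25, 5.0, 200)  # simple grid search over T

    def weighted_nll(t):
        z = logits / t
        z -= z.max(axis=1, keepdims=True)   # numerical stability
        logp = z - np.log(np.exp(z).sum(axis=1, keepdims=True))
        picked = logp[np.arange(len(labels)), labels]
        return -(sample_w * picked).sum() / sample_w.sum()

    return min(t_grid, key=weighted_nll)
```

Because dividing all logits by one scalar T cannot reorder them, accuracy is unchanged in single-label settings, which is the same ranking-preservation property the RD-TS excerpt above notes for temperature-based methods.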
“…[Flattened excerpt of a survey comparison table of parametric calibration methods and their code availability: Transfer Knowledge from Head to Tail [16] (Scipy); Class-wise Loss Scaling [150] (PyTorch); Region-dependent Temperature Scaling [143] (URL invalid); Class-wise Temperature Scaling [144] (N/A); Normalized Calibration [17] (PyTorch); Gaussian and Gamma Calibration [151] (URL invalid); CS-TS-ATC [145] (PyTorch); Dual-Branch Temperature Scaling [148] (N/A); Class-distribution-aware TS [21] (URL invalid); Weighted Temperature Scaling [149].]…”
Section: Parametric Methods (Section IV-E)