2013 Humaine Association Conference on Affective Computing and Intelligent Interaction 2013
DOI: 10.1109/acii.2013.47
|View full text |Cite
|
Sign up to set email alerts
|

Facing Imbalanced Data--Recommendations for the Use of Performance Metrics

Abstract: Recognizing facial action units (AUs) is important for situation analysis and automated video annotation. Previous work has emphasized face tracking and registration and the choice of features classifiers. Relatively neglected is the effect of imbalanced data for action unit detection. While the machine learning community has become aware of the problem of skewed data for training classifiers, little attention has been paid to how skew may bias performance metrics. To address this question, we conducted experi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

4
371
0
3

Year Published

2014
2014
2023
2023

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 583 publications
(378 citation statements)
references
References 25 publications
4
371
0
3
Order By: Relevance
“…In binary classifications, there are very few, if any, real-world scenarios where the data is evenly split between the positive and the negative category. The result of testing classifiers on highly skewed data that were created using balanced datasets, is generally the same: The performance drops for all threshold measures [15].…”
Section: Multi-gram Modelsmentioning
confidence: 92%
See 2 more Smart Citations
“…In binary classifications, there are very few, if any, real-world scenarios where the data is evenly split between the positive and the negative category. The result of testing classifiers on highly skewed data that were created using balanced datasets, is generally the same: The performance drops for all threshold measures [15].…”
Section: Multi-gram Modelsmentioning
confidence: 92%
“…Instead, we will here look at the F-Measure values when comparing the threshold performance of the models [15]. In Table 5, to the left, we can see the tri-gram performance using imbalanced testing data.…”
Section: Imbalanced Dataset Testingmentioning
confidence: 99%
See 1 more Smart Citation
“…Precision is the fraction of retrieved information relevant to the search while Recall is the fraction of the information related to the search query that is retrieved successfully [14,32]. The formulas for Sensitivity and Specificity are given below: …”
Section: Recall/sensitivity and Specificitymentioning
confidence: 99%
“…ǫ = misclassified test instances total test instances , while the latter measures the speed of the algorithm in seconds. It has been argued in machine learning literature, whether or not misclassification rate is a good accuracy measure, especially when dealing with datasets having considerable class skew [21]. However, other performance measures [22] are equally susceptible to class skew [21].…”
Section: B Performance Criteriamentioning
confidence: 99%