Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval 2015
DOI: 10.1145/2766462.2767836
|View full text |Cite
|
Sign up to set email alerts
|

Adapted B-CUBED Metrics to Unbalanced Datasets

Abstract: B-CUBED metrics have recently been adopted in the evaluation of clustering results as well as in many other related tasks. However, this family of metrics is not well adapted when datasets are unbalanced. This issue is extremely frequent in Web results, where classes are distributed following a strong unbalanced pattern. In this paper, we present a modified version of B-CUBED metrics to overcome this situation. Results in toy and real datasets indicate that the proposed adaptation correctly considers the parti… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 7 publications
(27 reference statements)
0
2
0
Order By: Relevance
“…F P and F B are two widely used metrics to evaluate clustering performance. Basically, the performance of F P and F B is dominated by the large-size clusters [2,21]. Note that we omit the metric of NMI in prior works [31,35,36,38,39,41] due to its tendency to choose the results with large number of clusters [1].…”
Section: Metrics For Face Clusteringmentioning
confidence: 99%
See 1 more Smart Citation
“…F P and F B are two widely used metrics to evaluate clustering performance. Basically, the performance of F P and F B is dominated by the large-size clusters [2,21]. Note that we omit the metric of NMI in prior works [31,35,36,38,39,41] due to its tendency to choose the results with large number of clusters [1].…”
Section: Metrics For Face Clusteringmentioning
confidence: 99%
“…In addition, performance of face clustering methods is usually evaluated based on Pairwise F-score (F P ) [32] and BCubed F-score (F B ) [2]. The two traditional metrics are biased toward large-size clusters [2,21], which grossly neglect the negative impact of incorrect partitions on small-size clusters. Those clusters create lots of burdens for subsequent applications because they misinformed the true number of clusters.…”
Section: Introductionmentioning
confidence: 99%