2022 21st ACM/IEEE International Conference on Information Processing in Sensor Networks (IPSN) 2022
DOI: 10.1109/ipsn54338.2022.00029
|View full text |Cite
|
Sign up to set email alerts
|

BalanceFL: Addressing Class Imbalance in Long-Tail Federated Learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
8
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 32 publications
(8 citation statements)
references
References 22 publications
0
8
0
Order By: Relevance
“…However, previous studies specifically take into account the label skew non-IID when each client has a fixed number of classes (label size imbalance [8], [9], [28], [43]- [45]) or when the number of samples of a certain class is distributed to clients using the power-law or Dirichlet distribution (label distribution imbalance [13], [27], [39], [46]). Recently, some works consider the non-IID scenario which is more close to the real-world data such as the numbers of classes are often highly imbalanced [29], or following the cluster-skew non-IID distribution [30], [31]. Especially, cluster-skew is firstly introduced by [30] where there exists a data correlation between clients.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…However, previous studies specifically take into account the label skew non-IID when each client has a fixed number of classes (label size imbalance [8], [9], [28], [43]- [45]) or when the number of samples of a certain class is distributed to clients using the power-law or Dirichlet distribution (label distribution imbalance [13], [27], [39], [46]). Recently, some works consider the non-IID scenario which is more close to the real-world data such as the numbers of classes are often highly imbalanced [29], or following the cluster-skew non-IID distribution [30], [31]. Especially, cluster-skew is firstly introduced by [30] where there exists a data correlation between clients.…”
Section: Related Workmentioning
confidence: 99%
“…In particular, non-IID data may slow down the model convergence, destabilize local training at clients, and degrade model accuracy in consequence [15]- [19]. Numerous efforts have been devoted to overcoming the non-IID issue, which may be classified into two primary categories: (i) reduce the impact of non-IID data by optimizing the aggregation [10], [20], [21] or by optimizing the method to select the client each round [22]- [24] on the server-side, and (ii) enhancing training on the client side [9], [25]- [29]. However, current research on non-IID faces the two critical issues as follows.…”
Section: Introductionmentioning
confidence: 99%
“…The Astraea offers to solve the global imbalance problem by z-score-based data augmentation and down sampling as well as a mediator for rescheduling the attendance of collaborators by exploiting the Kullback-Leibler divergence of their distributions [18]. Shuai et al [19] offered BalanceFL method which aims to solve the local and global data imbalance by updating the locally trained models as it is trained on uniform data. In the BalanceFL study, the local imbalance problems are defined as causing from two reasons such as data amount and class missing, which are solved respectively by inter-class balancing and knowledge inheritance techniques [19].…”
Section: Fl Applications On Imbalanced Datamentioning
confidence: 99%
“…Shuai et al [19] offered BalanceFL method which aims to solve the local and global data imbalance by updating the locally trained models as it is trained on uniform data. In the BalanceFL study, the local imbalance problems are defined as causing from two reasons such as data amount and class missing, which are solved respectively by inter-class balancing and knowledge inheritance techniques [19].…”
Section: Fl Applications On Imbalanced Datamentioning
confidence: 99%
“…Accompanied by the prospect of FL, it faces the challenge of data heterogeneity, which is a universal issue arising from disparities in data distribution among clients due to data source differences between clients (Karimireddy et al 2020;. Furthermore, real-world data frequently exhibits the long-tailed phenomenon (Shuai et al 2022), that is, the class distribution is imbalanced, where the head classes have a large number of samples while the tail classes have a small number of samples. Longtailed data makes client-side models biased toward the head class (Shang et al 2022).…”
Section: Introductionmentioning
confidence: 99%