AnycostFL: Efficient On-Demand Federated Learning over Heterogeneous Edge Devices

Li, Peichun; Cheng, Guoliang; Huang, Xumin; Kang, Jiawen; Yu, Rong; Yuan, Wei; Pan, Miao

doi:10.48550/arxiv.2301.03062

Cited by 2 publications

(3 citation statements)

References 24 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Moreover, the unstructured pruning approach usually results in irregular weight matrixes in the pruned models that are difficult to compress, which requires specialized hardware and software libraries to accelerate the training speed [24]. To effectively decrease computation and communication overhead, the structured model pruning approach [20]- [22], [25], [26] was developed to prune both filters in convolution layers and neurons in FC layers to generate sub-models for devices to train. Note that in centralized learning, pruning filters in convolution layers have been demonstrated can effectively accelerate the learning speed without sacrificing too much accuracy [24], [27].…”

Section: A Related Workmentioning

confidence: 99%

“…The static model pruning approach in [21], [25] or local model composition approach in [28] distributed heterogeneous sub-models to devices for training and then aggregated them into a global inference model, which effectively reduced resource consumption for FL. The model shrinking and gradient compression approach in [26] enabled the local model training with elastic computation and communication overheads. The model pruning method in [22] dynamically adjusted the model size for resource-limited devices and significantly improved the cost-efficiency of FL.…”

Section: A Related Workmentioning

confidence: 99%

See 1 more Smart Citation

Adaptive Model Pruning for Communication and Computation Efficient Wireless Federated Learning

Chen,

Yi,

Shin

et al. 2024

IEEE Trans. Wireless Commun.

View full text Add to dashboard Cite

Most existing wireless federated learning (FL) studies focused on homogeneous model settings where devices train identical local models. In this setting, the devices with poor communication and computation capabilities may delay the global model update and degrade the performance of FL. Moreover, in the homogenous model settings, the scale of the global model is restricted by the device with the lowest capability. To tackle these challenges, this work proposes an adaptive model pruning-based FL (AMP-FL) framework, where the edge server dynamically generates sub-models by pruning the global model for devices' local training to adapt their heterogeneous computation capabilities and time-varying channel conditions. Since the involvement of diverse structures of devices' submodels in the global model updating may negatively affect the training convergence, we propose compensating for the gradients of pruned model regions by devices' historical gradients. We then introduce an age of information (AoI) metric to characterize the staleness of local gradients and theoretically analyze the convergence behaviour of AMP-FL. The convergence bound suggests scheduling devices with large AoI of gradients and pruning the model regions with small AoI for devices to improve the learning performance. Inspired by this, we define a new objective function, i.e., the average AoI of local gradients, to transform the inexplicit global loss minimization problem into a tractable one for device scheduling, model pruning, and resource block (RB) allocation design. Through detailed analysis, we derive the optimal model pruning strategy and transform the RB allocation problem into equivalent linear programming that can be effectively solved. Experimental results demonstrate the effectiveness and superiority of the proposed approaches. The proposed AMP-FL is capable of achieving 1.9x and 1.6x speed up for FL on MNIST and CIFAR-10 datasets in comparison with the FL schemes with homogeneous model settings.

show abstract

Section: A Related Workmentioning

confidence: 99%

Section: A Related Workmentioning

confidence: 99%

Adaptive Model Pruning for Communication and Computation Efficient Wireless Federated Learning

Chen,

Yi,

Shin

et al. 2024

IEEE Trans. Wireless Commun.

View full text Add to dashboard Cite

show abstract

“…Since then, significant strides have been made to improve FL's performance. There have been many studies trying to improve the performance of FL from different perspectives, such as: model heterogeneity, 47,48 non independently and identically distributed (non-IID) data, 18,49 communication efficiency, [50][51][52] robust FL. 26,41 However, most of the existing studies in FL assume that every client has a clean dataset are not designed for tackling with noisy labels.…”

Section: Related Workmentioning

confidence: 99%

An edge‐assisted federated contrastive learning method with local intrinsic dimensionality in noisy label environment

Wu,

Zhang,

Dai

et al. 2023

Softw Pract Exp

View full text Add to dashboard Cite

The advent of federated learning (FL) has presented a viable solution for distributed training in edge environment, while simultaneously ensuring the preservation of privacy. In real‐world scenarios, edge devices may be subject to label noise caused by environmental differences, automated weakly supervised annotation, malicious tampering, or even human error. However, the potential of the noisy samples have not been fully leveraged by prior studies on FL aimed at addressing label noise. Rather, they have primarily focused on conventional filtering or correction techniques to alleviate the impact of noisy labels. To tackle this challenge, a method, named DETECTION, is proposed in this article. It aims at effectively detecting noisy clients and mitigating the adverse impact of label noise while preserving data privacy. Specially, a confidence scoring mechanism based on local intrinsic dimensionality (LID) is investigated for distinguishing noisy clients from clean clients. Then, a loss function based on prototype contrastive learning is designed to optimize the local model. To address the varying levels of noise across clients, a LID weighted aggregation strategy (LA) is introduced. Experimental results on three datasets demonstrate the effectiveness of DETECTION in addressing the issue of label noise in FL while maintaining data privacy.

show abstract

AnycostFL: Efficient On-Demand Federated Learning over Heterogeneous Edge Devices

Cited by 2 publications

References 24 publications

Adaptive Model Pruning for Communication and Computation Efficient Wireless Federated Learning

Adaptive Model Pruning for Communication and Computation Efficient Wireless Federated Learning

An edge‐assisted federated contrastive learning method with local intrinsic dimensionality in noisy label environment

Contact Info

Product

Resources

About