Communication-Efficient Federated Learning via Optimal Client Sampling

Ribero, Mónica; Vikalo, Haris

doi:10.48550/arxiv.2007.15197

Cited by 18 publications

(29 citation statements)

References 44 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…datasets as indicated in their results. Some other empirical studies with similar approaches include [15], [16], but also do not consider or derive convergence bounds for their selection strategies. Some later work began to include analysis of the convergence of FL with device selection.…”

Section: R Wmentioning

confidence: 99%

“…Since the objective is an independent sum over n, we can perform the minimization separately for each device n. Algorithm 2 details to process in determining the optimal P n (t) and q t n in each round. We now present Theorem 2 which gives an analytical solution to (15) that can be computed distributively by the devices.…”

Section: Algorithm 2: Stochastic Client Samplingmentioning

confidence: 99%

“…Theorem 2. The solution to (15) is given by Algorithm 2 where the optimal values for each n is given by either the endpoints, i.e., P opt n (t) = P max , q t n = 1 or by…”

Section: Algorithm 2: Stochastic Client Samplingmentioning

confidence: 99%

“…To find the roots, we compute the gradient of the objective function for each n in (15) ∇f (q t n , P n (t)) =   …”

Section: A Proof Of Theoremmentioning

confidence: 99%

See 3 more Smart Citations

Communication-Efficient Device Scheduling for Federated Learning Using Stochastic Optimization

Perazzone¹,

Wang²,

Ji³

et al. 2022

Preprint

View full text Add to dashboard Cite

Federated learning (FL) is a useful tool in distributed machine learning that utilizes users' local datasets in a privacy-preserving manner. When deploying FL in a constrained wireless environment; however, training models in a time-efficient manner can be a challenging task due to intermittent connectivity of devices, heterogeneous connection quality, and non-i.i.d. data. In this paper, we provide a novel convergence analysis of nonconvex loss functions using FL on both i.i.d. and non-i.i.d. datasets with arbitrary device selection probabilities for each round. Then, using the derived convergence bound, we use stochastic optimization to develop a new client selection and power allocation algorithm that minimizes a function of the convergence bound and the average communication time under a transmit power constraint. We find an analytical solution to the minimization problem. One key feature of the algorithm is that knowledge of the channel statistics is not required and only the instantaneous channel state information needs to be known. Using the FEMNIST and CIFAR-10 datasets, we show through simulations that the communication time can be significantly decreased using our algorithm, compared to uniformly random participation.

show abstract

Section: R Wmentioning

confidence: 99%

Section: Algorithm 2: Stochastic Client Samplingmentioning

confidence: 99%

“…Theorem 2. The solution to (15) is given by Algorithm 2 where the optimal values for each n is given by either the endpoints, i.e., P opt n (t) = P max , q t n = 1 or by…”

Section: Algorithm 2: Stochastic Client Samplingmentioning

confidence: 99%

“…To find the roots, we compute the gradient of the objective function for each n in (15) ∇f (q t n , P n (t)) =   …”

Section: A Proof Of Theoremmentioning

confidence: 99%

See 2 more Smart Citations

Communication-Efficient Device Scheduling for Federated Learning Using Stochastic Optimization

Perazzone¹,

Wang²,

Ji³

et al. 2022

Preprint

View full text Add to dashboard Cite

show abstract

“…They propose a federated averaging scheme where the aggregation is weighted by probabilities of devices being inactive at a given communication round. Another selection approach is proposed in [9]. The approach suggests that clients with the most significant local updates are selected.…”

Section: Introductionmentioning

confidence: 99%

Client Selection in Federated Learning based on Gradients Importance

Ouiame¹,

Hammouti²,

Bergou³

2021

Preprint

View full text Add to dashboard Cite

Federated learning (FL) enables multiple devices to collaboratively learn a global model without sharing their personal data. In real-world applications, the different parties are likely to have heterogeneous data distribution and limited communication bandwidth. In this paper, we are interested in improving the communication efficiency of FL systems. We investigate and design a device selection strategy based on the importance of the gradient norms. In particular, our approach consists of selecting devices with the highest norms of gradient values at each communication round. We study the convergence and the performance of such a selection technique and compare it to existing ones. We perform several experiments with non-iid set-up. The results show the convergence of our method with a considerable increase of test accuracy comparing to the random selection.

show abstract

Pretraining Client Selection Algorithm Based on a Data Distribution Evaluation Model in Federated Learning

Xu,

Liu,

et al. 2024

IEEE Access

View full text Add to dashboard Cite

Federated Learning (FL) allows task initiators (servers) to utilize data from task participants (clients) to train machine learning models while protecting data privacy. However, in the FL system, when the client data are non-independently identically distributed (Non-IID), appropriate metrics are chosen to accurately evaluate the quality of the client data, accordingly to select a reasonable subset of clients, and thus ensure the accuracy of the FL aggregation model. In this paper, based on the experimental results, a data distribution evaluation model is proposed, which is based on two metrics: the volume of client data and its increment and the balance of global client data. This data distribution evaluation model enables more accurate evaluation of clients with Non-IID characteristics. Based on this evaluation model, this paper further proposes an FL client subset selection algorithm. This algorithm accurately evaluates the data value of each client, enabling the server to select the most valuable subset of clients before FL training, thus improving the accuracy of the federated learning aggregation model in scenarios with Non-IID client data. When training the FL aggregation model using the proposed method on datasets composed of CIFAR-10, Fashion-MNIST, and DEAP distributions, compared to the optimal baseline, the average precision scores increased by 5.99%, 4.79%, and 4.29% respectively. The improvement in accuracy is more pronounced in scenarios with Non-IID data, such as in the DEAP dataset distribution with the highest Non-IID degree, where the accuracy increased by 5.30% compared to the optimal baseline.

show abstract

Communication-Efficient Federated Learning via Optimal Client Sampling

Cited by 18 publications

References 44 publications

Communication-Efficient Device Scheduling for Federated Learning Using Stochastic Optimization

Communication-Efficient Device Scheduling for Federated Learning Using Stochastic Optimization

Client Selection in Federated Learning based on Gradients Importance

Pretraining Client Selection Algorithm Based on a Data Distribution Evaluation Model in Federated Learning

Contact Info

Product

Resources

About