Coded Stochastic ADMM for Decentralized Consensus Optimization With Edge Computing

Chen, Hao; Yu, Ye; Xiao, Ming; Skoglund, Mikael; Poor, H. Vincent

doi:10.1109/jiot.2021.3058116

Cited by 14 publications

(10 citation statements)

References 34 publications


“…Meanwhile, it is inefficient (sometimes even infeasible) to transmit all data to a central node for analysis. For this reason, distributed machine learning (DML), which stores and processes all or parts of data in different nodes, has attracted significant research interest and applications [1, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16]. There are different methods of implementing DML, i.e., primal methods (e.g., distributed gradient descent [4, 7], federated learning [5, 6]) and primal–dual methods (e.g., alternating direction method of multipliers (ADMM)) [16].…”

Section: Background and Motivations (mentioning)

confidence: 99%


Coding for Large-Scale Distributed Machine Learning

Xiao, Skoglund. Entropy, 2022. Self Cite


This article aims to give a comprehensive and rigorous review of the principles and recent development of coding for large-scale distributed machine learning (DML). With increasing data volumes and the pervasive deployment of sensors and computing machines, machine learning has become more distributed. Moreover, the involved computing nodes and data volumes for learning tasks have also increased significantly. For large-scale distributed learning systems, significant challenges have appeared in terms of delay, errors, efficiency, etc. To address the problems, various error-control or performance-boosting schemes have been proposed recently for different aspects, such as the duplication of computing nodes. More recently, error-control coding has been investigated for DML to improve reliability and efficiency. The benefits of coding for DML include high-efficiency, low complexity, etc. Despite the benefits and recent progress, however, there is still a lack of comprehensive survey on this topic, especially for large-scale learning. This paper seeks to introduce the theories and algorithms of coding for DML. For primal-based DML schemes, we first discuss the gradient coding with the optimal code distance. Then, we introduce random coding for gradient-based DML. For primal–dual-based DML, i.e., ADMM (alternating direction method of multipliers), we propose a separate coding method for two steps of distributed optimization. Then coding schemes for different steps are discussed. Finally, a few potential directions for future works are also given.


“…For the global consensus, network coding can be used to reduce the communication loads and increase reliability. In [ 15 ], we preliminarily investigated how coding (MDS codes) can be used in local optimization (step (a)). A more detailed introduction is given as follows.…”

Section: Coding for ADMM (mentioning)

confidence: 99%
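To make the idea of MDS coding in a local computation step concrete, the following is a minimal sketch and not the scheme from the cited work. It assumes a toy (3, 2) single-parity code, which is MDS: two partial gradients are spread over three workers so that any two responses suffice. The function names `encode` and `decode` and the worker numbering are hypothetical.

```python
def encode(g1, g2):
    """Assign coded tasks to three workers using a (3, 2) single-parity code.

    Worker 0 returns g1, worker 1 returns g2, worker 2 returns g1 + g2.
    (Illustrative assumption; the cited paper's code construction may differ.)
    """
    parity = [a + b for a, b in zip(g1, g2)]
    return {0: list(g1), 1: list(g2), 2: parity}


def decode(received):
    """Recover (g1, g2) from the responses of any two of the three workers."""
    if len(received) < 2:
        raise ValueError("need responses from at least two workers")
    if 0 in received and 1 in received:
        return received[0], received[1]
    if 0 in received:
        # Worker 1 is the straggler: g2 = parity - g1.
        g1 = received[0]
        return g1, [p - a for p, a in zip(received[2], g1)]
    # Worker 0 is the straggler: g1 = parity - g2.
    g2 = received[1]
    return [p - b for p, b in zip(received[2], g2)], g2


coded = encode([1.0, 2.0], [3.0, 5.0])
# Pretend worker 0 is slow and only workers 1 and 2 respond.
g1, g2 = decode({1: coded[1], 2: coded[2]})
```

The design point is that the slowest worker no longer determines the finishing time of the step, at the cost of one redundant computation.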

Coding for Large-Scale Distributed Machine Learning

Xiao, Skoglund. Entropy, 2022. Self Cite


“…The global variable $z^{k+1}$ and gradient estimation $\mu^{k+1}$ get updated at agent $i_k$ and passed as tokens to its neighbour $i_{k+1}$ through Hamiltonian cycle. When $\{\eta^k = 0 \mid k = 1, 2, \ldots\}$, the algorithm reduces to the vanilla stochastic incremental ADMM (sI-ADMM) as in [45]. Comparing with sI-ADMM, asI-ADMM constructs stochastic gradient $\mu^{k+1}$ based on the information $G_{i_k}(\boldsymbol{\theta}^k_{i_k}; \boldsymbol{\zeta}^k_{i_k})$ and $\mu^k$, while sI-ADMM only considers the current mini-batch gradient.…”

Section: Contributions (mentioning)

confidence: 99%
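The token-passing pattern described in the statement above can be sketched as follows. This is an illustration under stated assumptions and not the asI-ADMM update itself: the blend `eta * mu + (1 - eta) * g` is an assumed form of the gradient estimate (chosen so that `eta = 0` keeps only the current mini-batch gradient), the quadratic local loss is a toy stand-in, and a plain gradient step replaces the ADMM primal and dual updates. All function names are hypothetical.

```python
import random


def momentum_estimate(mu_prev, grad, eta):
    # Assumed blend: with eta = 0 only the current mini-batch gradient remains.
    return eta * mu_prev + (1.0 - eta) * grad


def minibatch_grad(z, data, batch_size, rng):
    # Gradient of the toy local loss 0.5 * (z - x)^2, averaged over a sampled mini-batch.
    batch = rng.sample(data, batch_size)
    return sum(z - x for x in batch) / batch_size


def run_token_ring(local_data, eta, steps, lr=0.1, batch_size=2, seed=0):
    """Pass the token (z, mu) around the agents in a fixed cyclic order."""
    rng = random.Random(seed)
    z, mu = 0.0, 0.0
    n = len(local_data)
    for k in range(steps):
        i_k = k % n  # agent currently holding the token
        g = minibatch_grad(z, local_data[i_k], batch_size, rng)
        mu = momentum_estimate(mu, g, eta)
        z -= lr * mu  # stand-in for the actual primal update
    return z


z_plain = run_token_ring([[1.0, 1.0], [1.0, 1.0]], eta=0.0, steps=200)
z_momentum = run_token_ring([[1.0, 1.0], [1.0, 1.0]], eta=0.5, steps=200)
```

Only one agent is active per iteration, so each step costs a single neighbour-to-neighbour transmission of the token.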

Adaptive Stochastic ADMM for Decentralized Reinforcement Learning in Edge Industrial IoT

Lei, Ye, Xiao, et al. Preprint, 2021. Self Cite


“…Distributed optimization algorithms have two main classes: distributed primal algorithm [4]-[7] and distributed primal-dual algorithm [8]-[12]. In [4], the authors proposed fast distributed gradient algorithms to minimize the sum of individual cost function.…”

Section: Introduction (mentioning)

confidence: 99%

“…A variant ADMM algorithm was proposed in [11], which has less communication overhead but the same convergence rate as standard ADMM. To further reduce the communication overhead, the authors in [12] investigated coding for stochastic incremental distributed primal-dual algorithm. However, the above distributed primal-dual works [8]-[12] all ignored the effect of wireless factors (such as transmission error) when implementing distributed primal-dual algorithm over wireless communications.…”

Section: Introduction (mentioning)

confidence: 99%

Performance Optimization of Distributed Primal-Dual Algorithms over Wireless Networks

Yang, Chen, Wong, et al. ICC 2021 - IEEE International Conference on Communications, 2021. Self Cite


In this paper, the problem of convergence rate optimization for distributed primal-dual algorithm over wireless communications is investigated. In the considered model, each user locally updates the primal and dual variables, which are uploaded to the base station (BS). The BS aggregates the data from the users and broadcasts the aggregated value to all users. This resource allocation problem is formulated as an optimization problem whose goal is to minimize the gap between the optimal value and the obtained value after a fixed number of iterations in distributed primal-dual algorithm. To solve this problem, the convergence rate is obtained in closed form for the primal-dual algorithm while considering the impact of wireless factors. Based on this convergence rate, the optimal condition for the power control and resource block allocation is obtained. An iterative algorithm with low complexity is proposed to solve this joint power control and resource block allocation problem. Simulation results show that the proposed algorithm can achieve better compared to baseline methods.
