2021
DOI: 10.1609/aaai.v35i11.17156
Federated Multi-Armed Bandits

Abstract: Federated multi-armed bandits (FMAB) is a new bandit paradigm that parallels the federated learning (FL) framework in supervised learning. It is inspired by practical applications in cognitive radio and recommender systems, and enjoys features that are analogous to FL. This paper proposes a general framework of FMAB and then studies two specific federated bandit models. We first study the approximate model where the heterogeneous local models are random realizations of the global model from an unknown distribu…
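The abstract's "approximate model" can be pictured with a small simulation: each client's local arm means are random draws around unknown global means, and a server that averages the clients' local sample means can recover the globally best arm. The sketch below is only an illustration of that setting under assumed values (K, M, the noise level, and pulls_per_arm are all hypothetical); it uses plain uniform exploration plus averaging, not the algorithms the paper actually proposes.

```python
# Illustrative sketch (not the paper's algorithm): the "approximate" federated
# MAB setting, where each client's local arm means are random realizations of
# unknown global means and a server aggregates local estimates.
import numpy as np

rng = np.random.default_rng(0)

K, M = 5, 10                                  # arms, clients (hypothetical sizes)
global_means = rng.uniform(0, 1, K)           # unknown global model
local_means = global_means + rng.normal(0, 0.1, (M, K))  # local realizations

pulls_per_arm = 200
local_estimates = np.zeros((M, K))
for m in range(M):
    for k in range(K):
        # each client pulls every local arm and records its sample mean
        rewards = rng.normal(local_means[m, k], 1.0, pulls_per_arm)
        local_estimates[m, k] = rewards.mean()

# the server averages the clients' local estimates to approximate the global model
global_estimate = local_estimates.mean(axis=0)
print("true global best arm:     ", global_means.argmax())
print("estimated global best arm:", global_estimate.argmax())
```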

Cited by 41 publications (30 citation statements)
References 15 publications
“…Since most of the MAB algorithms discussed in this paper are recent [143], [144], [147], [149], it remains interesting to see their implications on practical applications, for example, quantifying the effect of bounded communication resources or energy used in wearable devices and congestion between edge nodes. Similarly, quantifying the improvement in regret bounds on actual and Quality of Experience (QoE) metrics can be promising.…”
Section: Communication-efficient Multi-agent Reinforcement Learning 1… (mentioning; confidence: 99%)
“…Furthermore, privacy concerns sometimes limit the ability of these local servers to share data with other servers. The work in [149] studies the case of a set of servers that run a recommender system for their prospective clients. The goal of each one is to recommend the most popular content across all servers.…”
(mentioning; confidence: 99%)
“…Therefore, there has been much research interest in developing new FL algorithms that exhibit superior convergence (in terms of communication rounds) compared to FedAvg in the face of non-IID client data. Although most works consider training Deep Neural Networks (DNNs) in a round-based synchronous fashion, some works propose asynchronous algorithms to reduce training time [10], [11], and for other models such as Random Forests [12] and Multi-Armed Bandits [13].…”
[Comparison table interleaved in the extracted quote, reconstructed in brief: FedAvg [2] averages client models, O(|x|); FedProx [6] adds a proximal term to local objectives, O(2|x|); FedMAX [7] adds a max-entropy term to local objectives, O(|x|); AdaptiveFedOpt [3] uses a server-only optimiser, O(|x|); MFL [8] averages client models and optimisers, O(|x| + |s|); Mimelite [9] uses an unbiased global optimiser.]
Section: Introduction (mentioning; confidence: 99%)
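The quoted survey contrasts FedAvg, which simply averages client models, with variants such as FedProx and MFL. As a rough illustration of that averaging step alone, here is a minimal sketch, assuming client models are plain NumPy vectors and aggregation weights are proportional to local data size; it is not code from the cited works.

```python
# Minimal sketch of FedAvg-style aggregation: the server replaces the global
# model with a data-size-weighted average of the client models. Names, shapes,
# and values are illustrative assumptions.
import numpy as np

def fedavg_aggregate(client_models, client_sizes):
    """Weighted average of client model vectors (one aggregation round)."""
    weights = np.asarray(client_sizes, dtype=float)
    weights /= weights.sum()
    stacked = np.stack(client_models)          # shape: (num_clients, dim)
    return weights @ stacked                   # weighted sum over clients

# toy usage: three clients with different amounts of local data
clients = [np.array([1.0, 2.0]), np.array([3.0, 0.0]), np.array([0.0, 1.0])]
sizes = [10, 30, 60]
print(fedavg_aggregate(clients, sizes))        # -> weighted-average model
```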
“…algorithm [15], which is inspired by the principle of federated learning: training a model across multiple decentralized agents and merging the locally trained models in a communication-efficient and data-private way. Another related challenge to federated MAB is adversarial users who could attack the system by sharing malicious local models, which calls for designing defense mechanisms to protect the global model.…”
(mentioning; confidence: 99%)
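The last quote notes that adversarial users could poison a federated bandit by sharing malicious local models. One generic defense idea, not taken from the cited papers, is robust aggregation, for example a coordinate-wise median instead of a mean; the sketch below shows the difference on toy model vectors, with all names and values hypothetical.

```python
# Hedged sketch of a generic defense against malicious local models:
# coordinate-wise median aggregation limits the influence of a few corrupted
# clients compared with plain averaging. A standard robust-aggregation idea,
# not a mechanism described in the papers above.
import numpy as np

def median_aggregate(client_models):
    """Coordinate-wise median of client model vectors."""
    return np.median(np.stack(client_models), axis=0)

# toy usage: two honest clients and one attacker sending an extreme model
honest = [np.array([0.9, 0.1]), np.array([1.1, -0.1])]
attacker = [np.array([100.0, -100.0])]
print(np.mean(np.stack(honest + attacker), axis=0))   # mean is dragged away
print(median_aggregate(honest + attacker))            # median stays close
```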