Asynchronous Upper Confidence Bound Algorithms for Federated Linear Bandits

Li, Chuanhao; Wang, Hongning

doi:10.48550/arxiv.2110.01463

Cited by 2 publications

(7 citation statements)

References 15 publications


“…The key challenge of the algorithm design is to manage the data inconsistency during user clustering. Existing matrix determinant-based protocols (Li and Wang 2021; Liu et al. 2022a; He et al. 2022) fail to achieve this goal due to insufficient communication, so we propose a novel communication protocol that employs a p_t-auxiliary protocol in conjunction with the matrix determinant-based protocol. This new protocol effectively controls data inconsistency, ensuring the correct operation of heterogeneity testing and action selection.…”

Section: Our Contributions (mentioning)

confidence: 99%

Federated Contextual Cascading Bandits with Asynchronous Communication and Heterogeneous Users

Yang, Liu, Wang et al. 2024, AAAI


We study the problem of federated contextual combinatorial cascading bandits, where agents collaborate under the coordination of a central server to provide tailored recommendations to users. Existing works consider either a synchronous framework, necessitating full agent participation and global synchronization, or assume user homogeneity with identical behaviors. We overcome these limitations by considering (1) federated agents operating in an asynchronous communication paradigm, where no mandatory synchronization is required and all agents communicate independently with the server, (2) heterogeneous user behaviors, where users can be stratified into latent user clusters, each exhibiting distinct preferences. For this setting, we propose a UCB-type algorithm with delicate communication protocols. Through theoretical analysis, we give sub-linear regret bounds on par with those achieved in the synchronous framework, while incurring only logarithmic communication costs. Empirical evaluation on synthetic and real-world datasets validates our algorithm's superior performance in terms of regrets and communication costs.


“…GLB under federated/distributed setting still remains under-explored. The most related works in bandit literature are the federated/distributed linear bandits (Korda et al., 2016; Dubey and Pentland, 2020; Huang et al., 2021; Li and Wang, 2021). In these works, thanks to the existence of closed-form solution for linear models, the clients only communicate their local sufficient statistics for global model update.…”

Section: Related Work (mentioning)

confidence: 99%
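
The point about sufficient statistics can be illustrated with a small sketch. For ridge regression, the estimate depends on the data only through the sums of x xᵀ and x·r, so a server that adds up each client's sums recovers the same estimate as if it had pooled the raw data. The function names, the regularizer `lam`, and the synthetic data below are assumptions made for illustration; they are not taken from the cited papers.

```python
import numpy as np

def local_stats(X, r):
    """Sufficient statistics a client would upload: (X^T X, X^T r)."""
    return X.T @ X, X.T @ r

def global_estimate(stats, lam=1.0):
    """Server-side ridge estimate built only from the uploaded statistics."""
    d = stats[0][0].shape[0]
    A = lam * np.eye(d) + sum(a for a, _ in stats)
    b = sum(v for _, v in stats)
    return np.linalg.solve(A, b)

# Hypothetical setup: 4 clients, 50 noisy linear observations each.
rng = np.random.default_rng(0)
theta_star = np.array([0.5, -0.2, 0.1])
clients = []
for _ in range(4):
    X = rng.normal(size=(50, 3))
    r = X @ theta_star + 0.01 * rng.normal(size=50)
    clients.append((X, r))

# Federated estimate: raw (X, r) never leaves a client, only the statistics do.
theta_fed = global_estimate([local_stats(X, r) for X, r in clients])

# Centralized reference computed on the pooled raw data.
X_all = np.vstack([X for X, _ in clients])
r_all = np.concatenate([r for _, r in clients])
theta_central = np.linalg.solve(np.eye(3) + X_all.T @ X_all, X_all.T @ r_all)
```

In this sketch `theta_fed` and `theta_central` agree up to floating-point error, which is why aggregation costs nothing in accuracy for linear models.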

“…Huang et al. (2021) considered a star-shaped communication network as in our paper, but they assumed a fixed arm set setting and thus proposed a phase-based elimination algorithm. The closest works to ours are (Dubey and Pentland, 2020; Li and Wang, 2021), which proposed event-triggered communication protocols to obtain sub-linear communication cost over time for federated linear bandit with a time-varying arm set.…”

Section: Related Work (mentioning)

confidence: 99%
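
An event-triggered protocol of the determinant-based kind can be sketched as follows. A client keeps accumulating local covariance updates and contacts the server only once its covariance matrix has grown enough, measured by a determinant ratio, since the last synchronization. This is a minimal sketch of one common form of the trigger; the threshold `gamma`, the function name, and the toy data are assumptions and do not reproduce the exact condition of any cited paper.

```python
import numpy as np

def should_communicate(V_synced, delta_V, gamma):
    """True when det(V_synced + delta_V) / det(V_synced) exceeds gamma.

    Log-determinants are compared instead of raw determinants for
    numerical stability.
    """
    _, logdet_new = np.linalg.slogdet(V_synced + delta_V)
    _, logdet_old = np.linalg.slogdet(V_synced)
    return logdet_new - logdet_old > np.log(gamma)

# Hypothetical client that repeatedly observes the same context x.
d = 2
V_synced = np.eye(d)          # covariance as of the last synchronization
delta_V = np.zeros((d, d))    # local updates not yet uploaded
x = np.array([1.0, 0.0])

rounds_until_upload = 0
while not should_communicate(V_synced, delta_V, gamma=2.5):
    delta_V += np.outer(x, x)  # rank-one update from one new observation
    rounds_until_upload += 1
```

Because the determinant grows slowly once a direction has been explored, a trigger of this shape fires less and less often over time, which is the intuition behind sub-linear communication cost.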

“…As a classic model for sequential decision making problems, contextual bandit has been widely used for a variety of real-world applications, including recommender systems (Li et al., 2010a), display advertisement (Li et al., 2010b) and clinical trials (Durand et al., 2018). While most existing bandit solutions are designed under a centralized setting (i.e., data is readily available at a central server), in response to the increasing application scale and public concerns of privacy, there is increasing research effort on federated bandit learning lately (Dubey and Pentland, 2020; Shi et al., 2021; Huang et al., 2021; Li and Wang, 2021), where N clients collaborate with limited communication bandwidth to minimize the overall cumulative regret incurred over a finite time horizon T, while keeping each client's raw data local. Compared with standard federated learning (McMahan et al., 2017; Kairouz et al., 2019) that works with fixed datasets, federated bandit learning is characterized by its online interactions with the environment, which continuously provides new data samples to the clients over time.…”

Section: Introduction (mentioning)

confidence: 99%
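
For reference, the cumulative regret mentioned in this statement is commonly written as below for the linear case. N and T come from the quoted text; the unknown parameter θ*, the arm set A_{t,i} offered to client i at round t, and the chosen arm x_{t,i} are notation assumed here for illustration, and individual papers may index interactions differently.

```latex
R_T \;=\; \sum_{t=1}^{T} \sum_{i=1}^{N}
  \Bigl( \max_{x \in \mathcal{A}_{t,i}} x^{\top}\theta^{*}
         \;-\; x_{t,i}^{\top}\theta^{*} \Bigr)
```

A regret bound is called sub-linear when R_T grows more slowly than the total number of interactions, so the average per-round loss vanishes.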

“…Existing federated bandit learning solutions only partially addressed this challenge by considering simple bandit models, like context-free bandit (Shi et al., 2021) and contextual linear bandit (Dubey and Pentland, 2020; Li and Wang, 2021), where closed-form solution for both local and global model update exists. Therefore, efficient communication for global bandit model update is realized by directly aggregating local sufficient statistics, such that the only concern left is how to control the communication frequency over time horizon T.…”

Section: Introduction (mentioning)

confidence: 99%


Communication Efficient Federated Learning for Generalized Linear Bandits

Li, Wang 2022, Preprint

Self Cite


Contextual bandit algorithms have been recently studied under the federated learning setting to satisfy the demand of keeping data decentralized and pushing the learning of bandit models to the client side. But limited by the required communication efficiency, existing solutions are restricted to linear models to exploit their closed-form solutions for parameter estimation. Such a restricted model choice greatly hampers these algorithms' practical utility. In this paper, we take the first step to addressing this challenge by studying generalized linear bandit models under a federated learning setting. We propose a communication-efficient solution framework that employs online regression for local update and offline regression for global update. We rigorously proved that, though the setting is more general and challenging, our algorithm can attain sub-linear rate in both regret and communication cost, which is also validated by our extensive empirical evaluations.
