2021
DOI: 10.48550/arxiv.2110.01463
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Asynchronous Upper Confidence Bound Algorithms for Federated Linear Bandits

Chuanhao Li,
Hongning Wang

Abstract: Linear contextual bandit is a popular online learning problem. It has been mostly studied in centralized learning settings. With the surging demand of large-scale decentralized model learning, e.g., federated learning, how to retain regret minimization while reducing communication cost becomes an open challenge. In this paper, we study linear contextual bandit in a federated learning setting. We propose a general framework with asynchronous model update and communication for a collection of homogeneous clients… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
7
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(7 citation statements)
references
References 15 publications
0
7
0
Order By: Relevance
“…The key challenge of the algorithm design is to manage the data inconsistency during user clustering. Existing matrix determinant-based protocols (Li and Wang 2021;Liu et al 2022a;He et al 2022) fail to achieve this goal due to insufficient communication, so we propose a novel communication protocol that employs a p t -auxiliary protocol in conjunction with the matrix determinant-based protocol. This new protocol effectively controls data inconsistency, ensuring the correct operation of heterogeneity testing and action selection.…”
Section: Our Contributionsmentioning
confidence: 99%
“…The key challenge of the algorithm design is to manage the data inconsistency during user clustering. Existing matrix determinant-based protocols (Li and Wang 2021;Liu et al 2022a;He et al 2022) fail to achieve this goal due to insufficient communication, so we propose a novel communication protocol that employs a p t -auxiliary protocol in conjunction with the matrix determinant-based protocol. This new protocol effectively controls data inconsistency, ensuring the correct operation of heterogeneity testing and action selection.…”
Section: Our Contributionsmentioning
confidence: 99%
“…GLB under federated/distributed setting still remains under-explored. The most related works in bandit literature are the federated/distributed linear bandits (Korda et al, 2016;Dubey and Pentland, 2020;Huang et al, 2021;Li and Wang, 2021). In these works, thanks to the existence of closed-form solution for linear models, the clients only communicate their local sufficient statistics for global model update.…”
Section: Related Workmentioning
confidence: 99%
“…Huang et al (2021) considered a star-shaped communication network as in our paper, but they assumed a fixed arm set setting and thus proposed a phase-based elimination algorithm. The closest works to ours are Dubey and Pentland, 2020;Li and Wang, 2021), which proposed event-triggered communication protocols to obtain sub-linear communication cost over time for federated linear bandit with a time-varying arm set.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations