2023
DOI: 10.3233/faia230271
|View full text |Cite
|
Sign up to set email alerts
|

Cooperative Thresholded Lasso for Sparse Linear Bandit

Hanieyh Barghi,
Xiaotong Cheng,
Setareh Maghsudi

Abstract: We present a novel approach to address the multi-agent sparse contextual linear bandit problem, in which the feature vectors have a high dimension d whereas the reward function depends on only a limited set of features - precisely s0 ≪ d. Furthermore, the learning follows under information-sharing constraints. The proposed method employs Lasso regression for dimension reduction, allowing each agent to independently estimate an approximate set of main dimensions and share that information with others depending … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
3
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(3 citation statements)
references
References 16 publications
0
3
0
Order By: Relevance
“…Thresholded lasso bandit (Ariu, Abe, and Proutière 2022) The thresholded lasso bandit estimates the support of β * each round and a greedy arm selection policy is performed by the inner product of the arm features and the estimated parameter for β * on this support. The relaxed symmetry and the balanced covariance are introduced to ensure proper support estimation under the greedy policy.…”
Section: Application To Several Algorithmsmentioning
confidence: 99%
See 2 more Smart Citations
“…Thresholded lasso bandit (Ariu, Abe, and Proutière 2022) The thresholded lasso bandit estimates the support of β * each round and a greedy arm selection policy is performed by the inner product of the arm features and the estimated parameter for β * on this support. The relaxed symmetry and the balanced covariance are introduced to ensure proper support estimation under the greedy policy.…”
Section: Application To Several Algorithmsmentioning
confidence: 99%
“…These two generalizations provide theoretical guarantees of the greedy policy for a very wide range of the arm feature distributions. Moreover, we demonstrate the usefulness of our analysis by applying it to the other cases: thresholded lasso bandit (Ariu, Abe, and Proutière 2022), combinatorial setting, and non-sparse setting (Bastani and Bayati 2020).…”
Section: Introductionmentioning
confidence: 98%
See 1 more Smart Citation