2017
DOI: 10.48550/arxiv.1704.04567
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Asynchronous Parallel Empirical Variance Guided Algorithms for the Thresholding Bandit Problem

Jie Zhong,
Yijun Huang,
Ji Liu

Abstract: This paper considers the multi-armed thresholding bandit problem -identifying all arms whose expected rewards are above a predefined threshold via as few pulls (or rounds) as possibleproposed by Locatelli et al. (2016) recently. Although the proposed algorithm in Locatelli et al. (2016) achieves the optimal round complexity 1 in a certain sense, there still remain unsolved issues. This paper proposes an asynchronous parallel thresholding algorithm and its parameterfree version to improve the efficiency and the… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2018
2018
2020
2020

Publication Types

Select...
3

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 12 publications
0
3
0
Order By: Relevance
“…This variant of the multi-armed bandit problem was introduced by Locatelli et al (2016), who provided an algorithm for solving the problem with matching upper and lower bounds. Mukherjee et al (2017) and Zhong et al (2017) have since provided algorithmic extensions that incorporate variance estimates and provide guarantees in asynchronous settings.…”
Section: Thresholding Banditsmentioning
confidence: 99%
“…This variant of the multi-armed bandit problem was introduced by Locatelli et al (2016), who provided an algorithm for solving the problem with matching upper and lower bounds. Mukherjee et al (2017) and Zhong et al (2017) have since provided algorithmic extensions that incorporate variance estimates and provide guarantees in asynchronous settings.…”
Section: Thresholding Banditsmentioning
confidence: 99%
“…To improve the efficiency of bandit algorithms, multiple agents can be employed and they can perform simultaneous investigation. Zhong et al [2017] designed an asynchronous parallel bandit algorithm that allows multiple agents working in parallel without waiting for each other. Both theoretical analysis and empirical studies validate that the nearly linear speedup can be achieved.…”
Section: Parallelization Bandit Algorithmsmentioning
confidence: 99%
“…Even-Dar et al [2002], Chen and Li [2015], Simchowitz et al [2017], Garivier and Kaufmann [2016] for papers in the related best arm identification and TOP-M setting 1 in the fixed confidence case. The fixed budget version of TBP was studied in Chen et al [2014], , Mukherjee et al [2017], Zhong et al [2017] -but also see e.g. , Audibert and Bubeck [2010], Gabillon et al [2012], for papers in the related best arm identification and TOP-M setting in the fixed budget case.…”
Section: Introductionmentioning
confidence: 99%