2019
DOI: 10.48550/arxiv.1906.08399
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Sequential Experimental Design for Transductive Linear Bandits

Abstract: In this paper we introduce the transductive linear bandit problem: given a set of measurement vectors X ⊂ R d , a set of items Z ⊂ R d , a fixed confidence δ, and an unknown vector θ * ∈ R d , the goal is to infer argmax z∈Z z θ * with probability 1 − δ by making as few sequentially chosen noisy measurements of the form x θ * as possible. When X = Z, this setting generalizes linear bandits, and when X is the standard basis vectors and Z ⊂ {0, 1} d , combinatorial bandits. Such a transductive setting naturally … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
15
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
5

Relationship

2
3

Authors

Journals

citations
Cited by 5 publications
(15 citation statements)
references
References 10 publications
0
15
0
Order By: Relevance
“…Note that this is precisely the number of labels required in the pool-based setting where the agent can choose any x ∈ X that she desires at each time t (e.g. [11]). In the other extreme, E[U] = ρ(ν) log(1/δ) so that the constraint in the label complexity E[L] is equivalent to ρ(ν) ≥ λ/ν ∞ ρ(λ).…”
Section: Resultsmentioning
confidence: 99%
See 4 more Smart Citations
“…Note that this is precisely the number of labels required in the pool-based setting where the agent can choose any x ∈ X that she desires at each time t (e.g. [11]). In the other extreme, E[U] = ρ(ν) log(1/δ) so that the constraint in the label complexity E[L] is equivalent to ρ(ν) ≥ λ/ν ∞ ρ(λ).…”
Section: Resultsmentioning
confidence: 99%
“…The proof of Theorem 1 relies on standard techniques from best arm identification lower bounds (see e.g. [17,11]).…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations