2022
DOI: 10.48550/arxiv.2205.10895
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Contextual Information-Directed Sampling

Abstract: Information-directed sampling (IDS) has recently demonstrated its potential as a dataefficient reinforcement learning algorithm (Lu et al., 2021). However, it is still unclear what is the right form of information ratio to optimize when contextual information is available. We investigate the IDS design through two contextual bandit problems: contextual bandits with graph feedback and sparse linear contextual bandits. We provably demonstrate the advantage of contextual IDS over conditional IDS and emphasize the… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 12 publications
0
1
0
Order By: Relevance
“…An information ratio is the ratio between the squared instantaneous regret and an information gain. Various information ratios have been proposed in the literature [4,6,9,11,13,17,19,20,21].…”
Section: An Information Ratiomentioning
confidence: 99%
“…An information ratio is the ratio between the squared instantaneous regret and an information gain. Various information ratios have been proposed in the literature [4,6,9,11,13,17,19,20,21].…”
Section: An Information Ratiomentioning
confidence: 99%