2023
DOI: 10.48550/arxiv.2302.03319
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Leveraging Demonstrations to Improve Online Learning: Quality Matters

Abstract: We investigate the extent to which offline demonstration data can improve online learning. While it is natural to expect some improvement, we show that the degree of improvement depends on the quality of demonstrations. To generate portable insights, we focus on Thompson sampling (TS) applied to a multi-armed bandit as a prototypical online learning algorithm and model. The offline demonstration data is generated by an expert with a given competence level, a notion we introduce. We propose an informed TS algor… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 17 publications
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?