2019
DOI: 10.48550/arxiv.1912.01261
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Continuous Online Learning and New Insights to Online Imitation Learning

Abstract: Online learning is a powerful tool for analyzing iterative algorithms. However, the classic adversarial setup sometimes fails to capture certain regularity in online problems in practice. Motivated by this, we establish a new setup, called Continuous Online Learning (COL), where the gradient of online loss function changes continuously across rounds with respect to the learner's decisions. We show that COL covers and more appropriately describes many interesting applications, from general equilibrium problems … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 12 publications
0
2
0
Order By: Relevance
“…, a N , s N }, we aim to learn a policy π * that best matches the demonstration trajectories. Note that we focus on the offline imitation setting as employed in [3], [7], where a set of demonstrations are provided ahead of time instead of gradually incrementing our dataset as in online imitation learning [33].…”
Section: Problem Statementmentioning
confidence: 99%
“…, a N , s N }, we aim to learn a policy π * that best matches the demonstration trajectories. Note that we focus on the offline imitation setting as employed in [3], [7], where a set of demonstrations are provided ahead of time instead of gradually incrementing our dataset as in online imitation learning [33].…”
Section: Problem Statementmentioning
confidence: 99%
“…The interactions of these properties make the classic adversary-style online learning analysis taken by Ross et al [2] overly conservative, creating a mismatch between provable theoretical guarantees and the learning phenomena observed in practice. This reality gap has motivated researchers to study deeper the theoretical underpinnings of OPO [12][13][14].…”
Section: Introductionmentioning
confidence: 99%