2016
DOI: 10.48550/arxiv.1611.06824
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Options Discovery with Budgeted Reinforcement Learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2019
2019
2019
2019

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 0 publications
0
1
0
Order By: Relevance
“…Pivotal state discovery is related to unsupervised sequence segmentation [17,47,76,13,16] and option or sub-task discovery in the context of RL [48,6,72,30,56,53,52]. Among them, both [52] and [76] employ sequential variational models to infer either task boundaries or key frames of demonstrations in an unsupervised manner, followed by applications to hierarchical RL or planning.…”
Section: Related Workmentioning
confidence: 99%
“…Pivotal state discovery is related to unsupervised sequence segmentation [17,47,76,13,16] and option or sub-task discovery in the context of RL [48,6,72,30,56,53,52]. Among them, both [52] and [76] employ sequential variational models to infer either task boundaries or key frames of demonstrations in an unsupervised manner, followed by applications to hierarchical RL or planning.…”
Section: Related Workmentioning
confidence: 99%