2021
DOI: 10.48550/arxiv.2109.13588
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Making Curiosity Explicit in Vision-based RL

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
2

Relationship

2
0

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 0 publications
0
2
0
Order By: Relevance
“…Our approach takes advantage of the offpolicy property of most state-of-the-art RL algorithms and trains a separate curious policy based on the SRL error. A preliminary version of this work can be found in [3]. Our experiments show that the proposed method encourages the visitation of SRL-problematic states.…”
Section: Introductionmentioning
confidence: 98%
“…Our approach takes advantage of the offpolicy property of most state-of-the-art RL algorithms and trains a separate curious policy based on the SRL error. A preliminary version of this work can be found in [3]. Our experiments show that the proposed method encourages the visitation of SRL-problematic states.…”
Section: Introductionmentioning
confidence: 98%
“…Sample inefficiency is most commonly attributed to the complexity and noise encountered in sensory information processing. Solutions to the problem range from including pretrained perception modules [12] in the learning pipeline to integrating self-supervised state representation learning objectives into task learning [13,10,4,14]. Safe exploration and environment resetting are rarely mentioned in publications and temporary solutions include engineering the environment or having a human manually stop the robot in dangerous situations and reset the environment at the end of each trial.…”
Section: Introductionmentioning
confidence: 99%