2020
DOI: 10.48550/arxiv.2010.15195
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Reinforcement Learning for Sparse-Reward Object-Interaction Tasks in a First-person Simulated 3D Environment

Abstract: First-person object-interaction tasks in high-fidelity, 3D, simulated environments such as the AI2Thor virtual home-environment pose significant sample-efficiency challenges for reinforcement learning (RL) agents learning from sparse task rewards. To alleviate these challenges, prior work has provided extensive supervision via a combination of reward-shaping, ground-truth object-information, and expert demonstrations. In this work, we show that one can learn object-interaction tasks from scratch without superv… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 13 publications
0
3
0
Order By: Relevance
“…Our work lies along this line of research. Other approaches have considered using contrastive losses (Kipf et al, 2019;Carvalho et al, 2020) while some works have explored unsupervised object-centric learning in 3D scenes (Chen et al, 2021;Crawford & Pineau, 2020;Stelzner et al, 2021;Du et al, 2021b;Kabra et al, 2021). While dealing with 3D scenes would be an important future direction for our work, it is orthogonal to our current focus.…”
Section: Related Workmentioning
confidence: 99%
“…Our work lies along this line of research. Other approaches have considered using contrastive losses (Kipf et al, 2019;Carvalho et al, 2020) while some works have explored unsupervised object-centric learning in 3D scenes (Chen et al, 2021;Crawford & Pineau, 2020;Stelzner et al, 2021;Du et al, 2021b;Kabra et al, 2021). While dealing with 3D scenes would be an important future direction for our work, it is orthogonal to our current focus.…”
Section: Related Workmentioning
confidence: 99%
“…vision-language navigation (Anderson et al 2018) and object interaction (Shridhar et al 2020;Carvalho et al 2020;Corona et al 2020;Blukis et al 2022;Min et al 2021) in embodied settings. In particular, the AL-FRED task (Shridhar et al 2020) studies agents that exploit detailed natural language instructions to generalize to novel instructions in novel environments at test time.…”
Section: … … Look Down Pickup Knife Turn Left Slice Applementioning
confidence: 99%
“…Real world tasks are also compositional (Carvalho et al 2020;Loula, Baroni, and Lake 2018;Andreas, Klein, and Levine 2017;Oh et al 2017). Compositional tasks are often made of different "components" that can recombined to form new tasks.…”
Section: Introductionmentioning
confidence: 99%