2021
DOI: 10.48550/arxiv.2110.04647
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Learning to Follow Language Instructions with Compositional Policies

Abstract: We propose a framework that learns to execute natural language instructions in an environment consisting of goalreaching tasks that share components of their task descriptions. Our approach leverages the compositionality of both value functions and language, with the aim of reducing the sample complexity of learning novel tasks. First, we train a reinforcement learning agent to learn value functions that can be subsequently composed through a Boolean algebra to solve novel tasks. Second, we fine-tune a seq2seq… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 17 publications
(20 reference statements)
0
1
0
Order By: Relevance
“…Hence, Hupkes et al (2020) discusses this subject in depth by presenting a set of definitions and tests that is grounded on a vast amount of linguistic and philosophical literature, using probabilistic contextfree grammar datasets. Another very good example can also be found in visual recognition (Misra et al, 2017;Wang et al, 2019b;Naeem et al, 2021;Purushwalkam et al, 2019;Logeswaran et al, 2021;Cohen et al, 2021;Nayak et al, 2022). Here, if a model understands the meaning of the phrases "grey elephant" and "blue bottle", they test if it also generalizes to new vision-language concepts like "blue elephant".…”
Section: Semantic Compositionsmentioning
confidence: 93%
“…Hence, Hupkes et al (2020) discusses this subject in depth by presenting a set of definitions and tests that is grounded on a vast amount of linguistic and philosophical literature, using probabilistic contextfree grammar datasets. Another very good example can also be found in visual recognition (Misra et al, 2017;Wang et al, 2019b;Naeem et al, 2021;Purushwalkam et al, 2019;Logeswaran et al, 2021;Cohen et al, 2021;Nayak et al, 2022). Here, if a model understands the meaning of the phrases "grey elephant" and "blue bottle", they test if it also generalizes to new vision-language concepts like "blue elephant".…”
Section: Semantic Compositionsmentioning
confidence: 93%