2021
DOI: 10.48550/arxiv.2102.07024
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Interactive Learning from Activity Description

Abstract: We present a novel interactive learning protocol that enables training request-fulfilling agents by verbally describing their activities. Our protocol gives rise to a new family of interactive learning algorithms that offer complementary advantages against traditional algorithms like imitation learning (IL) and reinforcement learning (RL). We develop an algorithm that practically implements this protocol and employ it to train agents in two challenging request-fulfilling problems using purely language-descript… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 29 publications
0
2
0
Order By: Relevance
“…Rather than provide prescriptive advice, we explored having the coach provide advice by relabeling an agent's trajectory with the goal it achieved. We can then train this now-successful relabeled trajectory using supervised learning, as was done in [36]. However, we found that hindsight relabeling performed poorly, except on the simplest environments.…”
Section: J2 Hindsight Relabelingmentioning
confidence: 99%
“…Rather than provide prescriptive advice, we explored having the coach provide advice by relabeling an agent's trajectory with the goal it achieved. We can then train this now-successful relabeled trajectory using supervised learning, as was done in [36]. However, we found that hindsight relabeling performed poorly, except on the simplest environments.…”
Section: J2 Hindsight Relabelingmentioning
confidence: 99%
“…Moving to autotelic agents, similar processes could be transposed to infer a goal rather than a reward function. A few recent works start addressing this issue by captioning the goal of a demonstration through natural language and a goal generator Zhou & Small (2020); Nguyen et al (2021). We expect follow-up of such works to contribute to answering the key question of the nature of the information conveyed by the tutor through the task channel MacGlashan & Littman (2015); Ho et al (2016;.…”
Section: Inference From Goal-related Signalsmentioning
confidence: 99%