2022
DOI: 10.1007/978-3-031-15931-2_21
|View full text |Cite
|
Sign up to set email alerts
|

Learning Flexible Translation Between Robot Actions and Language Descriptions

Abstract: Handling various robot action-language translation tasks flexibly is an essential requirement for natural interaction between a robot and a human. Previous approaches require change in the configuration of the model architecture per task during inference, which undermines the premise of multi-task learning. In this work, we propose the paired gated autoencoders (PGAE) for flexible translation between robot actions and language descriptions in a tabletop object manipulation scenario. We train our model in an en… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 15 publications
0
1
0
Order By: Relevance
“…Different from other approaches, our previous Paired Gated Autoencoders (PGAE) model ( Özdemir, Kerzel, Weber, Lee, and Wermter 2022) can bidirectionally translate between language and action, which enables an agent not only to execute actions according to given instructions but also to recognize and verbalize its own actions or actions executed by another agent. As the desired translation task is communicated to the network through an additional signal word in the language input, PGAE can flexibly translate between and within modalities during inference.…”
Section: Slide Blue Quicklymentioning
confidence: 99%
“…Different from other approaches, our previous Paired Gated Autoencoders (PGAE) model ( Özdemir, Kerzel, Weber, Lee, and Wermter 2022) can bidirectionally translate between language and action, which enables an agent not only to execute actions according to given instructions but also to recognize and verbalize its own actions or actions executed by another agent. As the desired translation task is communicated to the network through an additional signal word in the language input, PGAE can flexibly translate between and within modalities during inference.…”
Section: Slide Blue Quicklymentioning
confidence: 99%