2020 IEEE International Conference on Robotics and Automation (ICRA) 2020
DOI: 10.1109/icra40945.2020.9197464
|View full text |Cite
|
Sign up to set email alerts
|

Deep compositional robotic planners that follow natural language commands

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
12
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
3
1

Relationship

3
6

Authors

Journals

citations
Cited by 18 publications
(12 citation statements)
references
References 18 publications
0
12
0
Order By: Relevance
“…Robot navigation [10] and path planning from natural language instructions have been widely investigated in the field of robotics [11]- [13]. Such a task was recently formalized adopting data-driven methods [4] with the release of the R2R dataset.…”
Section: Related Workmentioning
confidence: 99%
“…Robot navigation [10] and path planning from natural language instructions have been widely investigated in the field of robotics [11]- [13]. Such a task was recently formalized adopting data-driven methods [4] with the release of the R2R dataset.…”
Section: Related Workmentioning
confidence: 99%
“…We use the intuition developed in Kuo et al (2020a) that models which are composed of sub-networks can disentangle the meaning of words in a sentence without direct supervision. In other words, the agent is never told what is supposed to mean as opposed to .…”
Section: Modelmentioning
confidence: 99%
“…Similar to attention-based predictors [20,12], attention plays a key role in our model and guides the trajectory predictions. Although the scope of the attention module is gated by the linguistic descriptions [21][22][23], agents that don't co-occur in the same description don't need to be jointly attended to. Language guides attention which ultimately removes extraneous agents and improves performance.…”
Section: A Trajectory Prediction With Attention and Structuresmentioning
confidence: 99%
“…The input linguistic commands for planners mostly specify goals and referents, e.g., landmarks, for navigation or locating the objects to interact with. Some planners learn visual attention maps conditioned on language to predict a goal map [27], filter relevant objects [23], or estimate a visitation map [28] for a single agent. However, none of these planners considers language describing interactions with other agents and manners of performing an action, such as quickly or slowly.…”
Section: B Language and Planningmentioning
confidence: 99%