Proceedings of Second International Combined Workshop on Spatial Language Understanding and Grounded Communication for Robotics 2021
DOI: 10.18653/v1/2021.splurobonlp-1.5
|View full text |Cite
|
Sign up to set email alerts
|

Towards Navigation by Reasoning over Spatial Configurations

Abstract: We deal with the navigation problem where the agent follows natural language instructions while observing the environment. Focusing on language understanding, we show the importance of spatial semantics in grounding navigation instructions into visual perceptions. We propose a neural agent that uses the elements of spatial configurations and investigate their influence on the navigation agent's reasoning ability. Moreover, we model the sequential execution order and align visual objects with spatial configurat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 17 publications
0
4
0
Order By: Relevance
“…In this paper, we focus on spatial reasoning over text which can be described as inferring the implicit 1 spatial relations from explicit relations 2 described in the text. Spatial reasoning plays a crucial role in diverse domains, including language grounding (Liu et al, 2022), navigation (Zhang et al, 2021), and human-robot interaction (Venkatesh et al, 2021). By studying this task, we can analyze both the reading comprehension and logical reasoning capabilities of models.…”
Section: Introductionmentioning
confidence: 99%
“…In this paper, we focus on spatial reasoning over text which can be described as inferring the implicit 1 spatial relations from explicit relations 2 described in the text. Spatial reasoning plays a crucial role in diverse domains, including language grounding (Liu et al, 2022), navigation (Zhang et al, 2021), and human-robot interaction (Venkatesh et al, 2021). By studying this task, we can analyze both the reading comprehension and logical reasoning capabilities of models.…”
Section: Introductionmentioning
confidence: 99%
“…In Figure 1(b), the instruction "enter the door" does not help distinguish the target viewpoint from other candidate viewpoints since there are multiple doors and walls in the visual environment. As a result, we hypothesize those types of instructions cause the explicit and fine-grained grounding to be less effective for the VLN task, as appears in (Hong et al, 2020b;Zhang et al, 2021) that use sub-instructions and in (Hong et al, 2020a;Hu et al, 2019;Qi et al, 2020a;Zhang and Kordjamshidi, 2022a) that use object-level representations.…”
Section: Introductionmentioning
confidence: 99%
“…Understanding spatial language is important in many applications such as navigation (Zhang and Kordjamshidi, 2022;Zhang et al, 2021;Chen et al, 2019), medical domain Kamel Boulos et al, 2019;Massa et al, 2015), and robotics (Venkatesh et al, 2021;Kennedy et al, 2007). However, few benchmarks have directly focused on comprehending the spatial semantics of the text.…”
Section: Introductionmentioning
confidence: 99%