2024
DOI: 10.1007/s11263-024-02159-8
|View full text |Cite
|
Sign up to set email alerts
|

ESceme: Vision-and-Language Navigation with Episodic Scene Memory

Qi Zheng,
Daqing Liu,
Chaoyue Wang
et al.

Abstract: Vision-and-language navigation (VLN) simulates a visual agent that follows natural-language navigation instructions in real-world scenes. Existing approaches have made enormous progress in navigation in new environments, such as beam search, pre-exploration, and dynamic or hierarchical history encoding. To balance generalization and efficiency, we resort to memorizing visited scenarios apart from the ongoing route while navigating. In this work, we introduce a mechanism of Episodic Scene memory (ESceme) for VL… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 46 publications
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?