2019 IEEE/CVF International Conference on Computer Vision (ICCV) 2019
DOI: 10.1109/iccv.2019.00819
|View full text |Cite
|
Sign up to set email alerts
|

Cross-View Policy Learning for Street Navigation

Abstract: The ability to navigate from visual observations in unfamiliar environments is a core component of intelligent agents and an ongoing challenge for Deep Reinforcement Learning (RL). Street View can be a sensible testbed for such RL agents, because it provides real-world photographic imagery at ground level, with diverse street appearances; it has been made into an interactive environment called StreetLearn [27] and used for research on navigation. However, goal-driven street navigation agents have not so far be… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
15
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
2
2

Relationship

1
9

Authors

Journals

citations
Cited by 30 publications
(15 citation statements)
references
References 40 publications
(64 reference statements)
0
15
0
Order By: Relevance
“…Cross-View Learning (CVL) essentially searches for mappings between two views, where the similarity between the samples from different modalities can be measured directly. It has been widely applied in real applications [29], [30], [31], [32], [33], [34]. With adversarial training, the embedding spaces of two individual views are learned and aligned simultaneously [30].…”
Section: Related Workmentioning
confidence: 99%
“…Cross-View Learning (CVL) essentially searches for mappings between two views, where the similarity between the samples from different modalities can be measured directly. It has been widely applied in real applications [29], [30], [31], [32], [33], [34]. With adversarial training, the embedding spaces of two individual views are learned and aligned simultaneously [30].…”
Section: Related Workmentioning
confidence: 99%
“…We implement this method as a baseline for comparison. Mirowski et al (2018) have trained deep RL agents in large-scale environments based on Google Street View and studied the transfer of navigation skills by doing limited, modular retraining in new cities, with optional adaption using aerial images (Li et al 2019).…”
Section: Related Workmentioning
confidence: 99%
“…Our fully-configurable environment runs on top of the Unity game engine and their ML-Agents framework [68]. CityLearn is related to the recent StreetLearn work [59] used in [2], [69], [70] but has a range of useful differences. We propose the usage of diverse environments across 5 countries and additionally enable loading any other dataset including in-house recorded data; see Tables I and II for a detailed comparison. In Table I, each environment (region/dataset) also includes GPS data; except for Goald Coast Drive and Alderley Day/Nigth.…”
Section: The Citylearn Environmentmentioning
confidence: 99%