2018 IEEE International Conference on Robotics and Automation (ICRA) 2018
DOI: 10.1109/icra.2018.8460646
|View full text |Cite
|
Sign up to set email alerts
|

A Deep Learning Based Behavioral Approach to Indoor Autonomous Navigation

Abstract: We present a semantically rich graph representation for indoor robotic navigation. Our graph representation encodes: semantic locations such as offices or corridors as nodes, and navigational behaviors such as enter office or cross a corridor as edges. In particular, our navigational behaviors operate directly from visual inputs to produce motor controls and are implemented with deep learning architectures. This enables the robot to avoid explicit computation of its precise location or the geometry of the envi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
29
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 35 publications
(29 citation statements)
references
References 37 publications
0
29
0
Order By: Relevance
“…One common application is that of Conditional Random Fields (CRF) [28] for semantic segmentation, often used to provide globally smooth and consistent results to local predictions [43,25]. In the case of robot navigation, employing semantic graphs to abstract the physical map allows the agent to learn by understanding the relationship between semantic nodes independent of the metric space, which results to easier generalization across spaces [42]. Graph structures are also commonly used in human-object interaction tasks [39] and other spatiotemporal problems [20], creating connections among nodes within and across consecutive video frames, hence extending structure to include, in addition to space, also time.…”
Section: Related Workmentioning
confidence: 99%
“…One common application is that of Conditional Random Fields (CRF) [28] for semantic segmentation, often used to provide globally smooth and consistent results to local predictions [43,25]. In the case of robot navigation, employing semantic graphs to abstract the physical map allows the agent to learn by understanding the relationship between semantic nodes independent of the metric space, which results to easier generalization across spaces [42]. Graph structures are also commonly used in human-object interaction tasks [39] and other spatiotemporal problems [20], creating connections among nodes within and across consecutive video frames, hence extending structure to include, in addition to space, also time.…”
Section: Related Workmentioning
confidence: 99%
“…Similar to [8,9,10,12], we use deep learning [28] to parameterize a motion policy. This approach allows us to forgo hand-engineered features for sensor data.…”
Section: Related Workmentioning
confidence: 99%
“…In this work, we combine ideas from machine learning [8,9,10,11,12] and hierarchical planning [13] to improve reactive robot control. Our approach does not require handspecifying all the parameters of the reactive controller; instead, most parameters are optimized based on example navigation data through imitation learning [14,15,8].…”
Section: Introductionmentioning
confidence: 99%
“…In terms of our topological representation, our work is closely related to that of Sepulveda et al [40]. However, we do not rely on modifying the environment by introducing artificial landmarks, and we define a reduced set of primitive behaviors.…”
Section: Related Workmentioning
confidence: 99%
“…These modifications help to (1) facilitate the design of topological maps for realistic, human environments, and (2) increase the robustness of learned navigation behaviors given limited data. Furthermore, the work in [40] does not pose navigation as a graph traversal problem.…”
Section: Related Workmentioning
confidence: 99%