A Deep Learning Based Behavioral Approach to Indoor Autonomous Navigation

Sepúlveda, Gabriel Ocampo; Niebles, Juan Carlos; Soto, Álvaro

doi:10.1109/icra.2018.8460646

Cited by 35 publications

(29 citation statements)

References 37 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…One common application is that of Conditional Random Fields (CRF) [28] for semantic segmentation, often used to provide globally smooth and consistent results to local predictions [43,25]. In the case of robot navigation, employing semantic graphs to abstract the physical map allows the agent to learn by understanding the relationship between semantic nodes independent of the metric space, which results to easier generalization across spaces [42]. Graph structures are also commonly used in human-object interaction tasks [39] and other spatiotemporal problems [20], creating connections among nodes within and across consecutive video frames, hence extending structure to include, in addition to space, also time.…”

Section: Related Workmentioning

confidence: 99%

3D Scene Graph: A Structure for Unified Semantics, 3D Space, and Camera

Armeni

Zamir

et al. 2019

2019 IEEE/CVF International Conference on Computer Vision (ICCV)

239

161

View full text Add to dashboard Cite

A comprehensive semantic understanding of a scene is important for many applications -but in what space should diverse semantic information (e.g., objects, scene categories, material types, texture, etc.) be grounded and what should be its structure? Aspiring to have one unified structure that hosts diverse types of semantics, we follow the Scene Graph paradigm in 3D, generating a 3D Scene Graph. Given a 3D mesh and registered panoramic images, we construct a graph that spans the entire building and includes semantics on objects (e.g., class, material, and other attributes), rooms (e.g., scene category, volume, etc.) and cameras (e.g., location, etc.), as well as the relationships among these entities.However, this process is prohibitively labor heavy if done manually. To alleviate this we devise a semi-automatic framework that employs existing detection methods and enhances them using two main constraints: I. framing of query images sampled on panoramas to maximize the performance of 2D detectors, and II. multi-view consistency enforcement across 2D detections that originate in different camera locations.

show abstract

Section: Related Workmentioning

confidence: 99%

3D Scene Graph: A Structure for Unified Semantics, 3D Space, and Camera

Armeni

Zamir

et al. 2019

2019 IEEE/CVF International Conference on Computer Vision (ICCV)

239

161

View full text Add to dashboard Cite

show abstract

“…Similar to [8,9,10,12], we use deep learning [28] to parameterize a motion policy. This approach allows us to forgo hand-engineered features for sensor data.…”

Section: Related Workmentioning

confidence: 99%

“…In this work, we combine ideas from machine learning [8,9,10,11,12] and hierarchical planning [13] to improve reactive robot control. Our approach does not require handspecifying all the parameters of the reactive controller; instead, most parameters are optimized based on example navigation data through imitation learning [14,15,8].…”

Section: Introductionmentioning

confidence: 99%

Deep Local Trajectory Replanning and Control for Robot Navigation

Ashwini

Martín-Martín

Goebel

et al. 2019

2019 International Conference on Robotics and Automation (ICRA)

View full text Add to dashboard Cite

We present a navigation system that combines ideas from hierarchical planning and machine learning. The system uses a traditional global planner to compute optimal paths towards a goal, and a deep local trajectory planner and velocity controller to compute motion commands. The latter components of the system adjust the behavior of the robot through attention mechanisms such that it moves towards the goal, avoids obstacles, and respects the space of nearby pedestrians. Both the structure of the proposed deep models and the use of attention mechanisms make the system's execution interpretable. Our simulation experiments suggest that the proposed architecture outperforms baselines that try to map global plan information and sensor data directly to velocity commands. In comparison to a hand-designed traditional navigation system, the proposed approach showed more consistent performance.

show abstract

“…In terms of our topological representation, our work is closely related to that of Sepulveda et al [40]. However, we do not rely on modifying the environment by introducing artificial landmarks, and we define a reduced set of primitive behaviors.…”

Section: Related Workmentioning

confidence: 99%

“…These modifications help to (1) facilitate the design of topological maps for realistic, human environments, and (2) increase the robustness of learned navigation behaviors given limited data. Furthermore, the work in [40] does not pose navigation as a graph traversal problem.…”

Section: Related Workmentioning

confidence: 99%

A Behavioral Approach to Visual Navigation with Graph Localization Networks

Chen¹,

Vicente²,

Sepúlveda³

et al. 2019

Robotics: Science and Systems XV

View full text Add to dashboard Cite

Inspired by research in psychology, we introduce a behavioral approach for visual navigation using topological maps. Our goal is to enable a robot to navigate from one location to another, relying only on its visual input and the topological map of the environment. We propose using graph neural networks for localizing the agent in the map, and decompose the action space into primitive behaviors implemented as convolutional or recurrent neural networks. Using the Gibson simulator, we verify that our approach outperforms relevant baselines and is able to navigate in both seen and unseen environments. Webpage URL: https://graphnav.stanford.edu.

show abstract

A Deep Learning Based Behavioral Approach to Indoor Autonomous Navigation

Cited by 35 publications

References 37 publications

3D Scene Graph: A Structure for Unified Semantics, 3D Space, and Camera

3D Scene Graph: A Structure for Unified Semantics, 3D Space, and Camera

Deep Local Trajectory Replanning and Control for Robot Navigation

A Behavioral Approach to Visual Navigation with Graph Localization Networks

Contact Info

Product

Resources

About