The advent of deep learning has inspired research into end-to-end learning for a variety of problem domains in robotics. For navigation, the resulting methods may not have the desired generalization properties, let alone match the performance of traditional methods. Instead of learning a navigation policy, we explore learning an adaptive policy in the parameter space of an existing navigation module. Adaptive parameters provide the navigation module with a family of policies that can be dynamically reconfigured based on the local scene structure, and address the common assertion in machine learning that engineered solutions are inflexible. Of the methods tested, reinforcement learning (RL) provides a significant performance boost to a modern navigation method by reducing the sensitivity of its success rate to environmental clutter. The outcomes indicate that RL, as a meta-policy learner or dynamic parameter tuner, effectively robustifies algorithms that are sensitive to external, measurable nuisance factors.
I. INTRODUCTION

Autonomous navigation through static, unstructured environments has advanced over the past decades but fundamentally still relies on engineered approaches [1], [2]. Given an approximate map, these approaches use sensor data to update estimates of the environment, which are then used to evaluate candidate trajectories in terms of safety and other characteristics, with the aim of finding a collision-free, goal-attaining path. Traditionally designed systems involve manual parameter selection for general-purpose navigation, which exhibits sensitivity to environmental conditions. This paper investigates the use of machine learning to dynamically reconfigure the parameters of a hierarchical navigation system according to the robot's immediate, sensed surroundings. We show that scene-dependent online tuning improves navigation performance and reduces sensitivity to environmental conditions. The final reinforcement learning solution, called NavTuner, addresses the problem of parameter sensitivity to operational variance.
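To make the scene-dependent tuning idea concrete, the listing below is a minimal Python sketch and not the paper's NavTuner implementation: the parameter names, the clutter features, and the reconfigure hook are hypothetical placeholders, and the random selection stands in for a learned RL meta-policy. The point it illustrates is that the tuner's action space is the parameter space of the existing navigation module, not motor commands.

import numpy as np

# Hypothetical tunable parameters of an underlying navigation module
# (illustrative names only; not from the paper).
PARAM_CHOICES = {
    "planning_horizon_m": [2.0, 4.0, 8.0],    # how far ahead the local planner looks
    "obstacle_inflation_m": [0.2, 0.4, 0.6],  # safety margin around obstacles
}

def scene_features(scan: np.ndarray) -> np.ndarray:
    """Compress a laser scan into a small descriptor of local clutter."""
    return np.array([scan.min(), scan.mean(), (scan < 1.0).mean()])

class ParameterTuningPolicy:
    """Meta-policy: maps local scene features to a parameter setting.

    A learned policy (e.g., a Q-network trained with RL) would replace the
    random choice below; the navigation module itself is left unchanged.
    """
    def select_parameters(self, features: np.ndarray, rng=np.random) -> dict:
        return {name: rng.choice(values) for name, values in PARAM_CHOICES.items()}

# Usage: periodically re-tune the navigation stack from the latest sensed scene.
policy = ParameterTuningPolicy()
scan = np.random.uniform(0.1, 10.0, size=360)   # stand-in for live sensor data
params = policy.select_parameters(scene_features(scan))
# navigation_stack.reconfigure(**params)        # hypothetical hook into the planner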
A. Research Context

1) Navigation and Machine Learning: One candidate approach to learning and navigation is to replace the traditionally engineered system with an end-to-end, sensor-to-decision neural network [3]-[6]. Empirical and limited benchmarking show some promise on this front. However, instead of directly solving the navigation problem itself, these methods solve