Reinforcement learning-based mobile robot navigation

Altuntas, Nihal; Imal, Erkan; Emanet, Nahit; Öztürk, Ceyda Nur

doi:10.3906/elk-1311-129

Cited by 26 publications

(12 citation statements)

References 21 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…In the mobile robot framework structure in Figure 6, the agent is controlled by the supervisor controller, and the robot controller is executed by the agent, so that the robot operates in the warehouse environment and the necessary information is observed and collected [25][26][27].…”

Section: System Structurementioning

confidence: 99%

Mobile Robot Path Optimization Technique Based on Reinforcement Learning Algorithm in Warehouse Environment

Lee

Jeong

2021

Applied Sciences

View full text Add to dashboard Cite

This paper reports on the use of reinforcement learning technology for optimizing mobile robot paths in a warehouse environment with automated logistics. First, we compared the results of experiments conducted using two basic algorithms to identify the fundamentals required for planning the path of a mobile robot and utilizing reinforcement learning techniques for path optimization. The algorithms were tested using a path optimization simulation of a mobile robot in same experimental environment and conditions. Thereafter, we attempted to improve the previous experiment and conducted additional experiments to confirm the improvement. The experimental results helped us understand the characteristics and differences in the reinforcement learning algorithm. The findings of this study will facilitate our understanding of the basic concepts of reinforcement learning for further studies on more complex and realistic path optimization algorithm development.

show abstract

Section: System Structurementioning

confidence: 99%

Mobile Robot Path Optimization Technique Based on Reinforcement Learning Algorithm in Warehouse Environment

Lee

Jeong

2021

Applied Sciences

View full text Add to dashboard Cite

show abstract

“…Typically, an agent will evaluate a state, and will then undertake an action either in an exploitative or exploratory manner thereafter and finally will receive an instant reward, while transitioning to a new state. Q-learning has tremendous success in robotics, especially in mobile robot navigation and obstacle avoidance [60,61]. In [62] the Dyna AI architecture was proposed to integrate both learning, and experience, based on online planning, as well as reactive execution in a stochastic environment.…”

Section: Generic Approaches To Uncertaintymentioning

confidence: 99%

Reinforcement learning based adaptive power pinch analysis for energy management of stand-alone hybrid energy storage systems considering uncertainty

Nyong-Bassey

Giaouris

Patsios

et al. 2020

Energy

View full text Add to dashboard Cite

This is a PDF file of an article that has undergone enhancements after acceptance, such as the addition of a cover page and metadata, and formatting for readability, but it is not yet the definitive version of record. This version will undergo additional copyediting, typesetting and review before it is published in its final form, but we are providing this version to give early visibility of the article. Please note that, during the production process, errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

show abstract

“…Both behaviors need to produce dynamically feasible trajectories that are robust to many kinds of noise and inaccuracies, rely only on observations coming from primitive sensors, and avoid unexpected obstacles. In this paper, we present a reliable method to implement these navigation behaviors by learning 1 Google AI, Mountain View, CA 94043, USA lewispro,faust,mfiser,centaur@google.com * Authors contributed equally. end to end polices that directly map sensors to controls, and we show that these policies transfer from simulation to physical robots and new environments while robustly avoiding obstacles.…”

Section: Introductionmentioning

confidence: 99%

Learning Navigation Behaviors End-to-End With AutoRL

Chiang

Faust

Fišer

et al. 2019

IEEE Robot. Autom. Lett.

210

149

View full text Add to dashboard Cite

We learn end-to-end point-to-point and pathfollowing navigation behaviors that avoid moving obstacles. These policies receive noisy lidar observations and output robot linear and angular velocities. The policies are trained in small, static environments with AutoRL, an evolutionary automation layer around Reinforcement Learning (RL) that searches for a deep RL reward and neural network architecture with large-scale hyper-parameter optimization. AutoRL first finds a reward that maximizes task completion, and then finds a neural network architecture that maximizes the cumulative of the found reward. Empirical evaluations, both in simulation and on-robot, show that AutoRL policies do not suffer from the catastrophic forgetfulness that plagues many other deep reinforcement learning algorithms, generalize to new environments and moving obstacles, are robust to sensor, actuator, and localization noise, and can serve as robust building blocks for larger navigation tasks. Our path-following and point-topoint policies are respectively 23% and 26% more successful than comparison methods across new environments. Video at: https://youtu.be/0UwkjpUEcbI.

show abstract

Reinforcement learning-based mobile robot navigation

Cited by 26 publications

References 21 publications

Mobile Robot Path Optimization Technique Based on Reinforcement Learning Algorithm in Warehouse Environment

Mobile Robot Path Optimization Technique Based on Reinforcement Learning Algorithm in Warehouse Environment

Reinforcement learning based adaptive power pinch analysis for energy management of stand-alone hybrid energy storage systems considering uncertainty

Learning Navigation Behaviors End-to-End With AutoRL

Contact Info

Product

Resources

About