2019
DOI: 10.1016/j.robot.2019.02.013
|View full text |Cite
|
Sign up to set email alerts
|

Solving the optimal path planning of a mobile robot using improved Q-learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
98
0
1

Year Published

2019
2019
2024
2024

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 242 publications
(123 citation statements)
references
References 37 publications
0
98
0
1
Order By: Relevance
“…A concurrent grid-based implementation of a dynamic programming algorithm was presented in Reference [9]. In Reference [10], the flower pollination algorithm (FPA) was implemented as partially guided Q learning to solve a low convergence problem. The suggested technique implemented was a path planner for a three-wheel mobile robot.…”
Section: Introductionmentioning
confidence: 99%
“…A concurrent grid-based implementation of a dynamic programming algorithm was presented in Reference [9]. In Reference [10], the flower pollination algorithm (FPA) was implemented as partially guided Q learning to solve a low convergence problem. The suggested technique implemented was a path planner for a three-wheel mobile robot.…”
Section: Introductionmentioning
confidence: 99%
“…The results also found that after reinforcement learning is added, the convergence time of robot path planning is increased by 13.54%. Low et al used the flower pollination algorithm to properly initialize the Q -value, which could speed up the convergence of mobile robots (Low et al, 2019 ). The principle is similar to reinforcement learning, therefore, the research results here are also supported.…”
Section: Discussionmentioning
confidence: 99%
“…In [24], Q-learning is used in combination with a Deep Deterministic Policy Gradients (DDPG) algorithm for a UAV to learn a landing task in simulation. In [25], the effectiveness of the Q-learning algorithm for robot path planning, is improved by using a flower pollinating algorithm to initialize the q-values of the algorithm.…”
Section: Cognitive Reasoningmentioning
confidence: 99%
“…are constraint averages for each of the variables in . The vector = 1 , 2 , … n , n = n V , represents the Lagrange multipliers, calculated for each variable in , using (25).…”
Section: The Set Of Variables Are Represented Bymentioning
confidence: 99%