2019
DOI: 10.1109/access.2019.2932257
|View full text |Cite
|
Sign up to set email alerts
|

Deep Reinforcement Learning With Optimized Reward Functions for Robotic Trajectory Planning

Abstract: To improve the efficiency of deep reinforcement learning (DRL)-based methods for robotic trajectory planning in the unstructured working environment with obstacles. Different from the traditional sparse reward function, this paper presents two brand-new dense reward functions. First, the azimuth reward function is proposed to accelerate the learning process locally with a more reasonable trajectory by modeling the position and orientation constraints, which can reduce the blindness of exploration dramatically.… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
33
0
3

Year Published

2019
2019
2024
2024

Publication Types

Select...
6
3
1

Relationship

0
10

Authors

Journals

citations
Cited by 61 publications
(36 citation statements)
references
References 25 publications
0
33
0
3
Order By: Relevance
“…2) Negative reward function, i.e., reward expressed as negative of the cost as used in [28] and is given by. − ( ( ) + ( )) ( 11)…”
Section: Rewardmentioning
confidence: 99%
“…2) Negative reward function, i.e., reward expressed as negative of the cost as used in [28] and is given by. − ( ( ) + ( )) ( 11)…”
Section: Rewardmentioning
confidence: 99%
“…In such cases, which entail most realistic applications, the value function needs to be approximated. In a large group of works, e.g., [10]- [14], artificial neural networks were employed to approximate the value function over the entire state-space. Despite few successful trials, most early attempts of such were not very successful due to the overfitting problem.…”
Section: Introductionmentioning
confidence: 99%
“…Another key part is the design of reward function since the optimization of strategy is based on the cumulative reward of each state. How to design the reward function will affect the efficiency and effectiveness of skill learning in the framework of robot reinforcement learning [10]. It is still a difficult task to design a suitable reward function, since different tasks are often difficult to have a unified reward function.…”
Section: Introductionmentioning
confidence: 99%