2021
DOI: 10.1016/j.aei.2021.101360
|View full text |Cite
|
Sign up to set email alerts
|

Deep reinforcement learning-based safe interaction for industrial human-robot collaboration using intrinsic reward function

Abstract: Aiming at human-robot collaboration in manufacturing, the operator's safety is the primary issue during the manufacturing operations. This paper presents a deep reinforcement learning approach to realize the real-time collision-free motion planning of an industrial robot for human-robot collaboration. Firstly, the safe human-robot collaboration manufacturing problem is formulated into a Markov decision process, and the mathematical expression of the reward function design problem is given. The goal is that the… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
19
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 73 publications
(19 citation statements)
references
References 32 publications
0
19
0
Order By: Relevance
“…Where, t   represents that the car continues to travel at speed along the current heading angle. This paper adds the prediction status of Unmanned Vehicle and resets reward function [24]. When the distance between the unmanned vehicle and the obstacle center is greater than the unmanned vehicle prediction distance, reward=-1 is set to give a positive return.…”
Section: B Q-learning Enhanced Learning Algorithm With Lstmmentioning
confidence: 99%
“…Where, t   represents that the car continues to travel at speed along the current heading angle. This paper adds the prediction status of Unmanned Vehicle and resets reward function [24]. When the distance between the unmanned vehicle and the obstacle center is greater than the unmanned vehicle prediction distance, reward=-1 is set to give a positive return.…”
Section: B Q-learning Enhanced Learning Algorithm With Lstmmentioning
confidence: 99%
“…In order to obtain the signal estimation in the domain, the Bayesian maximum a posteriori estimation method is used to calculate the a posteriori probability. It is obtained by Equation (5).…”
Section: Design Of Wavelet Shrinkage Algorithmmentioning
confidence: 99%
“…In diagnosis, it is convenient for doctors to obtain the patient's condition information (3). With the development of medicine, medical images have been continuously optimized to gradually form three-dimensional (3D) multimodal medical images, which make medical images clearer and have higher resolution (4,5). In order to effectively distinguish the pathological region from the normal region in medical image and enable doctors to diagnose and treat more intuitively, the segmentation of 3D multimodal medical image has become the focus of current research.…”
Section: Introductionmentioning
confidence: 99%
“…However, as a non-talent ability of the machine, applying this ability is a very challenging task. Further more, motion prediction has also been widely applied to autonomous driving [23,24,20], intelligent robot [39,38], human-robot collaboration [51,58,52,68,19,50,55,78], and multimedia applications [64,104], as shown in Fig 1.…”
Section: Introductionmentioning
confidence: 99%