2021
DOI: 10.1109/lra.2020.3047730
|View full text |Cite
|
Sign up to set email alerts
|

Human-Centered Collaborative Robots With Deep Reinforcement Learning

Abstract: We present a reinforcement learning based framework for human-centered collaborative systems. The framework is proactive and balances the benefits of timely actions with the risk of taking improper actions by minimizing the total time spent to complete the task. The framework is learned end-toend in an unsupervised fashion addressing the perception uncertainties and decision making in an integrated manner. The framework is shown to provide more fluent coordination between human and robot partners on an example… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
34
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 62 publications
(34 citation statements)
references
References 29 publications
(39 reference statements)
0
34
0
Order By: Relevance
“…In deep reinforcement learning, the single-step average reward value of each episode is an important indicator to measure the training effect [ 30 , 31 , 32 , 33 ]. This paper counts the average single-step rewards of [ 22 ] and DCPER-DDPG algorithm in 6000 episodes.…”
Section: Results Analysismentioning
confidence: 99%
“…In deep reinforcement learning, the single-step average reward value of each episode is an important indicator to measure the training effect [ 30 , 31 , 32 , 33 ]. This paper counts the average single-step rewards of [ 22 ] and DCPER-DDPG algorithm in 6000 episodes.…”
Section: Results Analysismentioning
confidence: 99%
“…(1) Creation of 3D virtual models from the experimental assembly by any 3D design software or point cloud creation by laser scanning technology with conversion to some standard 3D format (OBJ, FBX, STL, IGES, etc. ); (2) Import 3D models into the software with cinematic rendering and some simulation of dynamics; (3) Algorithms design of an automatic data queue of parts positioning, rotating, and camera setup by parts size; (4) Rendering two sets of images: the first for CNN teaching and the second for an automated annotation algorithm; (5) Creating of XML file for single shot detection and JSON format for instance segmentation; (6) Automated ratio sorting to training and testing samples and moving to separate folder; (7) Training of convolutional neural network for parts classification and localization (using single shot detection and instance segmentation); (8) Transformation of CNN models into some type of embedded devices for inference of the trained model and results distribution of the detected position data to assisted assembly systems: a collaborative robot internal Cartesian system and mixed reality device anchoring system;…”
Section: Methodology Of Deep Learning Implementation Into the Assisted Assembly Processmentioning
confidence: 99%
“…A nice review of virtual, mixed, and augmented reality for immersive systems research is presented in [4]. Some other research results of the mixed assembly process between human and collaborative robots are described in [5][6][7]. An AR-based worker support system for human-robot collaboration using AR libraries was proposed in [8] and an anchoring support system using the AR toolkit was developed in [9].…”
Section: Introduction and Related Workmentioning
confidence: 99%
“…On the one hand, Al-Yacoub et al present in [140] a LfD methodology that combines a machine learning algorithm -i.e., Random Forest (RF)-with stochastic regression, using haptic information captured from human demonstration. On the other hand, Ghadirzadeh et al propose in [141] a RL based framework for a more time-efficient HR cooperation that finds an optimal balance between timely actions and the risk of taking improper actions.…”
Section: Efficiency-oriented Control System Designmentioning
confidence: 99%
“…All the aforementioned control strategies based on learning have been developed to address uncertainties and external disturbances that might provoke the robot's performance degradation by replacing the traditional proportional-integralderivative controllers, typically characterized by a complicated tuning of control parameters. Since the above described methods suffer from several problems, ranging from a huge computation time to a limited generalizability or adaptability to unseen situations, NNs based on modern control theories -e.g., SMC [89], [90], Takagi-Sugeno fuzzy control [57], and RL [70], [109], [112], [121], [133], [141]-are introduced in the literature and modeled to overcome these complex robot's control issues. Obviously, also these innovative advanced techniques present limitations, such as chattering and sensitive problems for the SMC and possible instabilities for fuzzy approaches.…”
Section: B Emerging Control Issues and Challengesmentioning
confidence: 99%