2021
DOI: 10.1063/5.0034101
Robot grasping method optimization using improved deep deterministic policy gradient algorithm of deep reinforcement learning

Abstract: Robot grasping has become a very active research field, and the requirements for robot operation are rising accordingly. In previous studies, grasping based on traditional target-detection algorithms is often inefficient; this article is dedicated to improving a deep reinforcement learning algorithm to raise grasping efficiency and to address the impact of unknown disturbances on grasping. Using the characteristic that deep reinforcement lear…

Cited by 16 publications (6 citation statements)
References 12 publications
“…b) Policy-based methods are algorithms that directly optimize the policy, without estimating the value of states or state-action pairs, by determining the actions taken by the agent as a function of the agent's state and environment [27]: Policy Gradient [44], Advantage Actor-Critic (A2C) [45], Asynchronous Advantage Actor-Critic (A3C) [46], and Proximal Policy Optimization (PPO) [47]. In robotic manipulation, DRL is classified according to the specific tasks and problem-solving approaches. Grasping and manipulation: applications that use DRL for robots to perform manipulation tasks such as grasping and hand manipulation [12]. Navigation and localization: applications that use DRL for robots to navigate and localize in different environments. Figure 2 shows some examples from studies on manipulation tasks such as robotic grasping [57], robotic hand manipulation [58], and object manipulation [59] using DRL algorithms.…”
Section: Figure 1 DRL Classification
confidence: 99%
“…The DDPG algorithm has been successfully applied to robotics and motion control problems [81,82]. DDPG can be used for a variety of tasks, such as manipulation, locomotion, and navigation [83].…”
Section: Summary Of Studies Classified As Robotics
confidence: 99%
“…Robotic manipulation (H. X. Zhang et al., 2021) [81]. Techniques: Importance-Weighted Autoencoder (IWAE) and Gaussian parameter (Gaussian-DDPG). Methodology: addition of Gaussian parameters to the DDPG algorithm for better exploration and optimization of grasping position control using torque information.…”
Section: Airborne
confidence: 99%
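The Gaussian-perturbed exploration idea mentioned in the excerpt above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the linear `tanh` actor, the noise scale `sigma`, and the action bounds are all illustrative assumptions standing in for a trained DDPG actor network.

```python
import numpy as np

def deterministic_policy(state, weights):
    # Stand-in for the DDPG actor: a bounded deterministic action a = tanh(W s)
    return np.tanh(weights @ state)

def explore(state, weights, sigma, rng, low=-1.0, high=1.0):
    # Gaussian exploration: perturb the deterministic action with N(0, sigma^2)
    # noise, then clip back into the valid action range
    action = deterministic_policy(state, weights)
    noisy = action + rng.normal(0.0, sigma, size=action.shape)
    return np.clip(noisy, low, high)

rng = np.random.default_rng(0)
W = rng.standard_normal((2, 3)) * 0.1   # hypothetical 3-D state, 2-D action
s = np.array([0.5, -0.2, 1.0])
action = explore(s, W, sigma=0.1, rng=rng)
print(action.shape)  # (2,)
```

During training, larger `sigma` encourages broader exploration of grasp positions early on and is typically decayed as the policy converges.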
“…The effective resolution of complex manipulation tasks in an unstructured or highly variable environment remains an active field of research. Current research focuses mainly on grasping [12], picking and placing [13], and assembly tasks [14]. In particular, RL methods have shown high robustness to uncertainties in the latter, leading more and more researchers to focus on learning assembly skills.…”
Section: Contact-rich Manipulation Tasks: Assembly and Disassembly
confidence: 99%