2019 IEEE International Conference on Industrial Technology (ICIT)
DOI: 10.1109/icit.2019.8755032

Multi-Agent Deep Reinforcement Learning with Human Strategies

Abstract: Deep learning has enabled traditional reinforcement learning methods to deal with high-dimensional problems. However, one of the disadvantages of deep reinforcement learning methods is the limited exploration capacity of learning agents. In this paper, we introduce an approach that integrates human strategies to increase the exploration capacity of multiple deep reinforcement learning agents. We also report the development of our own multi-agent environment called Multiple Tank Defence to simulate the proposed a…
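The idea sketched in the abstract — drawing on human strategies to widen agents' exploration — can be illustrated with a simple action-selection rule. The snippet below is a hypothetical sketch, not the paper's actual method: `human_policy`, `learned_policy`, and the mixing probabilities `beta` and `epsilon` are all assumptions for illustration.

```python
import random

def select_action(state, learned_policy, human_policy,
                  beta=0.2, epsilon=0.1, n_actions=5):
    """Pick an action, occasionally deferring to a human strategy.

    `learned_policy` and `human_policy` are hypothetical callables that
    map a state to an action index in range(n_actions). With probability
    `beta` the agent follows the human strategy, with probability
    `epsilon` it explores uniformly at random, and otherwise it follows
    its own learned policy.
    """
    r = random.random()
    if r < beta:
        return human_policy(state)          # exploit human knowledge
    if r < beta + epsilon:
        return random.randrange(n_actions)  # uniform random exploration
    return learned_policy(state)            # follow the learned policy
```

Compared with plain epsilon-greedy exploration, the human-strategy branch biases exploration toward state regions a human would visit, which is one plausible way to read "increasing exploration capacity".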

Cited by 17 publications (9 citation statements)
References 15 publications
“…Establishing communication channels among agents during learning is an important step in designing and constructing MADRL algorithms. Nguyen et al. [82] characterized the communication channel via human knowledge represented by images and allowed deep RL agents to communicate using these shared images. The asynchronous advantage actor-critic (A3C) algorithm [74] is used to learn the optimal policy for each agent, which can be extended to multiple heterogeneous agents.…”
Section: MADRL Applications
confidence: 99%
“…These pose important research questions towards extensions of imitation learning and inverse RL to MADRL methods. In addition, for complicated tasks or behaviors that are difficult for humans to demonstrate, there is a need for alternative methods that allow human preferences to be integrated into deep RL [13,81,82].…”
Section: Conclusion and Research Directions
confidence: 99%
“…As a result, it is possible to infer trajectories in a dynamic environment based on the conditional distribution. Reinforcement learning (RL) has become a promising approach to modeling an autonomous agent [24]–[28]. RL has the ability to mimic human learning behaviors to maximize the long-term reward.…”
Section: Related Work
confidence: 99%
“…The action policy is used to describe the agent's behavior, which specifies the way in which the agent chooses an action from a state. If the action policy, h: X → U, does not change over time, it is considered stationary [20].…”
Section: Single-Agent Case
confidence: 99%
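A stationary deterministic policy h: X → U of the kind described above can be illustrated as a fixed lookup table; the state set X and action set U below are hypothetical:

```python
# Illustrative state space X and action space U (not from the cited work).
X = ["low", "medium", "high"]
U = ["wait", "act"]

# A stationary deterministic policy h: X -> U as a fixed mapping.
h = {"low": "wait", "medium": "wait", "high": "act"}

def policy(x):
    """h does not depend on time: querying the same state at any step
    always yields the same action, which is what stationarity means."""
    return h[x]
```

A non-stationary policy, by contrast, would take the time step as an additional argument and could map the same state to different actions at different times.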