2014
DOI: 10.1109/tcyb.2013.2253094

Heuristically-Accelerated Multiagent Reinforcement Learning

Abstract: This paper presents a novel class of algorithms, called Heuristically-Accelerated Multiagent Reinforcement Learning (HAMRL), which allows the use of heuristics to speed up well-known multiagent reinforcement learning (RL) algorithms such as the Minimax-Q. Such HAMRL algorithms are characterized by a heuristic function, which suggests the selection of particular actions over others. This function represents an initial action selection policy, which can be handcrafted, extracted from previous experience in disti…
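The mechanism described in the abstract, an action-selection rule biased by a heuristic function H, can be pictured with a short sketch. The code below is a simplified single-agent, tabular, ε-greedy version with a heuristic weight ξ; it is not the paper's exact Minimax-Q formulation, and all names and parameter values are illustrative assumptions.

```python
import random

def ha_action_selection(Q, H, state, actions, xi=1.0, epsilon=0.1):
    """Heuristically accelerated action selection (illustrative sketch):
    with probability 1 - epsilon pick argmax_a [Q(s, a) + xi * H(s, a)],
    otherwise explore with a uniformly random action."""
    if random.random() < epsilon:
        return random.choice(actions)
    return max(actions, key=lambda a: Q[(state, a)] + xi * H[(state, a)])
```

When H(s, a) is zero everywhere this reduces to ordinary ε-greedy selection, so the heuristic only biases exploration and does not change the learned Q-values.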

Cited by 69 publications (59 citation statements)
References 17 publications
“…The same effect happens when the heuristic is used only until a certain episode (Figure 7c). The results presented here corroborate the results presented in (Bianchi et al., 2014). This experiment was coded in C++, compiled with GNU g++, and executed on a virtual machine running Linux Ubuntu 14 LTS, virtualised with VMware Player on a Mac Pro running Mac OS X 10.6 with a 2.66 GHz Intel Xeon processor and 12 GB of RAM.…”
Section: Experiments 1: Mountain Car Problem (supporting)
confidence: 86%
“…Figure 7 shows that the results of the negative transfer when using L3-SARSA(λ) depend on the values of the η and ξ parameters and their decay (Figures 7a and 7b), and on the number of episodes for which the heuristic is used (Figure 7c). Bianchi et al. (2014) showed that, with fixed values of η and ξ, the algorithm takes longer to ignore the negative transfer. Multiplying ξ by a decay value at the end of each episode reduces the influence of the heuristics over time.…”
Section: Experiments 1: Mountain Car Problem (mentioning)
confidence: 99%
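The decay schedule mentioned in this excerpt can be summarised in a few lines. The sketch below assumes a multiplicative per-episode decay of the heuristic weight ξ; the initial value, decay factor, and episode count are illustrative and not taken from Bianchi et al. (2014).

```python
xi = 1.0          # initial heuristic weight (illustrative value)
xi_decay = 0.99   # multiplicative per-episode decay (illustrative value)

for episode in range(500):
    # ... run one learning episode, with xi weighting H(s, a) in action selection ...
    xi *= xi_decay  # heuristic influence shrinks each episode, so a misleading
                    # (negatively transferred) heuristic is gradually ignored
```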
“…Taylor and Stone [15] introduced behavior transfer, a novel approach to speeding up traditional RL. Celiberto, Matsuura et al. [2] applied transfer learning from one agent to another by means of the heuristic function, which speeds up the convergence of the algorithm. Case-based reasoning is used to transfer the learning, yielding the TL-HAQL algorithm.…”
Section: Approach On Accelerated Multiagent Reinforcement Learning (mentioning)
confidence: 99%
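One way to picture the transfer described in this excerpt is to derive the target agent's heuristic from a source agent's learned greedy policy. The sketch below assumes tabular Q-values and a constant bonus η for the transferred action; the construction and names are illustrative and not the exact TL-HAQL procedure.

```python
def build_transfer_heuristic(Q_source, states, actions, eta=1.0):
    """Turn a source agent's greedy policy into a heuristic H for the
    target agent: the action the source prefers in each state receives
    a bonus eta, all other actions receive zero. (Illustrative sketch.)"""
    H = {}
    for s in states:
        best = max(actions, key=lambda a: Q_source[(s, a)])
        for a in actions:
            H[(s, a)] = eta if a == best else 0.0
    return H
```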
“…In the particular case of multiagent systems, the reinforcement received by each agent depends both on the dynamics of the environment and on the behavior of the other agents, so a multiagent reinforcement learning (MRL) algorithm must address the resulting nonstationary scenarios arising from both the environment and the other agents. Unfortunately, convergence of any RL algorithm requires extensive exploration of the state-action space, which can be very time consuming [2]; moreover, the presence of multiple agents further increases the size of the state-action space, worsening the convergence of RL algorithms (even to suboptimal control policies) when they are adapted to multiagent problems. Acceleration of the learning process is therefore one of the important issues in reinforcement learning [3,4].…”
Section: Introduction (mentioning)
confidence: 99%