2008
DOI: 10.1007/978-3-540-68847-1_19
|View full text |Cite
|
Sign up to set email alerts
|

Heuristic Reinforcement Learning Applied to RoboCup Simulation Agents

Abstract: Abstract. This paper describes the design and implementation of robotic agents for the RoboCup Simulation 2D category that learns using a recently proposed Heuristic Reinforcement Learning algorithm, the Heuristically Accelerated Q-Learning (HAQL). This algorithm allows the use of heuristics to speed up the well-known Reinforcement Learning algorithm Q-Learning. A heuristic function that influences the choice of the actions characterizes the HAQL algorithm. A set of empirical evaluations was conducted in the R… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0
1

Year Published

2009
2009
2021
2021

Publication Types

Select...
4
4

Relationship

2
6

Authors

Journals

citations
Cited by 18 publications
(12 citation statements)
references
References 3 publications
0
11
0
1
Order By: Relevance
“…Heuristic model: it is a deterministic approach for representing the behaviour of a simulated user. Among the most common methods for representing information deterministically are hierarchical patterns [ 46 ] and rule sets [ 47 ]. Heuristic models are simple to create and maintain, and require little effort to modify.…”
Section: Simulated Usersmentioning
confidence: 99%
“…Heuristic model: it is a deterministic approach for representing the behaviour of a simulated user. Among the most common methods for representing information deterministically are hierarchical patterns [ 46 ] and rule sets [ 47 ]. Heuristic models are simple to create and maintain, and require little effort to modify.…”
Section: Simulated Usersmentioning
confidence: 99%
“…New exploration strategy helps to find out the optimal policy from transition history faster than if we would do it randomly. There were studies [12] related to the introducing heuristic function for multiagent reinforcement learning [13], however, it was able to perform only in deterministic action space. Combining heuristic function and actor-critic algorithm should lead to increasing the speed of algorithm convergence, in case when the optimal policy should be established from the set of previous interactions.…”
Section: Motivationmentioning
confidence: 99%
“…Bianchi, Ribeiro and Costa (9) investigated the use of a multiagent HARL algorithm in a simplified simulator for the robot soccer domain; Celiberto, Ribeiro, Costa and Bianchi (12) studied the use of the HARL algorithms to speed up learning in the RoboCup 2D Simulation domain. Finally, Martins and Bianchi (13) studied the use of several HARL algorithms in a simulated Robot soccer environment that reproduces the conditions of a real physical robot, the FIRA Simurosot competition league.…”
Section: Heuristic Accelerated Reinforcement Learning and The Haql Almentioning
confidence: 99%