DOI: 10.1007/978-3-540-85502-6_4
|View full text |Cite
|
Sign up to set email alerts
|

Recognizing the Enemy: Combining Reinforcement Learning with Strategy Selection Using Case-Based Reasoning

Abstract: Abstract. This paper presents CBRetaliate, an agent that combines Case-Based Reasoning (CBR) and Reinforcement Learning (RL) algorithms. Unlike most previous work where RL is used to improve accuracy in the action selection process, CBRetaliate uses CBR to allow RL to respond more quickly to changing conditions. CBRetaliate combines two key features: it uses a time window to compute similarity and stores and reuses complete Q-tables for continuous problem solving. We demonstrate CBRetaliate on a team-based fir… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
31
0

Publication Types

Select...
6
2
1

Relationship

2
7

Authors

Journals

citations
Cited by 36 publications
(32 citation statements)
references
References 11 publications
1
31
0
Order By: Relevance
“…Sharma et al [17] make use of CBR as a function approximator for RL, and RL as revision algorithm for CBR in a hybrid architecture system; Gabel and Riedmiller [18] also makes use of CBR in the task of approximating a function over high-dimensional, continuous spaces; Juell and Paulson [19] exploit the use of RL to learn similarity metrics in response to feedback from the environment; Auslander et al [20] use CBR to adapt quickly an RL agent to changing conditions of the environment by the use of previously stored policies and Li, Zonghai and Feng [21] propose an algorithm that makes use of knowledge acquired by reinforcement learning to construct and extend a case base. Finally, Bianchi, Ros and López de Mántaras [22] use CBR together with Heuristic Accelerated Reinforcement Learning to improve reinforcement learning by using case based heuristics.…”
Section: Transfer Learningmentioning
confidence: 99%
“…Sharma et al [17] make use of CBR as a function approximator for RL, and RL as revision algorithm for CBR in a hybrid architecture system; Gabel and Riedmiller [18] also makes use of CBR in the task of approximating a function over high-dimensional, continuous spaces; Juell and Paulson [19] exploit the use of RL to learn similarity metrics in response to feedback from the environment; Auslander et al [20] use CBR to adapt quickly an RL agent to changing conditions of the environment by the use of previously stored policies and Li, Zonghai and Feng [21] propose an algorithm that makes use of knowledge acquired by reinforcement learning to construct and extend a case base. Finally, Bianchi, Ros and López de Mántaras [22] use CBR together with Heuristic Accelerated Reinforcement Learning to improve reinforcement learning by using case based heuristics.…”
Section: Transfer Learningmentioning
confidence: 99%
“…Two states are similar if the absolute difference of the attributes is smaller or equal than each of the corresponding entries in the table below. For example, (6,2,5,10) is similar to (3,1,8,5) relative to the major similarity but not relative to the minor similarity. The values in parenthesis in the Major similarity show the ranges for the large and small maps.…”
Section: Similarity Metricmentioning
confidence: 99%
“…The potential for integrating these two techniques has been demonstrated in a variety of domains including digital games [1] and robotics [2]. For the most part the integration has been aimed at exploiting synergies between RL and CBR that result in performance that is better than each individually (e.g., [3]) or to enhance the performance of the CBR system (e.g., [4]). Although researchers have pointed out that CBR could help to enhance RL processes [5], comparatively little research has been done in this direction, and the bulk of it has concentrated on tasks with continuous states [6,7,16,17].…”
Section: Introductionmentioning
confidence: 99%
“…RETALIATE (Reinforced Tactic Learning in Agent-Tam Environments), an online Q-Learning algorithm that creates strategies for teams of computer agents in the commercial First Person Shooter (FPS) game Unreal Tournament is introduced in [10]. This approach is extended in [11], where the authors use CBR in order to get the original RETALIATE algorithm to adapt more quickly to changes in the environment. IN COMPUTER GAME AI…”
Section: Related Workmentioning
confidence: 99%