DOI: 10.22215/etd/2012-09679
|View full text |Cite
|
Sign up to set email alerts
|

Multi-agent reinforcement learning in games

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
45
0

Publication Types

Select...
5

Relationship

0
5

Authors

Journals

citations
Cited by 12 publications
(45 citation statements)
references
References 33 publications
(94 reference statements)
0
45
0
Order By: Relevance
“…(x p , y p ) ← (0, 0) {pursuer initial position} 10: initialize (x e , y e ) randomly {evader initial position} 11: update s p = ( ,̇) 12: update s e = ( , d) 13: u ← Eq. (5.59) 17: play the game, observe the next states s ′ p and s ′ e and the reward r 18: end for 25: end for (5.59) 17: play the game, observe the next states s ′ p and s ′ e and the reward r 18: end for 25: end for…”
Section: Q( )-Learning Fuzzy Inference Systemmentioning
confidence: 99%
See 1 more Smart Citation
“…(x p , y p ) ← (0, 0) {pursuer initial position} 10: initialize (x e , y e ) randomly {evader initial position} 11: update s p = ( ,̇) 12: update s e = ( , d) 13: u ← Eq. (5.59) 17: play the game, observe the next states s ′ p and s ′ e and the reward r 18: end for 25: end for (5.59) 17: play the game, observe the next states s ′ p and s ′ e and the reward r 18: end for 25: end for…”
Section: Q( )-Learning Fuzzy Inference Systemmentioning
confidence: 99%
“…33, point O on the invader's reachable region is the closest point to the 33. Reproduced from[18], © X. Lu. Reproduced from[18], © X. Lu.…”
mentioning
confidence: 99%
“…Moreover, it is well known that fuzzy inference systems are widely used as function approximators [4], [5]. Reinforcement fuzzy learning methods have recently been proposed for the problem of learning in differential games [5]- [9] . In [5], only the consequent parameters of the FLC and fuzzy inference system (FIS) are tuned using a fuzzy actor-critic learning algorithm.…”
Section: Introductionmentioning
confidence: 99%
“…In addition the FIS is used as an approximation to the actionvalue function, Q(s,a). In [9], fuzzy actor-critic learning is applied to the guarding territory differential game. In this learning technique, the consequent parameters are tuned to allow the defender to learn its Nash equilibrium strategy.…”
Section: Introductionmentioning
confidence: 99%
“…Therefore, it is preferable to use these algorithms in an unsupervised learning manner. In [2], [8]- [12] , reinforcement learning (RL) methods have also been proposed for the problem of tuning the FLC parameters in an unsupervised manner.…”
Section: Introductionmentioning
confidence: 99%