2008
DOI: 10.1007/978-3-540-68847-1_9

Cooperative/Competitive Behavior Acquisition Based on State Value Estimation of Others

Abstract: Existing reinforcement learning approaches suffer from the curse of dimensionality when applied to multiagent dynamic environments. A typical example is the RoboCup competitions, where other agents and their behaviors easily cause the state and action spaces to explode. This paper presents a modular learning method for a multiagent environment by which the learning agent can acquire cooperative behaviors with its teammates and competitive ones against its oppo…

Citations: cited by 2 publications (1 citation statement)
References: 8 publications
“…Here, the method is briefly introduced. More detailed description was given in (Noma et al, 2007). Fig.11 shows a basic architecture of the proposed system, i.e., a two-layered multimodule reinforcement learning system.…”
Section: Results (mentioning)
confidence: 99%