2019
DOI: 10.1109/tcyb.2017.2781130
|View full text |Cite
|
Sign up to set email alerts
|

MOO-MDP: An Object-Oriented Representation for Cooperative Multiagent Reinforcement Learning

Abstract: Reinforcement learning (RL) is a widely known technique to enable autonomous learning. Even though RL methods achieved successes in increasingly large and complex problems, scaling solutions remains a challenge. One way to simplify (and consequently accelerate) learning is to exploit regularities in a domain, which allows generalization and reduction of the learning space. While object-oriented Markov decision processes (OO-MDPs) provide such generalization opportunities, we argue that the learning process may… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
18
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 30 publications
(18 citation statements)
references
References 20 publications
0
18
0
Order By: Relevance
“…The following full papers are the main related publications: [Silva et al 2020a, Silva et al 2020b, Silva and Costa 2019, Silva et al 2019b, Silva et al 2020c, Silva et al 2019a, Silva et al 2016.…”
Section: Scientific Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…The following full papers are the main related publications: [Silva et al 2020a, Silva et al 2020b, Silva and Costa 2019, Silva et al 2019b, Silva et al 2020c, Silva et al 2019a, Silva et al 2016.…”
Section: Scientific Resultsmentioning
confidence: 99%
“…We have also explored the generalization capabilities provided by objectoriented representations [Silva et al 2019b]. Our first work leveraging this representation estimates Probabilistic Inter-TAsk Mappings (PITAM) [Silva and Costa 2017] through human-given task descriptions.…”
Section: Methods and Avenues For Future Workmentioning
confidence: 99%
See 1 more Smart Citation
“…It too promotes learning of energy trading management where Agents arbitrate optimal solution from Agent's individualistic decision to gain uniform coalition payoff. Thus, probability stagnation schema was committed to ensure build-up convergence is attained during learning developments [26]- [28].…”
Section: Reinforced Learning For Cooperative Tendency In Layered Cmentioning
confidence: 99%
“…Reinforcement learning (RL) is an important machine learning method that has many applications in areas such as intelligent control of robots and analysis and prediction. The reinforcement learning algorithm can learn the mapping from environmental state to behavior, which makes the behavior selected by the agent can obtain the biggest reward of the environment [4,5]. However, reinforcement learning often faces dimensional disasters when the state of the system is large.…”
Section: Introductionmentioning
confidence: 99%