2016 5th Brazilian Conference on Intelligent Systems (BRACIS) 2016
DOI: 10.1109/bracis.2016.015
|View full text |Cite
|
Sign up to set email alerts
|

Object-Oriented Reinforcement Learning in Cooperative Multiagent Domains

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
12
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 12 publications
(12 citation statements)
references
References 11 publications
0
12
0
Order By: Relevance
“…A Best Paper Award from the BRACIS conference was awarded to one of those publications [Silva et al 2016]. Our work also received a Honorable Mention as Best Student Poster at the AAAI Conference on Artificial Intelligence in 2017.…”
Section: Scientific Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…A Best Paper Award from the BRACIS conference was awarded to one of those publications [Silva et al 2016]. Our work also received a Honorable Mention as Best Student Poster at the AAAI Conference on Artificial Intelligence in 2017.…”
Section: Scientific Resultsmentioning
confidence: 99%
“…The following full papers are the main related publications: [Silva et al 2020a, Silva et al 2020b, Silva and Costa 2019, Silva et al 2019b, Silva et al 2020c, Silva et al 2019a, Silva et al 2016.…”
Section: Scientific Resultsmentioning
confidence: 99%
“…While a common procedure in the literature is to provide hand-coded task mappings, autonomously computing task similarities is hard. In our BRACIS paper, we propose the Multiagent Object-Oriented MDP (MOO-MDP) (Silva, Glatt, and Costa 2016), a relational approach in which the state space is described through classes of objects. The main idea is that each entity in the environment (agent or not) belongs to a class of objects, and then the agent can generalize experiences.…”
Section: Partial Results and Future Workmentioning
confidence: 99%
“…Our work seeks to present a solution to how agents might automatically generate expressive temporally extended auxiliary tasks--in contrast to intermediate tasks--that can maximally leverage the directed experience of a single-task curriculum in object-centric environments. TaskExplore is distinct from the limited task generation works in curriculum learning literature [16,17], where tasks are not manually designed. The goal of our approach is not to generate good intermediate tasks to obtain experience samples, as is the goal in task generation for curriculum learning [15].…”
Section: Related Workmentioning
confidence: 99%