2021 4th International Conference on Artificial Intelligence and Pattern Recognition 2021
DOI: 10.1145/3488933.3489029
|View full text |Cite
|
Sign up to set email alerts
|

Meta Actor-Critic Framework for Multi-Agent Reinforcement Learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 4 publications
0
2
0
Order By: Relevance
“…Furthermore, even if explicit reward structures are crafted, agents might fall into reward manipulation during the policy learning and optimization phase (Skalse et al 2022). It involves exploiting vulnerabilities in the reward structure to maximize cumulative rewards without genuinely addressing the intended tasks (Leike et al 2018;Ouyang et al 2022).…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Furthermore, even if explicit reward structures are crafted, agents might fall into reward manipulation during the policy learning and optimization phase (Skalse et al 2022). It involves exploiting vulnerabilities in the reward structure to maximize cumulative rewards without genuinely addressing the intended tasks (Leike et al 2018;Ouyang et al 2022).…”
Section: Introductionmentioning
confidence: 99%
“…In summary, PbRL offers an effective way to acquire reward functions based on human intent rather than predefined designs. Its effectiveness spans diverse domains like robot control (Lee, Smith, and Abbeel 2021) and dialogue systems (Ouyang et al 2022).…”
Section: Introductionmentioning
confidence: 99%