2021
DOI: 10.48550/arxiv.2109.06795
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

ROMAX: Certifiably Robust Deep Multiagent Reinforcement Learning via Convex Relaxation

Abstract: In a multirobot system, a number of cyberphysical attacks (e.g., communication hijack, observation perturbations) can challenge the robustness of agents. This robustness issue worsens in multiagent reinforcement learning because there exists the non-stationarity of the environment caused by simultaneously learning agents whose changing policies affect the transition and reward functions. In this paper, we propose a minimax MARL approach to infer the worst-case policy update of other agents. As the minimax form… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 19 publications
0
1
0
Order By: Relevance
“…Adversarial attacks and adversarial training in MARL Lin et al [10], Guo et al [5] attacked one agent in the cMARL environment. Li et al [9], Sun et al [25], Nisioti et al [16] did minimax adversarial training in the cMARL environment, which assumed some agents may behave adversarially against other agents. All these works assumed agent(s) can behave adversarially at any timestep.…”
Section: Related Workmentioning
confidence: 99%
“…Adversarial attacks and adversarial training in MARL Lin et al [10], Guo et al [5] attacked one agent in the cMARL environment. Li et al [9], Sun et al [25], Nisioti et al [16] did minimax adversarial training in the cMARL environment, which assumed some agents may behave adversarially against other agents. All these works assumed agent(s) can behave adversarially at any timestep.…”
Section: Related Workmentioning
confidence: 99%