2021
DOI: 10.48550/arxiv.2108.13976
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

WarpDrive: Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning on a GPU

Tian Lan,
Sunil Srinivasa,
Huan Wang
et al.

Abstract: Deep reinforcement learning (RL) is a powerful framework to train decision-making models in complex dynamical environments. However, RL can be slow as it learns through repeated interaction with a simulation of the environment. Accelerating RL requires both algorithmic and engineering innovations. In particular, there are key systems engineering bottlenecks when using RL in complex environments that feature multiple agents or highdimensional state, observation, or action spaces, for example. We present WarpDri… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 8 publications
0
2
0
Order By: Relevance
“…A few recent works, e.g., Brax [7], Isaac Gym [19], and WarpDrive [16], use accelerators like GPUs and TPUs for the environment engine. Due to the highly parallel nature of the accelerators, numerous environments can be executed simultaneously.…”
Section: Related Workmentioning
confidence: 99%
“…A few recent works, e.g., Brax [7], Isaac Gym [19], and WarpDrive [16], use accelerators like GPUs and TPUs for the environment engine. Due to the highly parallel nature of the accelerators, numerous environments can be executed simultaneously.…”
Section: Related Workmentioning
confidence: 99%
“…MARL frameworks such as [10] and MAVA [15] are designed to enable easier and more efficient implementation of MARL algorithms. The former innovates by focusing on performance with the use of GPU and their parallelization power.…”
Section: Related Workmentioning
confidence: 99%