2021
DOI: 10.48550/arxiv.2109.06668
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Exploration in Deep Reinforcement Learning: A Comprehensive Survey

Abstract: Deep Reinforcement Learning (DRL) and Deep Multi-agent Reinforcement Learning (MARL) have achieved significant success across a wide range of domains, including game AI, autonomous vehicles, robotics, finance, healthcare, transportation and so on. However, DRL and deep MARL agents are widely known to be sample-inefficient and millions of interactions are usually needed even for relatively simple game settings, thus preventing the wide application and deployment in real-industry scenarios. One bottleneck challe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
14
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 8 publications
(14 citation statements)
references
References 85 publications
0
14
0
Order By: Relevance
“…We also believe the exploration benefits we observed are novel. The problem of exploration in RL is fundamental, and existing solutions make use of noise, explicit exploratory drives/bonuses, and/or other forms of auto-annealing [9,10]. We suggest an additional class of strategies, namely, exploration as an added benefit of having multiple independent drives, since exploitation from the perspective of one module is exploration from the perspective of another.…”
Section: Discussionmentioning
confidence: 99%
“…We also believe the exploration benefits we observed are novel. The problem of exploration in RL is fundamental, and existing solutions make use of noise, explicit exploratory drives/bonuses, and/or other forms of auto-annealing [9,10]. We suggest an additional class of strategies, namely, exploration as an added benefit of having multiple independent drives, since exploitation from the perspective of one module is exploration from the perspective of another.…”
Section: Discussionmentioning
confidence: 99%
“…However, these works are outdated as more CDL methods have emerged and become increasingly popular since then. In the domain of reinforcement learning, the recent success of leveraging intrinsic rewards (artificial curiosity) to encourage efficient exploration is reviewed and discussed [4,143]. Nevertheless, these works lack a comprehensive framework and nuanced understanding of psychological curiosity to differentiate and utilize different types of artificial curiosity fundamentally.…”
Section: Introductionmentioning
confidence: 99%
“…The key to solve this problem is to reduce the size of the search space properly (Yang et al, 2021).…”
Section: Introductionmentioning
confidence: 99%