2018
DOI: 10.1007/978-3-319-97310-4_48
|View full text |Cite
|
Sign up to set email alerts
|

Weighted Double Deep Multiagent Reinforcement Learning in Stochastic Cooperative Environments

Abstract: Recently, multiagent deep reinforcement learning (DRL) has received increasingly wide attention. Existing multiagent DRL algorithms are inefficient when facing with the non-stationarity due to agents update their policies simultaneously in stochastic cooperative environments. This paper extends the recently proposed weighted double estimator to the multiagent domain and propose a multiagent DRL framework, named weighted double deep Qnetwork (WDDQN). By utilizing the weighted double estimator and the deep neura… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
21
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
4
1

Relationship

3
7

Authors

Journals

citations
Cited by 36 publications
(21 citation statements)
references
References 8 publications
0
21
0
Order By: Relevance
“…Later this idea was applied to arbitrary function approximators, including deep neural networks, i.e., Double DQN [92], which were naturally applied since two networks were already used in DQN (see Section 2.2). These ideas have also been recently applied to MDRL [178].…”
Section: Avoiding Deep Learning Amnesia: Examples In Mdrlmentioning
confidence: 99%
“…Later this idea was applied to arbitrary function approximators, including deep neural networks, i.e., Double DQN [92], which were naturally applied since two networks were already used in DQN (see Section 2.2). These ideas have also been recently applied to MDRL [178].…”
Section: Avoiding Deep Learning Amnesia: Examples In Mdrlmentioning
confidence: 99%
“…The main idea of DRL is to maximize the cumulative reward rather than an immediate reward. The principle of DRL is illustrated in Figure 4(d [56], asynchronous actor-critic (A3C) [57], duelling DQN [58], double DQN [59], and multiagent DRL [60]. Researchers are still working to improve the DRL method by integrating LSTM [61] or CNN [62] with it and utilizing the advantages of both architectures in the same network.…”
Section: ) Deep Reinforcement Learning (Drl)mentioning
confidence: 99%
“…Game playing [23] defines the state based on the outputs of APIs and Android testing [25] uses an existing tool UIAutomator [28] to extract structures as the states, which are both not applicable in web testing. Secondly, one fundamental challenge of RL is how to perform effective exploration especially when the space of the environment is huge [29,30]. The existing techniques [21,23,31] mainly guide the exploration with simple reward functions, which could be ineffective for web applications that have complex business logic and frequent dynamic update.…”
Section: Introductionmentioning
confidence: 99%