2020
DOI: 10.1007/978-3-030-64096-5_4
|View full text |Cite
|
Sign up to set email alerts
|

D3PG: Decomposed Deep Deterministic Policy Gradient for Continuous Control

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 12 publications
0
1
0
Order By: Relevance
“…Decaying the exploration rate and coordinatingly shrinking the action space can mitigate these issues. To coordinate the agents in the system to jointly explore the environment, Dong et al (2020b) uses the state information of other agents to enhance the state information of the individual learning agent or Dong et al (2020a) decomposes the global critic into the weighted sum of multi-task local critics. We use the decentralized reinforcement learning method to learn the robot motion control policy.…”
Section: Decentralized Reinforcement Learningmentioning
confidence: 99%
“…Decaying the exploration rate and coordinatingly shrinking the action space can mitigate these issues. To coordinate the agents in the system to jointly explore the environment, Dong et al (2020b) uses the state information of other agents to enhance the state information of the individual learning agent or Dong et al (2020a) decomposes the global critic into the weighted sum of multi-task local critics. We use the decentralized reinforcement learning method to learn the robot motion control policy.…”
Section: Decentralized Reinforcement Learningmentioning
confidence: 99%