2019
DOI: 10.1016/j.engappai.2019.06.019
|View full text |Cite
|
Sign up to set email alerts
|

Accelerating decentralized reinforcement learning of complex individual behaviors

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
2
2
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 5 publications
0
3
0
Order By: Relevance
“…Reinforcement learning for robot motion control suffers from the complexity of action space increases exponentially with the number of robot actuators, which makes policy search and optimization difficult (Leottau et al (2018); Leottau et al (2019)). Decentralized reinforcement learning methods decompose the robot into multiple agents to explore the action space coordinately to achieve the common goal.…”
Section: Decentralized Reinforcement Learningmentioning
confidence: 99%
See 1 more Smart Citation
“…Reinforcement learning for robot motion control suffers from the complexity of action space increases exponentially with the number of robot actuators, which makes policy search and optimization difficult (Leottau et al (2018); Leottau et al (2019)). Decentralized reinforcement learning methods decompose the robot into multiple agents to explore the action space coordinately to achieve the common goal.…”
Section: Decentralized Reinforcement Learningmentioning
confidence: 99%
“…Decentralized reinforcement learning methods decompose the robot into multiple agents to explore the action space coordinately to achieve the common goal. The decomposed action space will improve policy learning efficiency by reducing the search space of policies (Leottau et al (2019)). However, the decentralized learning system has non-stationary and non-Markovian issues, which causes policy divergence, especially in the multitask situation.…”
Section: Decentralized Reinforcement Learningmentioning
confidence: 99%
“…In recent decades, and with the advancement in artificial intelligence (AI) models, they became powerful tools to solve complex engineering problems [19][20][21][22][23][24][25][26][27][28][29] . A number of AI-based studies have conducted to enhance the accuracy of Dx estimation in turbulent flow systems such as streams 30,33 .…”
Section: Introductionmentioning
confidence: 99%