2019 IEEE International Conference on Autonomous Robot Systems and Competitions (ICARSC) 2019
DOI: 10.1109/icarsc.2019.8733632
|View full text |Cite
|
Sign up to set email alerts
|

Learning low level skills from scratch for humanoid robot soccer using deep reinforcement learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
15
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
2

Relationship

2
5

Authors

Journals

citations
Cited by 25 publications
(15 citation statements)
references
References 5 publications
0
15
0
Order By: Relevance
“…Our learning framework employs the Proximal Policy Optimization (PPO) algorithm, introduced by Schulman et al [37], which was chosen due to its success in optimizing low-level skills concerning the NAO robot [21,38,39,40,41], and highlevel skills [42], where it outperformed other algorithms such as TRPO or DDPG. The chosen implementation uses the clipped surrogate objective:…”
Section: Learning Frameworkmentioning
confidence: 99%
See 2 more Smart Citations
“…Our learning framework employs the Proximal Policy Optimization (PPO) algorithm, introduced by Schulman et al [37], which was chosen due to its success in optimizing low-level skills concerning the NAO robot [21,38,39,40,41], and highlevel skills [42], where it outperformed other algorithms such as TRPO or DDPG. The chosen implementation uses the clipped surrogate objective:…”
Section: Learning Frameworkmentioning
confidence: 99%
“…• Performance: to have an entirely fair comparison, the performance of our framework should be compared with other frameworks in the same scenario and simulator. To do so, we took into consideration the maximum forward speed, and our proposed framework provides a faster walk than the agents in [45,50,46,25,19] and slower than [51,21,38,41]. However, some of the faster examples are solely focused on sprinting forward, without the basic ability of changing direction [21,38,41].…”
Section: Featuresmentioning
confidence: 99%
See 1 more Smart Citation
“…First, in [17], the action space comprises two controllable, abstract, and consequently discrete commands: dash, to get closer to the ball, and kick, to push the ball. Second, in [18], the objective is to produce continuous actions, which are considered as low-level.…”
Section: Related Workmentioning
confidence: 99%
“…It is also important to distinguish the concept of learning at a high or at a low level of abstraction. For instance, the authors of [17] and [18] separate this in two different applications in a simulated environment. First, in [17], the action space comprises two controllable, abstract, and consequently discrete commands: dash, to get closer to the ball, and kick, to push the ball.…”
Section: Related Workmentioning
confidence: 99%