2019
DOI: 10.1007/978-3-030-35699-6_52
|View full text |Cite
|
Sign up to set email alerts
|

B-Human 2019 – Complex Team Play Under Natural Lighting Conditions

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
3
2

Relationship

1
7

Authors

Journals

citations
Cited by 8 publications
(5 citation statements)
references
References 4 publications
0
5
0
Order By: Relevance
“…In 10 trials, the agent first walked to the path between the ball and its own goal, to block a potential shot from the opponent, before turning toward the ball and approaching the ball and opponent. This type of defensive behavior is manually implemented in some RoboCup teams to defend the goal against an attacker with possession (36). Our learned policy, in contrast, discovered this tactic "on its own" by optimizing for task reward (which includes minimizing opponent scoring) rather than via manual specification.…”
Section: Opponent Awarenessmentioning
confidence: 99%
“…In 10 trials, the agent first walked to the path between the ball and its own goal, to block a potential shot from the opponent, before turning toward the ball and approaching the ball and opponent. This type of defensive behavior is manually implemented in some RoboCup teams to defend the goal against an attacker with possession (36). Our learned policy, in contrast, discovered this tactic "on its own" by optimizing for task reward (which includes minimizing opponent scoring) rather than via manual specification.…”
Section: Opponent Awarenessmentioning
confidence: 99%
“…Reinforcement learning is an area that has received attention since the early days of RoboCup to tackle some of the main technical challenges, which emerge in various of its football leagues, in isolation. These works typically focus on the handling of large state-action spaces (134,135), skill learning (136)(137)(138)(139)(140)(141), the keep-away and half-field offense tasks and multi-agent coordination (142)(143)(144)(145)(146)(147)(148)(149)(150), grounded simulation learning for improved skills (151-154) (e.g. in sim-toreal and back), skill learning in 3D humanoid football (155,156), and deep reinforcement learning for parameterised action spaces (157).…”
Section: Robocup and Simulated Footballmentioning
confidence: 99%
“…Yet, despite these examples of applications of reinforcement learning in the RoboCup domain, many successful recent RoboCup competition entries learn or optimize only a subset of the components of the control architecture (e.g. 141,158). One successful RoboCup approach related to our method is Layered Learning (29,159) which uses reinforcement learning at multiple levels of a pre-defined hierarchy of skills, from individual ball interaction to multi-agent behaviours such as pass selection.…”
Section: Robocup and Simulated Footballmentioning
confidence: 99%
“…Implementations for efficient neural network inference on the NAO platform have been provided by [11] as well as by [12]. The classification of preprocessed image patches, e. g. for ball classification, has become the de facto standard, one example out of many is given in [13]. A Deep Learning approach for processing the whole image for the purpose of robot detection has been presented by [14].…”
Section: Scientific and Technical Challengesmentioning
confidence: 99%