1996
DOI: 10.1007/bf00117447
|View full text |Cite
|
Sign up to set email alerts
|

Purposive behavior acquisition for a real robot by vision-based reinforcement learning

Abstract: Abstract. This paper presents a method of vision-based reinforcement learning by which a robot learns to shoot a ball into a goal. We discuss several issues in applying the reinforcement learning method to a real robot with vision sensor by which the robot can obtain information about the changes in an environment. First, we construct a state space in terms of size, position, and orientation of a ball and a goal in an image, and an action space is designed in terms of the action commands to be sent to the left… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
105
0
2

Year Published

1997
1997
2006
2006

Publication Types

Select...
4
2

Relationship

2
4

Authors

Journals

citations
Cited by 197 publications
(110 citation statements)
references
References 10 publications
0
105
0
2
Order By: Relevance
“…The state space was structured based on positions from where the box and the goal area can be seen in the CCD image, as described in [4]. The viewing angle of AIBO CCD is so narrow that the box or the goal area cannot be seen well with only one-directional images, in most cases.…”
Section: Rl Part Conducted On the Real Robotmentioning
confidence: 99%
See 2 more Smart Citations
“…The state space was structured based on positions from where the box and the goal area can be seen in the CCD image, as described in [4]. The viewing angle of AIBO CCD is so narrow that the box or the goal area cannot be seen well with only one-directional images, in most cases.…”
Section: Rl Part Conducted On the Real Robotmentioning
confidence: 99%
“…The "state-action deviation" problem should be taken into account when executing Q-learning with the state constructed from a visual image [4]. This is the problem that optimal actions cannot be achieved due to the dispersion of state transitions because the state composed only of the images remains the same without clearly distinguishing differences in image values.…”
Section: Integration Of Gp and Rlmentioning
confidence: 99%
See 1 more Smart Citation
“…We have selected a simplified soccer game consisting of two or three robots as a testbed for the problem because both competitive and cooperative tasks are involved as stated in RoboCup Initiative [4]. We built an original soccer simulator which models real mobile robots we have been using so far in [1,8,9]. The environment consists of a ball and two goals, and a wall is placed around the field except the two goals.…”
Section: Mutual Skill Developmentmentioning
confidence: 99%
“…Table 1. Although we design these behaviors by hand in this experiments, these primitive behaviors can be acquired by other learning algorithms such as ones in [1,8,9]. …”
Section: Function and Terminal Setsmentioning
confidence: 99%