Purposive behavior acquisition for a real robot by vision-based reinforcement learning

Asada, Minoru; Noda, Shoichi; Tawaratsumida, Sukoya; Hosoda, Koh

doi:10.1007/bf00117447

Cited by 197 publications

(110 citation statements)

References 10 publications

Supporting

Mentioning

105

Contrasting

Unclassified

Order By: Relevance

“…The state space was structured based on positions from where the box and the goal area can be seen in the CCD image, as described in [4]. The viewing angle of AIBO CCD is so narrow that the box or the goal area cannot be seen well with only one-directional images, in most cases.…”

Section: Rl Part Conducted On the Real Robotmentioning

confidence: 99%

“…The "state-action deviation" problem should be taken into account when executing Q-learning with the state constructed from a visual image [4]. This is the problem that optimal actions cannot be achieved due to the dispersion of state transitions because the state composed only of the images remains the same without clearly distinguishing differences in image values.…”

Section: Integration Of Gp and Rlmentioning

confidence: 99%

“…The huge amount of learning time required presents a great problem when using a real robot. Accordingly, most studies deal with the problems of receiving an immediate reward from an action as shown in [3], or loading the results learned with a simulator into a real robot as shown in [4,5].…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Integration of Genetic Programming and Reinforcement Learning for Real Robots

Kamio

Mitsuhashi

Iba

2003

Genetic and Evolutionary Computation — GECCO 2003

View full text Add to dashboard Cite

Abstract.We propose an integrated technique of genetic programming (GP) and reinforcement learning (RL) that allows a real robot to execute real-time learning. Our technique does not need a precise simulator because learning is done with a real robot. Moreover, our technique makes it possible to learn optimal actions in real robots. We show the result of an experiment with a real robot AIBO and represents the result which proves proposed technique performs better than traditional Q-learning method.

show abstract

Section: Rl Part Conducted On the Real Robotmentioning

confidence: 99%

Section: Integration Of Gp and Rlmentioning

confidence: 99%

See 1 more Smart Citation

Integration of Genetic Programming and Reinforcement Learning for Real Robots

Kamio

Mitsuhashi

Iba

2003

Genetic and Evolutionary Computation — GECCO 2003

View full text Add to dashboard Cite

show abstract

“…We have selected a simplified soccer game consisting of two or three robots as a testbed for the problem because both competitive and cooperative tasks are involved as stated in RoboCup Initiative [4]. We built an original soccer simulator which models real mobile robots we have been using so far in [1,8,9]. The environment consists of a ball and two goals, and a wall is placed around the field except the two goals.…”

Section: Mutual Skill Developmentmentioning

confidence: 99%

“…Table 1. Although we design these behaviors by hand in this experiments, these primitive behaviors can be acquired by other learning algorithms such as ones in [1,8,9]. …”

Section: Function and Terminal Setsmentioning

confidence: 99%

Cooperative Behavior Acquisition in a Multiple Mobile Robot Environment by Co-evolution

Uchibe

Nakamura

Asada

1999

Lecture Notes in Computer Science

Self Cite

View full text Add to dashboard Cite

Abstract. Co-evolution has been receiving increased attention as a method for multi agent simultaneous learning. This paper discusses how multiple robots can emerge cooperative behaviors through co-evolutionary processes. As an example task, a simplified soccer game with three learning robots is selected and a GP (genetic programming) method is applied to individual population corresponding to each robot so as to obtain cooperative and competitive behaviors through evolutionary processes. The complexity of the problem can be explained twofold: co-evolution for cooperative behaviors needs exact synchronization of mutual evolutions, and three robot co-evolution requires well-complicated environment setups that may gradually change from simpler to more complicated situations so that they can obtain cooperative and competitive behaviors simultaneously in a wide range of search area in various kinds of aspects. Simulation results are shown, and a discussion is given.

show abstract

Training and delayed reinforcements in Q-learning agents

Caironi

Dorigo

1997

Int. J. Intell. Syst.

View full text Add to dashboard Cite

Purposive behavior acquisition for a real robot by vision-based reinforcement learning

Cited by 197 publications

References 10 publications

Integration of Genetic Programming and Reinforcement Learning for Real Robots

Integration of Genetic Programming and Reinforcement Learning for Real Robots

Cooperative Behavior Acquisition in a Multiple Mobile Robot Environment by Co-evolution

Training and delayed reinforcements in Q-learning agents

Contact Info

Product

Resources

About