2009
DOI: 10.1007/s10514-009-9132-0

Learning model-free robot control by a Monte Carlo EM algorithm

Abstract: We address the problem of learning robot control by model-free reinforcement learning (RL). We adopt the probabilistic model of Vlassis and Toussaint (2009) for model-free RL, and we propose a Monte Carlo EM algorithm (MCEM) for control learning that searches directly in the space of controller parameters using information obtained from randomly generated robot trajectories. MCEM is related to, and generalizes, the PoWER algorithm of Kober and Peters (2009). In the finite-horizon case MCEM reduces precisely to…
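The abstract describes an EM-style update in which randomly perturbed controller parameters are weighted by the returns of the resulting trajectories. The following is a minimal sketch of that idea, not the authors' implementation: the toy reward, the linear-Gaussian controller, and all variable names are assumptions made for illustration.

```python
import numpy as np

# Hedged sketch of a Monte Carlo EM / PoWER-style policy update:
# sample perturbed controller parameters, roll them out, and take a
# return-weighted average as the new parameter estimate.
rng = np.random.default_rng(0)

STATE_DIM, HORIZON, N_ROLLOUTS, N_ITERS = 2, 20, 50, 30
SIGMA = 0.3                      # exploration noise std dev (assumed)
theta = np.zeros(STATE_DIM)      # controller parameters
target = np.array([1.0, -0.5])   # hidden "good" parameters of the toy task

def rollout(theta_sample):
    """Return the episodic return of one rollout under a toy reward."""
    ret = 0.0
    for _ in range(HORIZON):
        s = rng.normal(size=STATE_DIM)   # random state
        u = theta_sample @ s             # action from perturbed parameters
        u_star = target @ s              # action of the ideal controller
        ret += -(u - u_star) ** 2        # negative squared action error
    return ret

for _ in range(N_ITERS):
    # E-step (Monte Carlo): sample perturbed parameters and collect returns.
    samples = theta + SIGMA * rng.normal(size=(N_ROLLOUTS, STATE_DIM))
    returns = np.array([rollout(th) for th in samples])

    # Treat the (transformed) return as an improper probability weight.
    w = np.exp((returns - returns.max()) / (returns.std() + 1e-8))
    w /= w.sum()

    # M-step: reward-weighted average of the sampled parameters.
    theta = w @ samples

print("learned parameters:", theta)
```

The weighting step is only one possible transform of the returns; the key point, shared by MCEM and PoWER, is that better-scoring parameter samples pull the next estimate toward themselves without ever building a model of the dynamics.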

Cited by 51 publications (34 citation statements)
References 17 publications
“…Our novel algorithm, PoWER, is based on an expectation-maximization inspired optimization and a structured, state-dependent exploration. Our approach has already given rise to follow-up work in other contexts, for example, [Vlassis et al., 2009, Kormushev et al., 2010]. Theodorou et al. [2010] have shown that an algorithm very similar to PoWER can also be derived from a completely different perspective, that is, the path integral approach.…”
Section: Policy Learning by Weighting Exploration with the Returns (PoWER)
confidence: 96%
“…When the reward is treated as an improper probability distribution [Dayan and Hinton, 1997], safe and fast methods can be derived that are inspired by expectation-maximization. Some of these approaches have proven successful in robotics, e.g., reward-weighted regression [Peters and Schaal, 2008b], Policy Learning by Weighting Exploration with the Returns, Monte Carlo Expectation-Maximization [Vlassis et al., 2009], Cost-regularized Kernel Regression, and Policy Improvements with Path Integrals [Theodorou et al., 2010]. An overview of publications using policy search methods is presented in Table 2.2.…”
Section: Policy Search
confidence: 99%
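The statement above groups MCEM with other EM-inspired policy search methods such as reward-weighted regression. Below is a minimal sketch of that reward-weighted regression idea, assuming a linear-Gaussian policy and a toy one-dimensional action; it is our illustration, not code from any of the cited papers.

```python
import numpy as np

# Hedged sketch of reward-weighted regression: sample exploratory actions,
# convert rewards into positive weights, and refit the policy parameters by
# weighted least squares. Task, reward transform, and names are assumptions.
rng = np.random.default_rng(1)

STATE_DIM, N_SAMPLES, N_ITERS, SIGMA = 3, 200, 25, 0.5
theta = np.zeros(STATE_DIM)            # mean of the linear-Gaussian policy
target = np.array([0.8, -1.2, 0.3])    # hidden optimal linear controller

for _ in range(N_ITERS):
    S = rng.normal(size=(N_SAMPLES, STATE_DIM))          # sampled states
    U = S @ theta + SIGMA * rng.normal(size=N_SAMPLES)   # exploratory actions
    reward = -(U - S @ target) ** 2                      # per-sample reward
    w = np.exp(reward - reward.max())                    # improper "probability"

    # M-step: weighted least squares, argmin_theta sum_i w_i (u_i - theta^T s_i)^2
    A = S.T @ (w[:, None] * S)
    b = S.T @ (w * U)
    theta = np.linalg.solve(A, b)

print("fitted policy parameters:", theta)
```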
“…Interestingly, the EM algorithm family along with Monte Carlo sampling has also been used in [65] for model-free reinforcement learning, in order to obtain directly the policy without learning the model first. In [40], the authors use this kind of technique to optimize motion of a robot in order to minimize uncertainty of localization.…”
Section: Active Exploration While Learning
confidence: 99%