Proceedings 2001 ICRA. IEEE International Conference on Robotics and Automation (Cat. No.01CH37164)
DOI: 10.1109/robot.2001.932842

Autonomous helicopter control using reinforcement learning policy search methods

Abstract: Many control problems in the robotics field can be cast as Partially Observed Markovian Decision Problems (POMDPs), an optimal control formalism. Finding optimal solutions to such problems in general, however, is known to be intractable. It has often been observed that in practice, simple structured controllers suffice for good sub-optimal control, and recent research in the artificial intelligence community has focused on policy search methods as techniques for finding sub-optimal controllers when such struct…

Cited by 191 publications (185 citation statements)
References 1 publication
“…In fact, many of the methods that scale to the most interesting tasks are model-based and often employ policy search rather than value function-based approaches [Miyamoto et al., 1996, Bagnell and Schneider, 2001, Kohl and Stone, 2004, Tedrake et al., 2005, Peters and Schaal, 2008b,c, Kober and Peters, 2008]. This stands in contrast to perhaps the bulk of research in the machine learning community [Kaelbling et al., 1996, Sutton and Barto, 1998].…”

Section: (D)
“…The computation of the policy update is the key step here, and a variety of updates have been proposed, ranging from pairwise comparisons [Strens and Moore, 2001, Ng et al., 2004a] over gradient estimation using finite policy differences [Geng et al., 2006, Mitsunaga et al., 2005, Sato et al., 2002, Tedrake et al., 2005] and general stochastic optimization methods (such as Nelder-Mead [Bagnell and Schneider, 2001], cross entropy [Rubinstein and Kroese, 2004] and population-based methods [Goldberg, 1989]) to approaches coming from optimal control, such as differential dynamic programming (DDP) [Atkeson, 1998] and multiple shooting approaches [Betts, 2001], as well as core reinforcement learning methods.…”

Section: Policy Search
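The finite-difference gradient estimation named in this excerpt can be illustrated with a minimal sketch. Everything here is an assumption for illustration: the hypothetical rollout_return callable (which would run one episode on the real or simulated system and return its total reward) and the step sizes are not from the cited papers.

import numpy as np

def finite_difference_gradient(rollout_return, theta, eps=0.05, n_rollouts=4):
    # rollout_return: hypothetical callable mapping a policy parameter
    #                 vector to one episode's return.
    # theta:          current policy parameters (NumPy array).
    # eps:            perturbation size per parameter.
    # n_rollouts:     rollouts averaged per evaluation to reduce noise.
    def avg_return(params):
        return np.mean([rollout_return(params) for _ in range(n_rollouts)])

    grad = np.zeros_like(theta)
    for i in range(len(theta)):
        delta = np.zeros_like(theta)
        delta[i] = eps
        # Central difference: (J(theta + d) - J(theta - d)) / (2 * eps)
        grad[i] = (avg_return(theta + delta) - avg_return(theta - delta)) / (2 * eps)
    return grad

# Gradient-ascent policy update (the 0.01 step size is an arbitrary choice):
# theta = theta + 0.01 * finite_difference_gradient(rollout_return, theta)

Note that each gradient estimate costs 2 * len(theta) * n_rollouts episodes, which is why this family of methods is typically used with low-dimensional, structured controllers of the kind the abstract describes.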
“…Examples range from basic upright hovering and forward flight [4,10,15,16,17] to inverted hovering [14], and even to extreme aerobatic maneuvers [1,6,5].…”

Section: Introduction
“…In particular, the model we present in this paper explicitly incorporates a model for the rotor speed dynamics, a crucial aspect of helicopter flight during autorotation. Then, since it can be very difficult to specify helicopter maneuvers by hand, we use the expert demonstrations to define the autorotation task. (See also, e.g., [5], where demonstrations were used to enable a helicopter to fly high-performance helicopter aerobatics.)…”

Section: Introduction
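To make the rotor-speed-dynamics idea concrete, here is a minimal sketch of fitting a first-order rotor speed model to logged demonstration data by least squares. The model form (d(omega)/dt = a*omega + b*u + c), the variable names, and the synthetic data are all illustrative assumptions, not the model or data from the cited paper.

import numpy as np

# Synthetic stand-ins for logged demonstration data (not real flight logs):
# omega[t]        rotor speed at timestep t
# u_collective[t] collective input at timestep t
rng = np.random.default_rng(0)
omega = 100.0 + rng.normal(0.0, 1.0, size=200).cumsum() * 0.01
u_collective = rng.uniform(-1.0, 1.0, size=200)
dt = 0.01

# Finite-difference estimate of the rotor speed derivative.
d_omega = np.diff(omega) / dt

# Least-squares fit of the assumed linear model d(omega)/dt = a*omega + b*u + c.
X = np.column_stack([omega[:-1], u_collective[:-1], np.ones(len(omega) - 1)])
coeffs, *_ = np.linalg.lstsq(X, d_omega, rcond=None)
a, b, c = coeffs
print(f"fitted model: d(omega)/dt = {a:.4f}*omega + {b:.4f}*u + {c:.4f}")

A model identified this way can then be plugged into a simulator for policy search, mirroring the model-based workflow the earlier excerpts describe.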