Reinforcement learning has emerged as a promising methodology for training robot controllers. However, most results have been limited to simulation due to the need for a large number of samples and the lack of automated-yet-safe data collection methods. Model-based reinforcement learning methods provide an avenue to circumvent these challenges, but the traditional concern has been the mismatch between the simulator and the real world. Here, we show that control policies learned in simulation can successfully transfer to a physical system composed of three Phantom robots pushing an object to various desired target positions. We use a modified form of the natural policy gradient algorithm for learning, applied to a carefully identified simulation model. The resulting policies, trained entirely in simulation, work well on the physical system without additional training. In addition, we show that training with an ensemble of models makes the learned policies more robust to modeling errors, thus compensating for difficulties in system identification. The results are illustrated in the accompanying video.
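For concreteness, below is a minimal sketch of the two ingredients named above: a normalized natural-gradient step and sampling from an ensemble of simulation models. The step size `delta`, the damping term, and the `sample_model_params` helper are illustrative assumptions for this sketch, not details taken from the paper.

```python
import numpy as np

def natural_gradient_step(theta, g, F, delta=0.05, damping=1e-4):
    """One natural policy gradient step, theta += alpha * F^{-1} g, with
    alpha normalized so the quadratic form g^T F^{-1} g moves by ~delta.
    (delta and damping are hypothetical hyperparameters.)"""
    F_reg = F + damping * np.eye(len(theta))   # damp the Fisher matrix for stability
    nat_g = np.linalg.solve(F_reg, g)          # F^{-1} g without forming the inverse
    alpha = np.sqrt(delta / (g @ nat_g + 1e-12))
    return theta + alpha * nat_g

def sample_model_params(nominal, rel_std=0.1, rng=np.random.default_rng()):
    """Draw one member of a model ensemble by perturbing the identified
    simulation parameters (e.g. masses, damping, friction coefficients).
    Training rollouts across such samples is what makes the resulting
    policy robust to modeling error."""
    return {k: v * (1.0 + rel_std * rng.normal()) for k, v in nominal.items()}
```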
This work shows that policies with simple linear and RBF parameterizations can be trained to solve a variety of widely studied continuous control tasks, including the OpenAI gym benchmarks. The performance of these trained policies is competitive with state-of-the-art results obtained with more elaborate parameterizations such as fully connected neural networks. Furthermore, the standard training and testing scenarios for these tasks are shown to be very limited and prone to overfitting, giving rise to policies that are merely trajectory-centric. Training with a diverse initial state distribution induces more global policies with better generalization. This allows for interactive control scenarios where the system recovers from large online perturbations, as shown in the supplementary video.
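To illustrate what such a simple parameterization can look like, here is a hedged sketch of a linear policy acting on random Fourier features, one common RBF construction; the feature count, bandwidth `nu`, and class interface are assumptions for this example, not the paper's exact setup.

```python
import numpy as np

class RBFPolicy:
    """Linear policy on random Fourier features: a sketch of an 'RBF
    parameterization'. The learned weights W would be trained with a
    policy gradient method; everything else is fixed at construction."""
    def __init__(self, obs_dim, act_dim, n_features=250, nu=1.0, seed=0):
        rng = np.random.default_rng(seed)
        self.P = rng.normal(size=(n_features, obs_dim))    # random projections
        self.phi = rng.uniform(-np.pi, np.pi, n_features)  # random phases
        self.nu = nu                                       # bandwidth (assumed value)
        self.W = np.zeros((act_dim, n_features))           # learnable weights

    def features(self, obs):
        return np.sin(self.P @ obs / self.nu + self.phi)

    def act(self, obs):
        return self.W @ self.features(obs)
```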
Fig. 1. Environments where we test our method on model-based control for generalization under parameter uncertainty. The top row shows the experimental test beds, where we evaluate object reconfiguration, drawer opening, and nonprehensile manipulation under parameter uncertainty. The bottom row shows a sample of the simulated environments: from left to right, the Shadow hand manipulating a die, a half-cheetah robot performing a backflip, and the Adroit hand opening a door.

This work addresses the problem of robot interaction in complex environments where online control and adaptation are necessary. By expanding the sample space in the free-energy formulation of path integral control, we derive a natural extension of path integral control that embeds uncertainty into the action and provides robustness for model-based robot planning. We apply our algorithm to a diverse set of tasks with different robots and validate the results in simulation and real-world experiments. We further show that our method runs in real time without loss of performance. Videos of the experiments, as well as additional implementation details, can be found at https://sites.google.com/view/emppi.
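To make the family of methods concrete, below is a minimal sketch of a generic sampling-based path-integral (MPPI-style) control update of the kind this abstract builds on; the temperature `lam`, noise scale `sigma`, and the user-supplied `rollout_cost` function are hypothetical, and the sketch deliberately omits the uncertainty-embedding extension that the paper itself contributes.

```python
import numpy as np

def mppi_update(u_nom, rollout_cost, horizon, act_dim, K=256,
                sigma=0.3, lam=1.0, rng=np.random.default_rng()):
    """One path-integral control update: sample K perturbed control
    sequences, score each with `rollout_cost` (a scalar cost of rolling
    the sequence through the dynamics model), and average the
    perturbations with exponential (softmin) weights."""
    eps = sigma * rng.normal(size=(K, horizon, act_dim))        # control perturbations
    costs = np.array([rollout_cost(u_nom + eps[k]) for k in range(K)])
    w = np.exp(-(costs - costs.min()) / lam)                    # free-energy weights
    w /= w.sum()
    return u_nom + np.einsum('k,kta->ta', w, eps)               # weighted correction
```

In a receding-horizon loop, the first action of the updated sequence is executed, the sequence is shifted by one step, and the update is repeated at the next control cycle.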