Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence 2018
DOI: 10.24963/ijcai.2018/451
|View full text |Cite
|
Sign up to set email alerts
|

Fast Model Identification via Physics Engines for Data-Efficient Policy Search

Abstract: This paper presents a method for identifying mechanical parameters of robots or objects, such as their mass and friction coefficients. Key features are the use of off-the-shelf physics engines and the adaptation of a Bayesian optimization technique towards minimizing the number of real-world experiments needed for model-based reinforcement learning. The proposed framework reproduces in a physics engine experiments performed on a real robot and optimizes the model's mechanical parameters so as to match real-wor… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
29
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 35 publications
(29 citation statements)
references
References 1 publication
0
29
0
Order By: Relevance
“…The closest to our approach are the methods from [30,31,32,33,34] that propose to iteratively learn simulation parameters and train policies. In [30], an iterative system identification framework is used to optimize trajectories of a bipedal robot in simulation and calibrate the simulation parameters by minimizing the discrepancy between the real world and simulated execution of the trajectories.…”
Section: Related Workmentioning
confidence: 99%
“…The closest to our approach are the methods from [30,31,32,33,34] that propose to iteratively learn simulation parameters and train policies. In [30], an iterative system identification framework is used to optimize trajectories of a bipedal robot in simulation and calibrate the simulation parameters by minimizing the discrepancy between the real world and simulated execution of the trajectories.…”
Section: Related Workmentioning
confidence: 99%
“…For example, Christiano et al [9] learn inverse dynamics models from data gradually collected from a real robotics system, while transferring trajectory planning policy from a simulator. Chebotar et al [10] and Zhu et al [11] transfer manipulation policies by iteratively collecting data on the real system and updating a distribution of dynamics parameters for the simulator physics engine. Similar principles work for the problem of humanoid balancing [12].…”
Section: Related Workmentioning
confidence: 99%
“…Reality gap is the major obstacle to applying deep RL in robotics. Neunert et al [31] analyzed potential causes of the reality gap, some of which can be solved by system identification [32]. Li et al [33] showed that the transferability of open loop locomotion could be increased if carefully measured physical parameters were used in simulation.…”
Section: B Overcoming the Reality Gapmentioning
confidence: 99%