2021
DOI: 10.1007/978-3-030-71278-5_4
|View full text |Cite
|
Sign up to set email alerts
|

Learning to Identify Physical Parameters from Video Using Differentiable Physics

Abstract: Video representation learning has recently attracted attention in computer vision due to its applications for activity and scene forecasting or vision-based planning and control. Video prediction models often learn a latent representation of video which is encoded from input frames and decoded back into images. Even when conditioned on actions, purely deep learning based architectures typically lack a physically interpretable latent space. In this study, we use a differentiable physics engine within an action-… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
8
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
6
3

Relationship

2
7

Authors

Journals

citations
Cited by 10 publications
(8 citation statements)
references
References 12 publications
0
8
0
Order By: Relevance
“…In contrast to our method, they additionally require depth inputs for a strong 3D cue and do not recover local surface deformations. The idea of combining a differentiable physics and a differentiable graphics engine has been recently explored in contexts different from ours, i.e., estimation of material properties and visuomotor control [21,23,31].…”
Section: Related Workmentioning
confidence: 99%
“…In contrast to our method, they additionally require depth inputs for a strong 3D cue and do not recover local surface deformations. The idea of combining a differentiable physics and a differentiable graphics engine has been recently explored in contexts different from ours, i.e., estimation of material properties and visuomotor control [21,23,31].…”
Section: Related Workmentioning
confidence: 99%
“…Unsupervised system identification from vision is a recent area of research that removes the requirements for trajectory data, with approaches including unsupervised physical parameter estimation [24,31,40], structured latent space learning [19,25,32], and Hamiltonian/Lagrangian learning [18,51,58]. Unfortunately, these approaches are still relatively limited in the complexity of scene they can model, and typically restricted to toy problems and simulated environments.…”
Section: Related Workmentioning
confidence: 99%
“…Unfortunately, these approaches are still relatively limited in the complexity of scene they can model, and typically restricted to toy problems and simulated environments. In this work we aim to improve upon [24,31,40]'s limitation to simulated environments by performing physical parameter estimation on real dynamical scenes with distractors.…”
Section: Related Workmentioning
confidence: 99%
“…Weiss et al [32] estimate material properties of deformables by matching a differentiable deformable simulation to point cloud observations. In Kandukuri et al [15] a differentiable physics simulation based on [9] is embedded as layer into a deep neural network which infers the physical state from images and predicts the next states. Recently, [16] proposed gradSim, a framework that combines differentiable simulation and differentiable rendering for system identification from video and visual control.…”
Section: Related Workmentioning
confidence: 99%