Learning Visual Shape Control of Novel 3D Deformable Objects from Partial-View Point Clouds

Thach, Bao; Cho, Brian Y.; Kuntz, Alan; Hermans, Tucker

doi:10.48550/arxiv.2110.04685

Cited by 1 publication

(1 citation statement)

References 35 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Existing deformable object manipulation approaches typically use one modality (mostly vision) and rely on finite element/particle-based techniques [6,7,8,9,10,11,12,13] or leverage deep learning for visual affordance/latent dynamics learning [14,15,16,17,18,19]. The former methods typically rely on privileged knowledge (e.g., occluded or unknown boundary conditions) and stop at system identification, limiting their ability to refine the underlying physics model by learning from data.…”

Section: Introductionmentioning

confidence: 99%

VIRDO: Visio-tactile Implicit Representations of Deformable Objects

Wi¹,

Florence²,

Zeng³

et al. 2022

2022 International Conference on Robotics and Automation (ICRA)

View full text Add to dashboard Cite

Deformable objects manipulation can benefit from representations that seamlessly integrate vision and touch while handling occlusions. In this work, we present a novel approach for, and real-world demonstration of, multimodal visuotactile state-estimation and dynamics prediction for deformable objects. Our approach, VIRDO++, builds on recent progress in multimodal neural implicit representations for deformable object state-estimation [1] via a new formulation for deformation dynamics and a complementary state-estimation algorithm that (i) maintains a belief distribution of deformation within a trajectory, and (ii) enables practical real-world application by removing the need for contact patches. In the context of two real-world robotic tasks, we show: (i) high-fidelity cross-modal state-estimation and prediction of deformable objects from partial visuo-tactile feedback, and (ii) generalization to unseen objects and contact formations.

show abstract