“…Previous works [42,43,44,45] have also explored various robotic planning and control methods for manipulating 3D articulated objects. More recent works further leveraged learning techniques for better predicting articulated part configurations, parameters, and states [6,7,4,46,3,5,47], estimating kinematic structures [1,2], as well as manipulating 3D articulated objects with the learned visual knowledge [8,9,10,11,12]. While most of these works represented visual data with link poses, joint parameters, and kinematic structures, such standardized abstractions may be insufficient if fine-grained part geometry, such as drawer handles and faucet switches that exhibit rich geometric diversity among different shapes, matters for downstream robotic tasks and motion planning.…”