2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022
DOI: 10.1109/cvpr52688.2022.00366
|View full text |Cite
|
Sign up to set email alerts
|

Watch It Move: Unsupervised Discovery of 3D Joints for Re-Posing of Articulated Objects

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 23 publications
(7 citation statements)
references
References 51 publications
0
7
0
Order By: Relevance
“…Similarly, MovingParts [YZH*24] studies self‐supervised parts discovery by a motion‐based grouping mechanism that uses slot‐based attention, assuming each group follows a rigid motion. Watch‐it‐move [NIT*22] is the first approach that learns re‐poseable part decomposition from multi‐view videos and foreground masks without any prior knowledge of the structure. Skeletonization of these part‐level decomposition methods enables object re‐posing, as discussed in Sec.…”
Section: State‐of‐the‐art Methodsmentioning
confidence: 99%
“…Similarly, MovingParts [YZH*24] studies self‐supervised parts discovery by a motion‐based grouping mechanism that uses slot‐based attention, assuming each group follows a rigid motion. Watch‐it‐move [NIT*22] is the first approach that learns re‐poseable part decomposition from multi‐view videos and foreground masks without any prior knowledge of the structure. Skeletonization of these part‐level decomposition methods enables object re‐posing, as discussed in Sec.…”
Section: State‐of‐the‐art Methodsmentioning
confidence: 99%
“…There are a number of works that aim to recover 3D keypoints using self-supervised geometric reasoning [12,22], but they are limited to rigid objects. More recent unsupervised methods work for articulated objects from monocular RGB data [9,10,10,18,20,24], multi-view data [16], or point clouds [27], where authors suggest to condition on the predicted keypoints and train a conditional generative model to supervise the keypoints through reconstruction losses. We propose a simpler pipeline where we apply our novel unsupervised losses to the predicted keypoints directly and do not require additional models besides the keypoint predictor itself.…”
Section: Unsupervised Keypoint Localizationmentioning
confidence: 99%
“…One downside of general reconstruction methods is that they do not leverage the rigidity and kinematic constraints of articulated objects. To explicitly use such priors, [26] propose a method that processes a multi-view sequence of a moving object and discovers its parts as well as their respective reconstruction and kinematic structure in an unsupervised way. Going one step further, [36] learn a shape and appearance prior for each category which allows them to model accurate reconstructions of articulated objects with only 6 given views.…”
Section: Related Workmentioning
confidence: 99%
“…However, there are a wide variety of real-world objects that do not have a constant shape but can be articulated according to the object's underlying kinematics. There has been great progress in articulated object tracking [5,9,32,37] and reconstruction [10,26] from a sequence of observations. However, a sequence of observations is cumbersome since it often requires prior interaction with the environment.…”
Section: Introductionmentioning
confidence: 99%