Non-Rigid Structure Estimation in Trajectory Space from Monocular Vision

Wang, Yaming; Tong, Lingling; Jiang, Mingfeng; Zheng, Junbao

doi:10.3390/s151025730

Cited by 4 publications

(4 citation statements)

References 21 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Furthermore, the performance of our marker-less algorithm is illustrated on the Human3.6M. We compare our approach, which is denoted as TUNS (temporal union of nonlinear subspaces) in the remainder of this section, against six NRSfM baselines: point tracking algorithm PTA [ 5 ], the trajectory-sapce method CSF [ 6 ], the block matrix method BMM [ 8 ], the temporal union of subspaces TUS [ 31 ], the accelerated proximal gradient optimization APG [ 9 ] and the consensus NRSfM of CNR [ 14 ]. For PTA [ 5 ], CSF [ 6 ], BMM [ 8 ], CNR [ 14 ], we use authors’ implementation in experiments.…”

Section: Methodsmentioning

confidence: 99%

“…For PTA [ 5 ] and CSF [ 6 ], we manually set the rank of the subspace to the value yielding the best results. For TUS [ 31 ] and APG [ 9 ], since there are not publicly available implementations, our re-implementations are adopted in comparison. We test such re-implementations and get similar results to what the authors reported in [ 9 , 31 ].…”

Section: Methodsmentioning

confidence: 99%

“…Consequently, it is less effective when recovering the complex motion. APG [ 9 ] inherently uses the single subspace, whereas it performs better than BMM [ 8 ] since a more effective rank-minimisation technique is employed. The part-based method CNR [ 14 ], which is more adept at handling complex shape configurations, yields better reconstruction of subject 86 than the subject 56, since sequences of subject 86 have more points than subject 56.…”

Section: Methodsmentioning

confidence: 99%

“…Based on this, a prior-free method [ 8 ] was introduced to estimate the 3D non-rigid structures and camera rotations by only exploiting the low-rank shape assumption. In Wang et al [ 9 ], they use the low-rank assumption in a similar way, but an Accelerated Proximal Gradient (APG) algorithm solver is employed to solve the resulting problem. Furthermore, the method in Gotardo and Martinez [ 10 ] combined the shape basis model and trajectory basis model, and revealed trajectories of the shape basis coefficients.…”

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Capturing Complex 3D Human Motions with Kernelized Low-Rank Representation from Monocular RGB Camera

Wang

Chen

2017

Sensors

View full text Add to dashboard Cite

Recovering 3D structures from the monocular image sequence is an inherently ambiguous problem that has attracted considerable attention from several research communities. To resolve the ambiguities, a variety of additional priors, such as low-rank shape basis, have been proposed. In this paper, we make two contributions. First, we introduce an assumption that 3D structures lie on the union of nonlinear subspaces. Based on this assumption, we propose a Non-Rigid Structure from Motion (NRSfM) method with kernelized low-rank representation. To be specific, we utilize the soft-inextensibility constraint to accurately recover 3D human motions. Second, we extend this NRSfM method to the marker-less 3D human pose estimation problem by combining with Convolutional Neural Network (CNN) based 2D human joint detectors. To evaluate the performance of our methods, we apply our marker-based method on several sequences from Utrecht Multi-Person Motion (UMPM) benchmark and CMU MoCap datasets, and then apply the marker-less method on the Human3.6M datasets. The experiments demonstrate that the kernelized low-rank representation is more suitable for modeling the complex deformation and the method consequently yields more accurate reconstructions. Benefiting from the CNN-based detector, the marker-less approach can be applied to more real-life applications.

show abstract