Texture-Generic Deep Shape-From-Template

Fuentes-Jimenez, David; Pizarro, Daniel; Casillas-Pérez, David; Collins, Toby; Bartoli, Adrien

doi:10.1109/access.2021.3082011

Cited by 13 publications

(7 citation statements)

References 59 publications


“…A recent real‐time SfT approach by Fuentes‐Jimenez et al. [FJPCP*21], i.e. RRNet‐DCT, relies on deep neural networks.…”

Section: State‐of‐the‐art Methods (mentioning)

confidence: 99%

“…Its architecture has two neural networks: A segmentation module for pixel‐based detection of the template and a registration‐reconstruction module to perform SfT. RRNet‐DCT is texture‐agnostic as it adapts to new texture maps at runtime compared to the authors' earlier texture‐specific method, DeepSfT [FJPCP*21]. Being an object‐specific method that encodes the template into the neural network weights, it is highly accurate, unlike earlier object‐generic methods such as IsMo‐GAN [SGTS19].…”

Section: State‐of‐the‐art Methods (mentioning)

confidence: 99%

“…The template often corresponds to the first frame of the sequence though it is not always a strict requirement. The template can be used as the initial state of a physics simulator [KTE*22], to obtain 3D‐2D registration as a basis for reconstruction [CPPFJ*21], and to encode prior knowledge in neural network weights [SGTS19, FJPCP*21].…”

Section: State‐of‐the‐art Methods (mentioning)

confidence: 99%


State of the Art in Dense Monocular Non‐Rigid 3D Reconstruction

Tretschk, Navami, Mallikarjun, et al. 2023

Computer Graphics Forum


3D reconstruction of deformable (or non‐rigid) scenes from a set of monocular 2D image observations is a long‐standing and actively researched area of computer vision and graphics. It is an ill‐posed inverse problem, since—without additional prior assumptions—it permits infinitely many solutions leading to accurate projection to the input 2D images. Non‐rigid reconstruction is a foundational building block for downstream applications like robotics, AR/VR, or visual content creation. The key advantage of using monocular cameras is their omnipresence and availability to the end users as well as their ease of use compared to more sophisticated camera set‐ups such as stereo or multi‐view systems. This survey focuses on state‐of‐the‐art methods for dense non‐rigid 3D reconstruction of various deformable objects and composite scenes from monocular videos or sets of monocular views. It reviews the fundamentals of 3D reconstruction and deformation modeling from 2D image observations. We then start from general methods—that handle arbitrary scenes and make only a few prior assumptions—and proceed towards techniques making stronger assumptions about the observed objects and types of deformations (e.g. human faces, bodies, hands, and animals). A significant part of this STAR is also devoted to classification and a high‐level comparison of the methods, as well as an overview of the datasets for training and evaluation of the discussed techniques. We conclude by discussing open challenges in the field and the social aspects associated with the usage of the reviewed methods.


“…These methods minimise the 3D-2D reprojection error and impose geometric constraints such as surface inextensibility [40,45] or isometry [5,34,61]. Recent neural SfT methods [13,18,41,47] predict 3D surfaces from monocular images relying on datasets with different template states. Our φ-SfT contrasts with other SfT methods in that it uses temporal information and a differentiable physics simulator as a regulariser for high-fidelity 3D surface tracking instead of approximating the underlying physical properties via geometric constraints.…”

Section: Related Work (mentioning)

confidence: 99%

“…The objective of SfT is: Given a known initial 3D state (a template) of an observed deformable scene or an object, reconstruct all its 3D states observed in the entire image sequence [45]. Recent learning-based SfT methods encode prior knowledge in neural network weights [13,47]. This offers multiple advantages over a vast body of previ…”

Section: Introduction (mentioning)

confidence: 99%

ϕ-SfT: Shape-from-Template with a Physics-Based Deformation Model

Navami, Tretschk, Elgharib, et al. 2022

Preprint


Shape-from-Template (SfT) methods estimate 3D surface deformations from a single monocular RGB camera while assuming a 3D state known in advance (a template). This is an important yet challenging problem due to the under-constrained nature of the monocular setting. Existing SfT techniques predominantly use geometric and simplified deformation models, which often limits their reconstruction abilities. In contrast to previous works, this paper proposes a new SfT approach explaining 2D observations through physical simulations accounting for forces and material properties. Our differentiable physics simulator regularises the surface evolution and optimises the material elastic properties such as bending coefficients, stretching stiffness and density. We use a differentiable renderer to minimise the dense reprojection error between the estimated 3D states and the input images and recover the deformation parameters using an adaptive gradient-based optimisation. For the evaluation, we record with an RGB-D camera challenging real surfaces exposed to physical forces with various material properties and textures. Our approach significantly reduces the 3D reconstruction error compared to multiple competing methods. For the source code and data, see https://4dqv.mpi-inf.mpg.de/phi-SfT/.


Shape-from-Template with Camera Focal Length Estimation

Collins, Bartoli 2023

Springer INdAM Series
