2016
DOI: 10.1007/978-3-319-46484-8_15
|View full text |Cite
|
Sign up to set email alerts
|

Real-Time Facial Segmentation and Performance Capture from RGB Input

Abstract: We introduce the concept of unconstrained real-time 3D facial performance capture through explicit semantic segmentation in the RGB input. To ensure robustness, cutting edge supervised learning approaches rely on large training datasets of face images captured in the wild. While impressive tracking quality has been demonstrated for faces that are largely visible, any occlusion due to hair, accessories, or hand-toface gestures would result in significant visual artifacts and loss of tracking accuracy. The model… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
100
0

Year Published

2017
2017
2021
2021

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 108 publications
(102 citation statements)
references
References 63 publications
0
100
0
Order By: Relevance
“…Image taken from [HMYL15]. Figure 8: An RGB face tracker that is robust to occlusion has been proposed by [SLL16]. Using deep learning a clean segmentation of all visible parts of the face is obtained.…”
Section: Handling Occlusionsmentioning
confidence: 99%
See 1 more Smart Citation
“…Image taken from [HMYL15]. Figure 8: An RGB face tracker that is robust to occlusion has been proposed by [SLL16]. Using deep learning a clean segmentation of all visible parts of the face is obtained.…”
Section: Handling Occlusionsmentioning
confidence: 99%
“…Using deep learning a clean segmentation of all visible parts of the face is obtained. Image taken from [SLL16].…”
Section: Handling Occlusionsmentioning
confidence: 99%
“…In terms of 3D facial geometry reconstruction for the refinement of landmarks, recently there has been an increasing amount of research based on 2D images and videos [19,[35][36][37][38][39][40][41]. In order to accurately track facial landmarks, it is important to first reconstruct face geometry.…”
Section: Literature Reviewmentioning
confidence: 99%
“…Robust methods such as Refs. [35,36,39] can track facial performance in the presence of noise but often miss subtle details such as small eyelid and mouth movements, which are important in conveying the target's emotion and to generate convincing animation. Although we use a 3D optical flow approach similar to that in Ref.…”
Section: Literature Reviewmentioning
confidence: 99%
“…An adaptive scheme was proposed to capture more detail with point-to-point deformation on top of blendshapes in [22]. To explicitly deal with outliers caused by occlusions, a method was proposed to segment the face and complete the occluded parts based on the blendshape in [19], which was later extended to RGB input in [28]. Binocular stereo system, on the other hand, can provide higher resolution and work in outdoor environments directly under sunlight, but are more prone to suffer from lighting variation.…”
Section: Expression Clusteringmentioning
confidence: 99%