2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020
DOI: 10.1109/cvpr42600.2020.00509
|View full text |Cite
|
Sign up to set email alerts
|

Deep Spatial Gradient and Temporal Depth Learning for Face Anti-Spoofing

Abstract: Face anti-spoofing is critical to the security of face recognition systems. Depth supervised learning has been proven as one of the most effective methods for face antispoofing. Despite the great success, most previous works still formulate the problem as a single-frame multi-task one by simply augmenting the loss with depth, while neglecting the detailed fine-grained information and the interplay between facial depths and moving patterns. In contrast, we design a new approach to detect presentation attacks fr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
103
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
3
1

Relationship

2
6

Authors

Journals

citations
Cited by 199 publications
(104 citation statements)
references
References 45 publications
1
103
0
Order By: Relevance
“…All recorded subjects are Chinese people (21 males and 27 females). In terms of the age distribution, most of the subjects are within the range [20,30) and [50,60) years old. The youngest and eldest age is 23 and 62, respectively.…”
Section: Casia-surf 3dmask Datasetmentioning
confidence: 99%
“…All recorded subjects are Chinese people (21 males and 27 females). In terms of the age distribution, most of the subjects are within the range [20,30) and [50,60) years old. The youngest and eldest age is 23 and 62, respectively.…”
Section: Casia-surf 3dmask Datasetmentioning
confidence: 99%
“…In contrast, auxiliary depth supervised FAS methods [2,22] are introduced to learn more detailed information effectively. On the other hand, several video level CNN methods are presented to exploit the dynamic spatiotemporal [33,34,19] or rPPG [17,22,18,36,37,31] features for PAD. Despite achieving state-of-the-art performance, single-modal methods are easily influenced by unseen domain shift (e.g., cross ethnicity and cross attack types) and not robust for challenging cases (e.g., harsh environment and realistic attacks).…”
Section: Related Workmentioning
confidence: 99%
“…where H, W denote the height and width of the binary mask, respectively, and B pre and B gt mean the predicted grayscale mask and ground truth binary mask, respectively. Moreover, for the sake of fine-grained supervision needs in FAS task, contrastive depth loss (CDL) L CDL [33] is considered to help the networks learn more detailed features. CDL can be formulated as…”
Section: Supervisionmentioning
confidence: 99%
See 1 more Smart Citation
“…Under a simple live/spoof binary classification settings, resulting two representations separated by a classification boundary can have mixed information, such as subject ID. Several neural networks are proposed to disentangle live/spoof class signature from the unwanted information [48], [49], [50] Because most of the presentation attacks are conducted under visible-light domain, using multimodal input or non-visible modality (e.g., depth, rPPG, SWIR) are useful for increasing PAD performance [6], [51], [52], [53], [54], [55], [56], [57], [58], [59], [60], [61], [62].…”
Section: Dnnsmentioning
confidence: 99%