Deep fisher faces

Hanselmann, Harald; Yan, Shen; Ney, Hermann

doi:10.5244/c.31.165

Cited by 8 publications

(17 citation statements)

References 10 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…To achieve these goals, we propose a Region Independence Loss which helps to reduce the overlap among attention maps and keep the consistency for different inputs. We apply BAP on the pooled feature map D otained in Section 3.2 to get a "semantic feature vector" : V ∈ R M ×N , and the Regional Independence Loss is defined as below by modifying the center loss in [15]:…”

Section: Regional Independence Loss For Attention Maps Regularizationmentioning

confidence: 99%

Multi-attentional Deepfake Detection

Zhao¹,

Zhou²,

Chen³

et al. 2021

Preprint

View full text Add to dashboard Cite

Face forgery by deepfake is widely spread over the internet and has raised severe societal concerns. Recently, how to detect such forgery contents has become a hot research topic and many deepfake detection methods have been proposed. Most of them model deepfake detection as a vanilla binary classification problem, i.e, first use a backbone network to extract a global feature and then feed it into a binary classifier (real/fake). But since the difference between the real and fake images in this task is often subtle and local, we argue this vanilla solution is not optimal. In this paper, we instead formulate deepfake detection as a fine-grained classification problem and propose a new multi-attentional deepfake detection network. Specifically, it consists of three key components: 1) multiple spatial attention heads to make the network attend to different local parts; 2) textural feature enhancement block to zoom in the subtle artifacts in shallow features; 3) aggregate the low-level textural feature and high-level semantic features guided by the attention maps. Moreover, to address the learning difficulty of this network, we further introduce a new regional independence loss and an attention guided data augmentation strategy. Through extensive experiments on different datasets, we demonstrate the superiority of our method over the vanilla binary classifier counterparts, and achieve state-of-the-art performance. The models will be released recently at https://github.com/yoctta/ multiple-attention.

show abstract

Section: Regional Independence Loss For Attention Maps Regularizationmentioning

confidence: 99%

Multi-attentional Deepfake Detection

Zhao¹,

Zhou²,

Chen³

et al. 2021

Preprint

View full text Add to dashboard Cite

show abstract

“…CNNs are therefore often used to train a discriminative embedding space in which face images can be compared efficiently and accurately. The embeddings are learned using specifically designed loss functions such as center loss [30], triplet loss [23] or DFF [10]. We insert such an embedding layer trained with a loss function based similar to [10] into the backbone CNN as penultimate layer.…”

Section: Introductionmentioning

confidence: 99%

“…The embeddings are learned using specifically designed loss functions such as center loss [30], triplet loss [23] or DFF [10]. We insert such an embedding layer trained with a loss function based similar to [10] into the backbone CNN as penultimate layer. We show that this greatly improves the performance of the softmax classifier.…”

Section: Introductionmentioning

confidence: 99%

ELoPE: Fine-Grained Visual Classification with Efficient Localization, Pooling and Embedding

Hanselmann¹,

Ney²

2019

Preprint

Self Cite

View full text Add to dashboard Cite

The task of fine-grained visual classification (FGVC) deals with classification problems that display a small interclass variance such as distinguishing between different bird species or car models. State-of-the-art approaches typically tackle this problem by integrating an elaborate attention mechanism or (part-) localization method into a standard convolutional neural network (CNN). Also in this work the aim is to enhance the performance of a backbone CNN such as ResNet by including three efficient and lightweight components specifically designed for FGVC. This is achieved by using global k-max pooling, a discriminative embedding layer trained by optimizing class means and an efficient bounding box estimator that only needs class labels for training. The resulting model achieves new best stateof-the-art recognition accuracies on the Stanford cars and FGVC-Aircraft datasets.

show abstract

“…There are many other alternative methods, including the range loss in [117], fisher face in [118], marginal loss in [120], sphere face in [121], etc. Each of these methods has its own uniqueness and advantages under certain setup.…”

Section: Generalized Feature Learningmentioning

confidence: 99%

“…The first way is to enhance the generalization and discriminative capability of representation model. Examples include range loss [117], fisher face [118], center invariant loss [119], marginal loss [120], sphere face [121], etc. The second way is to improve the estimation of partitions in the feature space.…”

Section: Introductionmentioning

confidence: 99%

Deep multi-factor forensic face recognition

Ding¹

View full text Add to dashboard Cite

Deep fisher faces

Cited by 8 publications

References 10 publications

Multi-attentional Deepfake Detection

Multi-attentional Deepfake Detection

ELoPE: Fine-Grained Visual Classification with Efficient Localization, Pooling and Embedding

Deep multi-factor forensic face recognition

Contact Info

Product

Resources

About