2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020
DOI: 10.1109/cvpr42600.2020.00791
|View full text |Cite
|
Sign up to set email alerts
|

Watch Your Up-Convolution: CNN Based Generative Deep Neural Networks Are Failing to Reproduce Spectral Distributions

Abstract: Generative convolutional deep neural networks, e.g. popular GAN architectures, are relying on convolution based up-sampling methods to produce non-scalar outputs like images or video sequences. In this paper, we show that common up-sampling methods, i.e. known as upconvolution or transposed convolution, are causing the inability of such models to reproduce spectral distributions of natural training data correctly. This effect is independent of the underlying architecture and we show that it can be used to easi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
271
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
4
3
1
1

Relationship

1
8

Authors

Journals

citations
Cited by 282 publications
(273 citation statements)
references
References 42 publications
2
271
0
Order By: Relevance
“…As color difference can be easily fixed, more sophisticated detection methods, e.g, [10,16], seek more abstract signal-level traces or fingerprints in the noise residuals to differentiate GAN-synthesized faces. More recent works such as [17,18,19] extend the analysis to the frequency domain, where the upsampling step in the GAN generation leaves specific artifacts. The second category of GAN synthesized face detection methods are of data-driven nature [20,21,22,23,24], where a deep neural network model is trained and employed to classify real and GAN-synthesized faces.…”
Section: Related Workmentioning
confidence: 99%
“…As color difference can be easily fixed, more sophisticated detection methods, e.g, [10,16], seek more abstract signal-level traces or fingerprints in the noise residuals to differentiate GAN-synthesized faces. More recent works such as [17,18,19] extend the analysis to the frequency domain, where the upsampling step in the GAN generation leaves specific artifacts. The second category of GAN synthesized face detection methods are of data-driven nature [20,21,22,23,24], where a deep neural network model is trained and employed to classify real and GAN-synthesized faces.…”
Section: Related Workmentioning
confidence: 99%
“…GANs have been significantly improved in recent years and are able to synthesize high fidelity images fooling human eyes. However, there persist some problems in GANs revealing the differences between generated distributions and real ones because of commonly-used up-convolution (or deconvolution) operation [Durall et al, 2020], which maps lowresolution tensors to high-resolution ones. Yet these problems are never long-lasting compared to the steady improvement of GANs.…”
Section: Low-level Artifacts Produced By Gansmentioning
confidence: 99%
“…A CNN classifier is typically trained on the extracted features to perform binary classification for fake detection. However, recent work has shown that such low-level features are rather easy to recognize [Wang et al, 2020] and can be effectively concealed [Durall et al, 2020;Jung and Keuper, 2021]. Due to the steady improvement of generative models and the constantly narrowing gap between real and fake images, this appears not to yield in a reliable and sustainable approach to distinguish real and fake images.…”
Section: Introductionmentioning
confidence: 99%
“…Dataset. FaceForensics++ (FF++) [6] is a recently released benchmark dataset and widely used for performance evaluation of face forgery detection [12,21,22]. The dataset consists of 1,000 original videos and their manipulated counterparts created by four typical manipulation methods: Deep-Fakes (DF), Face2Face (F2F), FaceSwap (FS), NeuralTexures (NT).…”
Section: Experimental Settingmentioning
confidence: 99%