Proceedings of the 29th ACM International Conference on Multimedia 2021
DOI: 10.1145/3474085.3475332
|View full text |Cite
|
Sign up to set email alerts
|

Video Transformer for Deepfake Detection with Incremental Learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
15
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 42 publications
(15 citation statements)
references
References 18 publications
0
15
0
Order By: Relevance
“…The community lacks an end-to-end vision-transformer framework designed explicitly for deepfake detection tasks. For example, Khan and Dai [46] presented a video transformer with an incremental learning approach for deepfake detection. Their design benefits from XceptionNet [47] as a backbone for image feature extraction and 12 transformer blocks for feature learning.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The community lacks an end-to-end vision-transformer framework designed explicitly for deepfake detection tasks. For example, Khan and Dai [46] presented a video transformer with an incremental learning approach for deepfake detection. Their design benefits from XceptionNet [47] as a backbone for image feature extraction and 12 transformer blocks for feature learning.…”
Section: Related Workmentioning
confidence: 99%
“…This figure also reveals another critical point: although most deepfake detection approaches perform well on relatively more straightforward datasets, i.e., FaceForensics++, their performance is still far from perfect on more challenging and real-world datasets, i.e., WildDeepfake. [46] 99.64 F3-Net [73] 65.1 ADD-Xception [18] 79.23 RNN [74] 83.10 PPA [75] 83.1 DefakeHop [6] 90.5 FakeCatcher [15] 91.5 ATS-DE [7] 97.8 ADD-ResNet [18] 98. MesoNet-4 [34] ADDNet-3D [75] MesoNet-inception [34] XceptionNet [65] ADDNet-2D [75] ADD-Xception [55] DFDT (Ours) ACC (%)…”
Section: Intra-dataset Evaluationmentioning
confidence: 99%
“…Deepfake detection is a challenging task and a lot of studies have been proposed in the past to detect deepfake media employing diverse set of features for training deep learning models. Some examples include, biological signals, behavioural features, 3D face decomposition features, and optical flow [117], [118], [119], [120], [121], [122], [123], [109], birds [110], buses [109], indoor and outdoor scenes [109].…”
Section: Deepfake Detectionmentioning
confidence: 99%
“…Several works leveraging transformers to boost face forgery detection have been done: Miao et al [18] extend transformer using bag-of-feature to learn local forgery features. Khan et al [8] propose a video transformer to extract spatial features with the temporal information for detecting forgery. Zheng et al [35] design a light-weight temporal transformer after their proposed fully temporal convolution network to explore the temporal coherence for general manipulated video detection.…”
Section: Transformermentioning
confidence: 99%