2022
DOI: 10.3390/app12062953
|View full text |Cite
|
Sign up to set email alerts
|

DFDT: An End-to-End DeepFake Detection Framework Using Vision Transformer

Abstract: The ever-growing threat of deepfakes and large-scale societal implications has propelled the development of deepfake forensics to ascertain the trustworthiness of digital media. A common theme of existing detection methods is using Convolutional Neural Networks (CNNs) as a backbone. While CNNs have demonstrated decent performance on learning local discriminative information, they fail to learn relative spatial features and lose important information due to constrained receptive fields. Motivated by the aforeme… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
18
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
2
1

Relationship

0
10

Authors

Journals

citations
Cited by 33 publications
(19 citation statements)
references
References 96 publications
1
18
0
Order By: Relevance
“…Test metrics for our best validated model on the Celeb-DFv2 test set, compared to other works. The AUC-ROC metric of prior works were aggregated and compiled in the recent work of 34 .…”
Section: Modelmentioning
confidence: 99%
“…Test metrics for our best validated model on the Celeb-DFv2 test set, compared to other works. The AUC-ROC metric of prior works were aggregated and compiled in the recent work of 34 .…”
Section: Modelmentioning
confidence: 99%
“…This progress has encouraged researchers to develop novel deep neural network models for image forensics. Recent works based on CNNs [17][18][19][20], recurrent neural networks [4,21], and transformers [22][23][24] perform much better for Forgery detection than traditional methods.…”
Section: Rel Ated Workmentioning
confidence: 99%
“…Transformers are highly well-known deep learning models, and they are now used in the field of deepfake generation and detection. The following transformer models can be used to generate and recognize deepfakes: EfficentNet+ViT [57], M2TR [63], CViT [65], ViT [100], ViT+Distillation [64], Video Transformer [101].…”
Section: Transformer-based Modelsmentioning
confidence: 99%