ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2021
DOI: 10.1109/icassp39728.2021.9414225
|View full text |Cite
|
Sign up to set email alerts
|

Attentive Semantic Exploring for Manipulated Face Detection

Abstract: Face manipulation methods develop rapidly in recent years, whose potential risk to society accounts for the emerging of researches on detection methods. However, due to the diversity of manipulation methods and the high quality of fake images, detection methods suffer from a lack of generalization ability. To solve the problem, we find that segmenting images into semantic fragments could be effective, as discriminative defects and distortions are closely related to such fragments. Besides, to highlight discrim… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
11
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 19 publications
(11 citation statements)
references
References 26 publications
0
11
0
Order By: Relevance
“…Future work will try to further improve the generalization and robustness capability of the detector by performing fusion with other detectors resorting to semantic analysis, that is, looking at other semantic facial attributes [e.g., the mouth (Suwajanakorn et al, 2017;Haliassos et al, 2021), or the nose (Chen and Yang, 2021)]. The existence of inconsistencies in symmetries that might come from other facial attributes, e.g., eyebrows or mouths shapes, is also worth investigation.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Future work will try to further improve the generalization and robustness capability of the detector by performing fusion with other detectors resorting to semantic analysis, that is, looking at other semantic facial attributes [e.g., the mouth (Suwajanakorn et al, 2017;Haliassos et al, 2021), or the nose (Chen and Yang, 2021)]. The existence of inconsistencies in symmetries that might come from other facial attributes, e.g., eyebrows or mouths shapes, is also worth investigation.…”
Section: Discussionmentioning
confidence: 99%
“…In Liu et al (2020), the authors presented a new CNN-based detector, called Gram-Net, that leverages global image texture representations to improve the generalization and the robustness of GAN image detection. An approach that relies on semantic segmentation and perform detection based on multiple semantic fragments getting remarkable generalization capability has been recently presented in Chen and Yang (2021). A different solution to improve generalization is proposed in Xuan et al (2019): the idea is to carry out augmentation by Gaussian blurring so as to force the discriminator to learn more general features.…”
Section: Gan Face Detectionmentioning
confidence: 99%
“…Multi-task learning could also be used for classifying and locating the manipulated facial images (Nguyen et al, 2019a). Formulating fake forensics as a segmentation task to localize the manipulated region in synthesized faces is another interesting idea in fighting DeepFakes (Li et al, 2019a;Chen and Yang, 2020). Combing deep learning and co-occurrence matrices could also be used for the detection, attribution, and localization of GAN images (Goebel et al, 2020).…”
Section: Detection and Localizationmentioning
confidence: 99%
“…• LAE [39] utilizes an encoder-decoder network for simultaneously predicting real and fake labels as well as locating the manipulated regions. We do not compare our work with Face X-ray [41] since it needs external data collection and annotation for training their network; we do compare our work with [22], [60] since they conduct additional inference task other than real/fake prediction and manipulation mask segmentation, and therefore bring additional computation cost and increase network complexity. For example, [22] brings a facial expression recognition module into their architecture to recognize facial expressions, [60] conducts a multilevel facial semantic segmentation in their architecture.…”
Section: B Implementation Details and Evaluation Settingsmentioning
confidence: 99%