2023
DOI: 10.1109/access.2023.3276480
|View full text |Cite
|
Sign up to set email alerts
|

TIMIT-TTS: A Text-to-Speech Dataset for Multimodal Synthetic Media Detection

Abstract: With the rapid development of deep learning techniques, the generation and counterfeiting of multimedia material has become increasingly simple. Current technology enables the creation of videos where both the visual and audio contents are falsified. While the multimedia forensics community has begun to address this threat by developing fake media detectors. However, the vast majority existing forensic techniques only analyze one modality at a time. This is an important limitation when authenticating manipulat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
2

Relationship

2
5

Authors

Journals

citations
Cited by 12 publications
(5 citation statements)
references
References 55 publications
0
4
0
Order By: Relevance
“…TIMIT-TTS [ 6 ]. This is a speech dataset including only fake audio samples, generated starting from the VidTIMIT corpus.…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations
“…TIMIT-TTS [ 6 ]. This is a speech dataset including only fake audio samples, generated starting from the VidTIMIT corpus.…”
Section: Methodsmentioning
confidence: 99%
“…In the recent years, few multimodal datasets have been proposed, containing both counterfeited video and audio tracks. These are DFDC [ 42 ], FakeAVCeleb [ 43 ], and DeepfakeTIMIT [ 44 ] with TIMIT-TTS [ 6 ]. In the following sections, we provide further details on these datasets and test our proposed multimodal detector on them.…”
Section: Deepfake Detectionmentioning
confidence: 99%
See 2 more Smart Citations
“…On the other hand, the methods proposed in [6] and [7] perform synthetic speech detection by analyzing the emotional and prosodic content of speech. Also, numerous datasets have been presented in this field to increase the interest of the scientific community on the topic and push the research toward the development of new detection methods [8], [9].…”
Section: Introductionmentioning
confidence: 99%