2023
DOI: 10.1007/978-3-031-37742-6_21
|View full text |Cite
|
Sign up to set email alerts
|

Combining Automatic Speaker Verification and Prosody Analysis for Synthetic Speech Detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
0
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
3
2

Relationship

1
4

Authors

Journals

citations
Cited by 8 publications
(2 citation statements)
references
References 38 publications
0
0
0
Order By: Relevance
“…The authors of [42] exploit the lack of emotional content in synthetic voices generated via TTS techniques to recognize them. Finally, in [43] ASV and prosody features are combined to perform synthetic speech detection.…”
Section: Speech Deepfake Detection Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…The authors of [42] exploit the lack of emotional content in synthetic voices generated via TTS techniques to recognize them. Finally, in [43] ASV and prosody features are combined to perform synthetic speech detection.…”
Section: Speech Deepfake Detection Methodsmentioning
confidence: 99%
“…Although silence is a fundamental component of speech, this is often overlooked in data generation, leading to biased tracks that are easy to discriminate [67]. This is a common problem, especially when dealing with TTS algorithms, where the prosodic component is less present [43] and the duration of the silences is shorter. Table 3 shows the silence durations of our tracks for both the original and the DTW cases.…”
Section: A Timit-tts Statisticsmentioning
confidence: 99%