ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2024
DOI: 10.1109/icassp48485.2024.10445932
|View full text |Cite
|
Sign up to set email alerts
|

Audio Transformer for Synthetic Speech Detection via Formant Magnitude and Phase Analysis

Luca Cuccovillo,
Milica Gerhardt,
Patrick Aichroth

Abstract: This paper introduces a novel multi-task transformer for synthetic speech detection. The network encodes magnitude and phase of the input speech with a feature bottleneck, used to autoencode the input magnitude, to predict the trajectory of the fundamental frequency (f0), and to discern if the input speech is synthetic or natural. The approach achieves state-ofthe-art performance on the ASVspoof 2019 LA dataset while still retaining interpretability, with an AUC score of 0.910.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
references
References 19 publications
(22 reference statements)
0
0
0
Order By: Relevance