The Speaker and Language Recognition Workshop (Odyssey 2018) 2018
DOI: 10.21437/odyssey.2018-36
|View full text |Cite
|
Sign up to set email alerts
|

Convolutional Neural Network Based Speaker De-Identification

Abstract: Concealing speaker identity in speech signals refers to the task of speaker de-identification, which helps protect the privacy of a speaker. Although, both linguistic and paralinguistic features reveal personal information of a speaker and they both need to be addressed, in this study we only focus on speaker voice characteristics. In other words, our goal is to move away from the source speaker identity while preserving naturalness and quality. The proposed speaker de-identification system maps voice of a giv… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
29
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 33 publications
(29 citation statements)
references
References 23 publications
0
29
0
Order By: Relevance
“…The anonymization of speaker's identity can be performed with various methods [1,3,7,12,15]. In this article, our contributions are based on the Baseline-1 (referred to as the baseline in this article) of the VoicePrivacy challenge that anonymizes speech using x-vectors and neural waveform models [3].…”
Section: Anonymization Techniquementioning
confidence: 99%
“…The anonymization of speaker's identity can be performed with various methods [1,3,7,12,15]. In this article, our contributions are based on the Baseline-1 (referred to as the baseline in this article) of the VoicePrivacy challenge that anonymizes speech using x-vectors and neural waveform models [3].…”
Section: Anonymization Techniquementioning
confidence: 99%
“…For speech, several classic approaches for speaker anonymization are explored in [31]- [34], while a neural-network-based encoder-decoder approach has been proposed in [35].…”
Section: Cyber Worldmentioning
confidence: 99%
“…For logical anonymization, Jin et al [11] presented a voice transformation (VT) system to change the speaker identity into another special speaker. Similarly, Bahmaninezhad et al [12] utilized a convolutional neural network (CNN) as a VT function and averaged different transformation results as a means to anonymize speech. Magarienos et al [13] and Pobar and Ipšić [14] improved the convenience of the VT-based method to enable the user to select an approximate transformation from a pool of pre-trained VT models for speaker anonymization.…”
Section: Related Workmentioning
confidence: 99%