2023
DOI: 10.1186/s13636-023-00312-8
|View full text |Cite
|
Sign up to set email alerts
|

W2VC: WavLM representation based one-shot voice conversion with gradient reversal distillation and CTC supervision

Hao Huang,
Lin Wang,
Jichen Yang
et al.

Abstract: Non-parallel data voice conversion (VC) has achieved considerable breakthroughs due to self-supervised pre-trained representation (SSPR) being used in recent years. Features extracted by the pre-trained model are expected to contain more content information. However, in common VC with SSPR, there is no special implementation to remove speaker information in the content representation extraction by SSPR, which prevents further purification of the speaker information from SSPR representation. Moreover, in conven… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 34 publications
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?