2022
DOI: 10.48550/arxiv.2212.06972
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Disentangling Prosody Representations with Unsupervised Speech Reconstruction

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 0 publications
0
2
0
Order By: Relevance
“…6 in Appendix A.5). In addition, two concurrent methods for SSC (Qu et al, 2022; were proposed recently. Although showing impressive results, they are based on spectrogram representations, hence require an additional vocoding step.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…6 in Appendix A.5). In addition, two concurrent methods for SSC (Qu et al, 2022; were proposed recently. Although showing impressive results, they are based on spectrogram representations, hence require an additional vocoding step.…”
Section: Related Workmentioning
confidence: 99%
“…However, this limits the application to high resource languages and requires large scale data labelling. Some recent prosody aware VC methods use spectrogram representations as input and output (Qu et al, 2022;, rather than operating in the waveform domain. Thus, they involve another phase of converting from the spectral domain to the time domain using a vocoder.…”
Section: Introductionmentioning
confidence: 99%