2020
DOI: 10.48550/arxiv.2002.10708
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Controllable Sequence-To-Sequence Neural TTS with LPCNET Backend for Real-time Speech Synthesis on CPU

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
2
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 0 publications
0
2
0
Order By: Relevance
“…When precise duration labels are unavailable or sparse in the training data, the proposed model can be trained explicitly with duration labels or unsupervised or semi-supervised using a fine-grained variational auto-encoder . When trained and wholly supervised, the proposed model slightly beats Tacotron 2 on naturalness [13]. The suggested model outperforms Tacotron 2 on naturalness with unsupervised or semi-supervised duration modelling while still much more resilient on over-generation and equivalent on under-generation.…”
mentioning
confidence: 85%
“…When precise duration labels are unavailable or sparse in the training data, the proposed model can be trained explicitly with duration labels or unsupervised or semi-supervised using a fine-grained variational auto-encoder . When trained and wholly supervised, the proposed model slightly beats Tacotron 2 on naturalness [13]. The suggested model outperforms Tacotron 2 on naturalness with unsupervised or semi-supervised duration modelling while still much more resilient on over-generation and equivalent on under-generation.…”
mentioning
confidence: 85%
“…Those that require control use additional inputs to the networks as in Fig. 1a [4,5,2,6,7]. Those that offer control (but do not require it) aim at controlling latent variables while ensuring that they are interpretable [8,9,10,11], or disentangle one interpretable control variable (often the duration of the utterance) and make it controllable as in Fig.…”
Section: Related Workmentioning
confidence: 99%