2021
DOI: 10.48550/arxiv.2102.09914
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Alternate Endings: Improving Prosody for Incremental Neural TTS with Predicted Future Text Input

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 0 publications
0
1
0
Order By: Relevance
“…Streaming TTS [74,217,316,399,317,231] synthesizes speech once some input tokens are comming, without waiting for the whole input sentence, which can also speed up inference. FFTNet [142] uses a simple architecture to mimic the Fast Fourier Transform (FFT), which can generate audio samples in real-time.…”
Section: Adaptivementioning
confidence: 99%
“…Streaming TTS [74,217,316,399,317,231] synthesizes speech once some input tokens are comming, without waiting for the whole input sentence, which can also speed up inference. FFTNet [142] uses a simple architecture to mimic the Fast Fourier Transform (FFT), which can generate audio samples in real-time.…”
Section: Adaptivementioning
confidence: 99%