ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022
DOI: 10.1109/icassp43922.2022.9746103
|View full text |Cite
|
Sign up to set email alerts
|

Neural Speech Synthesis on a Shoestring: Improving the Efficiency of Lpcnet

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 17 publications
0
1
0
Order By: Relevance
“…LPCNet [4] successfully harmonized the linear predictive coding (LPC) and WaveRNN [5] by training the WaveRNN module to predict the excitation of the speech, i.e., the residual of the linear prediction, rather than the raw audio samples. Consequently, the complexity of an LPCNet-based decoder is as low as around 3 GFLOPS with 30 MFLOPS for encoding [6,7] or lower [8]. Their low arithmetic and spatial complexity, however, come at the cost of suboptimal sound quality compared to the WaveNet decoder.…”
Section: Introductionmentioning
confidence: 99%
“…LPCNet [4] successfully harmonized the linear predictive coding (LPC) and WaveRNN [5] by training the WaveRNN module to predict the excitation of the speech, i.e., the residual of the linear prediction, rather than the raw audio samples. Consequently, the complexity of an LPCNet-based decoder is as low as around 3 GFLOPS with 30 MFLOPS for encoding [6,7] or lower [8]. Their low arithmetic and spatial complexity, however, come at the cost of suboptimal sound quality compared to the WaveNet decoder.…”
Section: Introductionmentioning
confidence: 99%