ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2021
DOI: 10.1109/icassp39728.2021.9414943
|View full text |Cite
|
Sign up to set email alerts
|

The Huya Multi-Speaker and Multi-Style Speech Synthesis System for M2voc Challenge 2020

Abstract: Text-to-speech systems now can generate speech that is hard to distinguish from human speech. In this paper, we propose the Huya multi-speaker and multi-style speech synthesis system which is based on DurIAN and HiFi-GAN to generate high-fidelity speech even under low-resource condition. We use the fine-grained linguistic representation which leverages the similarity in pronunciation between different languages and promotes the speech quality of code-switch speech synthesis. Our TTS system uses the HiFi-GAN as… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
2

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 10 publications
0
1
0
Order By: Relevance
“…For the joint optimization solution, the straightforward method is speaker look-up table, in which each speaker code is encoded in a trainable embedding table, jointly trained with acoustic model via backpropagation. T06 and T24 [39] have used this solution in their systems.…”
Section: Speaker and Style Modelingmentioning
confidence: 99%
“…For the joint optimization solution, the straightforward method is speaker look-up table, in which each speaker code is encoded in a trainable embedding table, jointly trained with acoustic model via backpropagation. T06 and T24 [39] have used this solution in their systems.…”
Section: Speaker and Style Modelingmentioning
confidence: 99%