2021
DOI: 10.48550/arxiv.2104.09995
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Review of end-to-end speech synthesis technology based on deep learning

Abstract: As an indispensable part of modern humancomputer interaction system, speech synthesis technology helps users get the output of intelligent machine more easily and intuitively, thus has attracted more and more attention. Due to the limitations of high complexity and low efficiency of traditional speech synthesis technology, the current research focus is the deep learning-based end-to-end speech synthesis technology, which has more powerful modeling ability and a simpler pipeline. It mainly consists of three mod… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
6
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(6 citation statements)
references
References 179 publications
(199 reference statements)
0
6
0
Order By: Relevance
“…GANs are revolutionizing music creation by tapping into existing compositions' patterns and structures [71]. This technology fosters original music composition and assists musicians in their creative journey.…”
Section: Music Generationmentioning
confidence: 99%
“…GANs are revolutionizing music creation by tapping into existing compositions' patterns and structures [71]. This technology fosters original music composition and assists musicians in their creative journey.…”
Section: Music Generationmentioning
confidence: 99%
“…Harshvardhan et al [76] instead covered deep generation as part of generation in machine learning and proposed future directions. In addition to surveys on general deep data generation, other surveys may focus on the deep data generation in specific domains including graph generation [77][78][79], image synthesis [80,81], text generation [82,83] and audio generation [84][85][86].…”
Section: Relationship With Existing Surveysmentioning
confidence: 99%
“…The widespread use of deep learning has significantly advanced the development of speech synthesis technology [1]. This innovation not only enables artificial intelligence technology to expand its application scope to encompass more audio synthesis scenarios and enhance natural language interaction experience through a more authentic and credible audio output, but also gives rise to numerous acclaimed applications on public platforms.…”
Section: Introductionmentioning
confidence: 99%