10th ISCA Workshop on Speech Synthesis (SSW 10) 2019
DOI: 10.21437/ssw.2019-26
|View full text |Cite
|
Sign up to set email alerts
|

Loss Function Considering Temporal Sequence for Feed-Forward Neural Network–Fundamental Frequency Case

Abstract: This paper describes a novel loss function for training feedforward neural networks (FFNNs), which can generate smooth speech parameter sequences without post-processing. In statistical parametric speech synthesis based on deep neural networks (DNNs), maximum likelihood parameter generation (MLPG) or recurrent neural networks (RNNs) are generally used to generate smooth speech parameter sequences. However, because the MLPG process requires utterance-level processing, it is not suitable for speech synthesis req… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
2

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 7 publications
0
1
0
Order By: Relevance
“…In this paper, we propose a new training method that enables FFNNs to acquire parameters captured with a natural temporal structure for these sequences by Copyright c 2020 The Institute of Electronics, Information and Communication Engineers backpropagating the errors of multiple attributes from the temporal sequence through the loss function. As a preliminary study, the effectiveness of the proposed loss function in modeling the F0 sequence was investigated [11]. In this paper, the effectiveness of the proposed loss function in modeling the spectral feature sequence in addition to the F0 sequence is discussed.…”
Section: Introductionmentioning
confidence: 99%
“…In this paper, we propose a new training method that enables FFNNs to acquire parameters captured with a natural temporal structure for these sequences by Copyright c 2020 The Institute of Electronics, Information and Communication Engineers backpropagating the errors of multiple attributes from the temporal sequence through the loss function. As a preliminary study, the effectiveness of the proposed loss function in modeling the F0 sequence was investigated [11]. In this paper, the effectiveness of the proposed loss function in modeling the spectral feature sequence in addition to the F0 sequence is discussed.…”
Section: Introductionmentioning
confidence: 99%