2018
DOI: 10.1016/j.specom.2017.10.004
|View full text |Cite
|
Sign up to set email alerts
|

Intonation modelling using a muscle model and perceptually weighted matching pursuit

Abstract: We propose a physiologically based intonation model using perceptual relevance. Motivated by speech synthesis from a speech-to-speech translation (S2ST) point of view, we aim at a language independent way of modelling intonation. The model presented in this paper can be seen as a generalisation of the command response (CR) model, albeit with the same modelling power. It is an additive model which decomposes intonation contours into a sum of critically damped system impulse responses. To decompose the intonatio… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2019
2019
2020
2020

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 32 publications
0
2
0
Order By: Relevance
“…This can be explained by our second hypothesis. Physiological filters are known to have asymmetric impulse responses [36]. This is one explanation for the large improvement arising from doubling up the Uni-GRU to explicitly modelling the backward recursion.…”
Section: Phoneme Recognition Performance On Timitmentioning
confidence: 99%
“…This can be explained by our second hypothesis. Physiological filters are known to have asymmetric impulse responses [36]. This is one explanation for the large improvement arising from doubling up the Uni-GRU to explicitly modelling the backward recursion.…”
Section: Phoneme Recognition Performance On Timitmentioning
confidence: 99%
“…This can be explained by our second hypothesis. Physiological filters are known to have asymmetric impulse responses [Honnet et al, 2018]. This is one explanation for the large improvement arising from doubling up the Uni-GRU to explicitly modelling the backward recursion.…”
Section: Phoneme Recognition Performance On Timitmentioning
confidence: 99%