2007 15th International Conference on Digital Signal Processing 2007
DOI: 10.1109/icdsp.2007.4288657
|View full text |Cite
|
Sign up to set email alerts
|

Spectral Dynamics as a Source of Discontinuity in Concatenative Speech Synthesis

Abstract: The quality of concatenative speech synthesis depends on the cost function employed for unit selection. Effective cost functions for spectral continuity have proven difficult to define and standard measures do not accurately reflect human perception of spectral discontinuity in concatenated speech. Previous studies on spectral join costs have focused predominantly on static spectral measures extracted from the unit boundary. In this paper spectral dynamic behaviour is investigated as a source of discontinuity … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
1
0
1

Year Published

2012
2012
2012
2012

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 16 publications
0
1
0
1
Order By: Relevance
“…Audible spectral discontinuities in concatenated signals were researched in [4]. Signal components can change in a number of ways at the join; an abrupt termination of signal components, an abrupt onset of signal components and more subtle changes in signal components sustained across the join [5]. The synthesised speech can sound very natural if the discontinuities at the concatenation points are inaudible.…”
Section: Spectral Discontinuity In Concatenated Speechmentioning
confidence: 99%
“…Audible spectral discontinuities in concatenated signals were researched in [4]. Signal components can change in a number of ways at the join; an abrupt termination of signal components, an abrupt onset of signal components and more subtle changes in signal components sustained across the join [5]. The synthesised speech can sound very natural if the discontinuities at the concatenation points are inaudible.…”
Section: Spectral Discontinuity In Concatenated Speechmentioning
confidence: 99%
“…Οι παράμετροι που αποτελούν το διάνυσμα χαρακτηριστικών είναι ζήτημα του σχεδιαστή. Τα πιο συχνά χρησιμοποιούμενα χαρακτηριστικά είναι η θεμελιώδης συχνότητα (pitch) και κάποια φασματική αναπαράσταση ενώ πολλές φορές συμπεριλαμβάνονται μεγέθη όπως η ένταση (intensity) και η διάρκεια καθώς και οι πρώτες και δεύτερες παράγωγοι όλων των προηγούμενων μεγεθών [Hunt, 1996;Fraser, 2007;Plumpe, 1998;Blouin, 2002;Hirchfeld, 2000;Kirkpatrick, 2007]. Κάθε χαρακτηριστικό συνοδεύεται και από μια μετρική η οποία παίζει το ρόλο τους επιμέρους κόστους.…”
Section: η συνάρτηση κόστους ένωσης (Concatenation ή Join Cost)unclassified