Spectral Dynamics as a Source of Discontinuity in Concatenative Speech Synthesis

Kirkpatrick, Barry; O’Brien, Darragh; Scaife, Ronarn; Errity, Andrew

doi:10.1109/icdsp.2007.4288657

Cited by 2 publications

(2 citation statements)

References 16 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…Audible spectral discontinuities in concatenated signals were researched in [4]. Signal components can change in a number of ways at the join; an abrupt termination of signal components, an abrupt onset of signal components and more subtle changes in signal components sustained across the join [5]. The synthesised speech can sound very natural if the discontinuities at the concatenation points are inaudible.…”

Section: Spectral Discontinuity In Concatenated Speechmentioning

confidence: 99%

Removal of Spectral Discontinuity in Concatenated Speech Waveform

Singh¹,

Singh²

2012

IJCA

View full text Add to dashboard Cite

Speech synthesis systems which involve concatenation of recorded speech units are currently very popular. These systems are known for producing high quality, naturalsounding speech as they generate speech by joining together waveforms of different speech units. This method of speech generation is quite practical. However the speech units that are being concatenated may have different spectra on either side of the concatenation points. Such mismatches are spectral in nature and give rise to spectral discontinuity in concatenated speech waveforms. The presence of such discontinuities can be very distracting to the listener and degrade the overall quality of output speech. This paper proposes a speech signal processing technique that deals with the problem of spectral discontinuity in the context of concatenated waveform synthesis. It involves the post-processing of the synthesized speech waveform in time domain. This technique is implemented on different single channel Punjabi wave audio files which were created by concatenating different Punjabi syllables. A listening test was conducted to evaluate the proposed technique, and it was observed that the spectral discontinuity is reduced to a large extent and the output speech sounds more natural with the reduction of audible noise.

show abstract

Section: Spectral Discontinuity In Concatenated Speechmentioning

confidence: 99%

Removal of Spectral Discontinuity in Concatenated Speech Waveform

Singh¹,

Singh²

2012

IJCA

View full text Add to dashboard Cite

show abstract

“…Οι παράμετροι που αποτελούν το διάνυσμα χαρακτηριστικών είναι ζήτημα του σχεδιαστή. Τα πιο συχνά χρησιμοποιούμενα χαρακτηριστικά είναι η θεμελιώδης συχνότητα (pitch) και κάποια φασματική αναπαράσταση ενώ πολλές φορές συμπεριλαμβάνονται μεγέθη όπως η ένταση (intensity) και η διάρκεια καθώς και οι πρώτες και δεύτερες παράγωγοι όλων των προηγούμενων μεγεθών [Hunt, 1996;Fraser, 2007;Plumpe, 1998;Blouin, 2002;Hirchfeld, 2000;Kirkpatrick, 2007]. Κάθε χαρακτηριστικό συνοδεύεται και από μια μετρική η οποία παίζει το ρόλο τους επιμέρους κόστους.…”

Section: η συνάρτηση κόστους ένωσης (Concatenation ή Join Cost)unclassified

Βελτίωση Της Ποιότητας Συνθετικής Φωνής Και Εφαρμογή Σε Σύγχρονα Τηλεπικοινωνιακά Περιβάλλοντα Και Υπηρεσίες

Karabetsos¹,

Καραμπέτσος²

View full text Add to dashboard Cite

show abstract

Spectral Dynamics as a Source of Discontinuity in Concatenative Speech Synthesis

Cited by 2 publications

References 16 publications

Removal of Spectral Discontinuity in Concatenated Speech Waveform

Removal of Spectral Discontinuity in Concatenated Speech Waveform

Βελτίωση Της Ποιότητας Συνθετικής Φωνής Και Εφαρμογή Σε Σύγχρονα Τηλεπικοινωνιακά Περιβάλλοντα Και Υπηρεσίες

Contact Info

Product

Resources

About