2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03).
DOI: 10.1109/icassp.2003.1198877

High quality time-scale modification of speech using a peak alignment overlap-add algorithm (PAOLA)

Abstract: The duration of a speech passage can be altered using audio time-scale modification techniques. Time-scale modification can be achieved in the time domain by segmenting the input signal into overlapping frames and recombining the frames with an overlap differing from the analysis overlap. We present a time-scale modification algorithm that uses a simple peak alignment technique to synchronize overlapping synthesis frames. The peak alignment overlap-add (PAOLA) algorithm also takes advantage of waveform propert…
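The time-domain idea in the abstract (cut the input into overlapping frames, then recombine them with a different overlap) can be illustrated with a minimal sketch. This is plain overlap-add only, not the PAOLA algorithm: it omits the peak alignment step that the paper adds to synchronize frames, so it will produce audible artifacts on real speech. The function name `ola_time_stretch` and the parameters `rate`, `frame_len`, and `synthesis_hop` are assumptions made for illustration.

```python
import numpy as np

def ola_time_stretch(x, rate, frame_len=1024, synthesis_hop=256):
    """Plain overlap-add time-scale modification (no frame synchronization).

    rate > 1 shortens the signal, rate < 1 lengthens it.
    All names and defaults here are illustrative assumptions.
    """
    x = np.asarray(x, dtype=float)
    # Frames are read every analysis_hop samples and written every synthesis_hop.
    analysis_hop = max(1, int(round(synthesis_hop * rate)))
    window = np.hanning(frame_len)
    n_frames = max(1, 1 + (len(x) - frame_len) // analysis_hop)
    out_len = (n_frames - 1) * synthesis_hop + frame_len
    out = np.zeros(out_len)
    norm = np.zeros(out_len)
    for i in range(n_frames):
        a = i * analysis_hop   # read position in the input
        s = i * synthesis_hop  # write position in the output
        frame = x[a:a + frame_len]
        if len(frame) < frame_len:
            # Zero-pad a short final frame.
            frame = np.pad(frame, (0, frame_len - len(frame)))
        out[s:s + frame_len] += window * frame
        norm[s:s + frame_len] += window
    # Divide by the summed window so overlapping regions keep their amplitude.
    norm[norm < 1e-8] = 1.0
    return out / norm
```

Calling `ola_time_stretch(signal, 2.0)` returns a signal roughly half as long as the input. A synchronized method such as PAOLA would additionally shift each frame before adding it so that waveform peaks line up with the output already written.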

Cited by 9 publications (6 citation statements)
References 12 publications
“…Pairs of real-speech and Jabberwocky stories were matched in terms of silence-to-signal ratio by increasing silences (the portions of signal with amplitude between 0.001 and −0.001 of the maximum amplitude and longer than 50 ms) with the adequate time constant. The length of each story was also matched by slightly changing the sound tempo with a MATLAB (Mathworks Inc.) implementation of the VSOLA (variable parameter synchronized overlap add) algorithm [47]. The volume of acoustic stimuli was set between 45 and 50 dB following participants' preferences and in line with our previous studies [15,27,28].…”
Section: Methods (mentioning)
confidence: 99%
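The silence criterion quoted in this statement (amplitude within ±0.001 of the maximum amplitude, lasting longer than 50 ms) can be sketched as follows. This is an illustrative reading of the quoted definition, not the citing authors' MATLAB code; the function name `find_silences` and its parameters are assumptions.

```python
import numpy as np

def find_silences(x, sample_rate, rel_threshold=0.001, min_duration=0.050):
    """Return (start, end) sample indices of quiet runs, end exclusive.

    A sample is quiet when |x| <= rel_threshold * max|x|; a run counts as a
    silence only if it is longer than min_duration seconds.
    Names and defaults are illustrative assumptions.
    """
    x = np.asarray(x, dtype=float)
    peak = np.max(np.abs(x)) if len(x) else 0.0
    quiet = np.abs(x) <= rel_threshold * peak
    # Pad with False so every quiet run has both a rising and a falling edge.
    padded = np.concatenate(([False], quiet, [False]))
    edges = np.diff(padded.astype(int))
    starts = np.flatnonzero(edges == 1)
    ends = np.flatnonzero(edges == -1)
    min_len = int(round(min_duration * sample_rate))
    return [(int(s), int(e)) for s, e in zip(starts, ends) if e - s > min_len]
```

From the returned runs, a silence-to-signal ratio could be estimated as the total silent length divided by the remaining length, which is one plausible way to compare two recordings before lengthening their silences.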
“…Once the peak has been determined, the lowest energy point between the two peaks is configured as the syllabic boundary (Jarman et al., 2003; Kwon & Kim, 2011; O'Haver, 2001). The time scale is modified by the Synchronized Overlap-Add Algorithm (Covell, Withgott, & Slaney, 1998; Dorran et al., 2003; Hejna & Musicus, 2003; Ninness & Henriksen, 2008).…”
Section: Methods (mentioning)
confidence: 99%
“…But editing sound effects is yet another field that demands knowledge and expertise that most users do not possess. If there were intuitive tools that could allow an individual to create character motions through simple finger strokes and match sound effects to the specific situations of a scene, then content creation could become a much easier endeavor (Dorran, Lawlor, & Coyle, 2003; Gillet & Richard, 2005; Ishihara, Nakatani, Ogata, & Okuno, 2004; Ishihara et al., 2003; Jarman, Daly, Anderson, & Wahl, 2003; Kwon & Kim, 2011).…”
Section: Introduction (mentioning)
confidence: 98%
“…This section introduces the theoretical basis of the real-time iterative inversion (RTISI) and its implementation. In recent years, several TSM algorithms [4] have been proposed [5,6,7,8]. This paper adopts the successful RTISI algorithm [9,10], which processes according to Fig.…”
Section: Speech Time-scale Modification (mentioning)
confidence: 99%