Proceedings 3rd IEEE International Conference on Advanced Technologies
DOI: 10.1109/icalt.2003.1215049
|View full text |Cite
|
Sign up to set email alerts
|

Speech-adaptive time-scale modification for computer assisted language-learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
6
0

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 9 publications
(7 citation statements)
references
References 7 publications
0
6
0
Order By: Relevance
“…1 Corresponding author 2 time-scaled by applying different scaling factors to different speech segments, depending on the broad phonetic characteristics, without reducing its quality and naturalness (Donnellan et al, 2003). It was concluded in Kuwabara and Nakamura (2000) that voiced frames need to be more affected by time-scaling than mixed frames, and much more than unvoiced frames (Campbell and Isard, 1991).…”
Section: Broad Phonetic Classification Using Discriminative Bayesian mentioning
confidence: 99%
See 1 more Smart Citation
“…1 Corresponding author 2 time-scaled by applying different scaling factors to different speech segments, depending on the broad phonetic characteristics, without reducing its quality and naturalness (Donnellan et al, 2003). It was concluded in Kuwabara and Nakamura (2000) that voiced frames need to be more affected by time-scaling than mixed frames, and much more than unvoiced frames (Campbell and Isard, 1991).…”
Section: Broad Phonetic Classification Using Discriminative Bayesian mentioning
confidence: 99%
“…To maintain the characteristics of plosives or parts of plosives (a closure or release), time-scale modification should not be so applied. Silence frames, moreover, should be treated like voiced frames (Donnellan et al, 2003).…”
Section: Broad Phonetic Classification Using Discriminative Bayesian mentioning
confidence: 99%
“…Time stretching corresponds to the extension of the signal, but this term is used as a synonym for TSM. Audio time stretching has numerous applications, such as fast browsing of speech recordings [4], music production [5], foreign language and music learning [6], fitting of a piece of music to a prescribed time slot [7], and slowing down the soundtrack for slow-motion video [8]. Additionally, TSM is often used as a processing step in pitch shifting, which aims at changing the frequencies in the signal without changing its duration [2,3,7,9,10].…”
Section: Introductionmentioning
confidence: 99%
“…Attempts to incorporate nonuniform duration modification are reported in the literature [4], [10], [11].…”
mentioning
confidence: 99%
“…Malcolm Slaney et al have used nonuniform time scaling along with spectral shape and pitch modification for automatically morphing one sound to another sound [10]. Olovia Donnellan et al have proposed a method for speech adaptive TSM, which allows slowing down speech without compromising the quality or naturalness of the slowed speech [11]. In their method, different scaling factors are applied to different types of speech segments.…”
mentioning
confidence: 99%