2007
DOI: 10.1016/j.specom.2007.01.002
|View full text |Cite
|
Sign up to set email alerts
|

Acoustic variability and automatic recognition of children’s speech

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

8
70
0
4

Year Published

2009
2009
2024
2024

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 124 publications
(82 citation statements)
references
References 47 publications
8
70
0
4
Order By: Relevance
“…Also, for ASR of adults' speech, the improvements obtained are less than that for ASR of children's speech under mismatched conditions. This is because the variances for the observation densities of the phone models are greater for the poor models trained on child speech than for the models trained on adult speech [2,8]. This means that the Gaussian densities are more scattered and thus less separable in the acoustic feature space for models trained on child speech.…”
Section: C) Proposed Algorithm For Adaptive Mfcc Feature Truncation Fmentioning
confidence: 99%
See 2 more Smart Citations
“…Also, for ASR of adults' speech, the improvements obtained are less than that for ASR of children's speech under mismatched conditions. This is because the variances for the observation densities of the phone models are greater for the poor models trained on child speech than for the models trained on adult speech [2,8]. This means that the Gaussian densities are more scattered and thus less separable in the acoustic feature space for models trained on child speech.…”
Section: C) Proposed Algorithm For Adaptive Mfcc Feature Truncation Fmentioning
confidence: 99%
“…VTLN and CMLLR are the two effective techniques in the literature that are used to reduce acoustic mismatch between adult speech and child speech [8,11]. Our proposed algorithm for adaptive MFCC feature truncation also addresses acoustic mismatch and has given significant improvements in performance.…”
Section: D) Combining Proposed Algorithm With Vtln And/or Cmllrmentioning
confidence: 99%
See 1 more Smart Citation
“…In (McGowan and Nittrouer, 1988;Nittrouer and Whalen, 1989;Lee et al, 1999;Narayanan and Potamianos, 2002;Gerosa et al, 2007), it was shown that acoustic and linguistic characteristics of children's speech are widely different from those of adults. Furthermore, these studies also show that characteristics of children's speech vary rapidly as a function of age due to the anatomical and physiological changes occurring during a child's growth and because children become more skilled in coarticulation with age.…”
Section: Speech Corporamentioning
confidence: 99%
“…These research findings motivated us to collect a corpus of children conversational data. In fact, the few existing corpora of children speech turned out to be not usable in our system for none of them was in Danish and moreover consisted of either prompted speech or monologues of children recounting stories (D'Arcy et al, 2004;Eskenazi, 1996;Gerosa & Giuliani, 2004;Hagen et al, 1996). We transcribed and analyzed several hours of collected video and audio-taped conversation of young subjects involved in a series of interactive sessions in both Wizard of Oz studies and in an after-school class where they played with a real human actor impersonating Hans Christian Andersen (Figure 3, left).…”
Section: Children Spoken Language Recognition: Issuesmentioning
confidence: 99%