Interspeech 2019 2019
DOI: 10.21437/interspeech.2019-1847
|View full text |Cite
|
Sign up to set email alerts
|

A Frequency Normalization Technique for Kindergarten Speech Recognition Inspired by the Role of fo in Vowel Perception

Abstract: Accurate automatic speech recognition (ASR) of kindergarten speech is particularly important as this age group may benefit the most from voice-based educational tools. Due to the lack of young child speech data, kindergarten ASR systems often are trained using older child or adult speech. This study proposes a fundamental frequency (fo)-based normalization technique to reduce the spectral mismatch between kindergarten and older child speech. The technique is based on the tonotopic distances between formants an… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
11
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
1

Relationship

1
4

Authors

Journals

citations
Cited by 6 publications
(11 citation statements)
references
References 31 publications
0
11
0
Order By: Relevance
“…We note that the CMU Kids testing set had a narrower age range (approximately 6-9 years old excluding the two outlier children) compared to the OGI Kids' testing set (approximately 5-11 years old). The f o normalization method has been shown to produce larger improvements when the range of ages used in training and testing data is wider [15]. A similar phenomenon may be occurring for f o perturbation.…”
Section: Resultsmentioning
confidence: 78%
See 4 more Smart Citations
“…We note that the CMU Kids testing set had a narrower age range (approximately 6-9 years old excluding the two outlier children) compared to the OGI Kids' testing set (approximately 5-11 years old). The f o normalization method has been shown to produce larger improvements when the range of ages used in training and testing data is wider [15]. A similar phenomenon may be occurring for f o perturbation.…”
Section: Resultsmentioning
confidence: 78%
“…The remainder of the paper is organized as follows. Section 2 reviews the f o normalization technique proposed in [15] and formulates the data augmentation technique. Section 3 describes the databases and experimental setup.…”
Section: Introductionmentioning
confidence: 99%
See 3 more Smart Citations