2019
DOI: 10.1016/j.csl.2019.03.005
|View full text |Cite
|
Sign up to set email alerts
|

Vocal Tract Length Normalization using a Gaussian mixture model framework for query-by-example spoken term detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(1 citation statement)
references
References 61 publications
0
1
0
Order By: Relevance
“…A classic approach to tackle the problem of speaker variablity in Automatic Speech Recognition is Vocal Tract Length Normalization (VTLN) [9][10][11], which has also been applied in lowresource settings, e.g., for spoken term detection [12]. Furthermore, Feature Space Maximum Likelihood Linear Regression (fMLLR) is often used, even in low-resource setups [13].…”
Section: Related Workmentioning
confidence: 99%
“…A classic approach to tackle the problem of speaker variablity in Automatic Speech Recognition is Vocal Tract Length Normalization (VTLN) [9][10][11], which has also been applied in lowresource settings, e.g., for spoken term detection [12]. Furthermore, Feature Space Maximum Likelihood Linear Regression (fMLLR) is often used, even in low-resource setups [13].…”
Section: Related Workmentioning
confidence: 99%