4th European Conference on Speech Communication and Technology (Eurospeech 1995) 1995
DOI: 10.21437/eurospeech.1995-363
|View full text |Cite
|
Sign up to set email alerts
|

Connected digit recognition using statistical template matching

Abstract: In this paper we describe the optimization of 'conventional' template matching techniques for connected digit recognition (TI/NIST connected digit corpus). In particular we carried out a series of experiments in which we studied various aspects of signal processing, acoustic modeling, mixture densities and linear transforms of the acoustic vector. After all optimization steps, our best string error rate on the TI/NIST connected digit corpus was 1.71% for single densities and 0.74% for mixture densities.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

1999
1999
2001
2001

Publication Types

Select...
6
1

Relationship

2
5

Authors

Journals

citations
Cited by 12 publications
(2 citation statements)
references
References 4 publications
0
2
0
Order By: Relevance
“…The baseline recognizer applies ML training using the Viterbi approximation in combination with an optional LDA. A detailed description of the baseline system can be found in [11]. The word error rates obtained with the baseline system for the combined recognition of both genders are summarized in Table 2 (0 tangent vectors (tv) per mixture (mix)).…”
Section: Resultsmentioning
confidence: 99%
“…The baseline recognizer applies ML training using the Viterbi approximation in combination with an optional LDA. A detailed description of the baseline system can be found in [11]. The word error rates obtained with the baseline system for the combined recognition of both genders are summarized in Table 2 (0 tangent vectors (tv) per mixture (mix)).…”
Section: Resultsmentioning
confidence: 99%
“…The baseline recognizer applies ML training using the Viterbi approximation which serves as a starting point for the additional discriminative training. A detailed description of the baseline system could be found in [9].…”
Section: Resultsmentioning
confidence: 99%