2007
DOI: 10.1109/tasl.2007.894524
|View full text |Cite
|
Sign up to set email alerts
|

Template-Based Continuous Speech Recognition

Abstract: Despite their known weaknesses, hidden Markov models (HMMs) have been the dominant technique for acoustic modeling in speech recognition for over two decades. Still, the advances in the HMM framework have not solved its key problems: it discards information about time dependencies and is prone to overgeneralization. In this paper, we attempt to overcome these problems by relying on straightforward template matching. The basis for the recognizer is the well-known DTW algorithm. However, classical DTW continuous… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

2
87
1
1

Year Published

2010
2010
2021
2021

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 130 publications
(91 citation statements)
references
References 25 publications
2
87
1
1
Order By: Relevance
“…(De Wachter et al, 2007)) in the scope of this paper, which would immediately enhance the recognition performance at higher SNR levels. In such a setting, the acoustic scores obtained from both streams can be combined to benefit from the noise robustness of exemplar-based acoustic modeling and better discrimination of the statistical models such as complex GMM distributions in conjunction with MFCC features or DNNs.…”
Section: General Discussion and Concluding Remarksmentioning
confidence: 97%
See 3 more Smart Citations
“…(De Wachter et al, 2007)) in the scope of this paper, which would immediately enhance the recognition performance at higher SNR levels. In such a setting, the acoustic scores obtained from both streams can be combined to benefit from the noise robustness of exemplar-based acoustic modeling and better discrimination of the statistical models such as complex GMM distributions in conjunction with MFCC features or DNNs.…”
Section: General Discussion and Concluding Remarksmentioning
confidence: 97%
“…Considering the dimensionality and computational restrictions, the same framework using exemplars associated with more general subword units such as phones or syllables could be applied to a medium or large vocabulary task. Only the current decoding scheme would need to be redesigned in a way that it will incorporate a language model combined with the acoustic costs, but for this it could largely rely on existing exemplar matching frameworks (De Wachter et al, 2007).…”
Section: General Discussion and Concluding Remarksmentioning
confidence: 99%
See 2 more Smart Citations
“…In the history of automatic speech recognition (ASR), DTW first became popular in isolated and connected word recognition and then was supplanted by hidden Markov models (HMMs), a statistical modeling framework appropriate for large vocabulary continuous speech recognition (LVCSR). However, DTW has drawn much interest recently for unsupervised and low-resource tasks, e.g., template-based speech recognition [2,3], unsupervised speech pattern discovery [4,5], example-based spoken term detection (STD) [6,7] and acousticbased spoken document segmentation [8]. Recently, Zhang et.…”
Section: Introductionmentioning
confidence: 99%