2013 IEEE International Conference on Acoustics, Speech and Signal Processing 2013
DOI: 10.1109/icassp.2013.6639236
|View full text |Cite
|
Sign up to set email alerts
|

Confidence index dynamic time warping for language-independent embedded speech recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2014
2014
2016
2016

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 15 publications
0
4
0
Order By: Relevance
“…Under quiet environment and bad recording condition (Speaker ), OAWDTW improves the accuracy by about 0.5% and acquires a 9.09% RRER. As DTW already does almost perfect speech recognition under quiet environment [27] [33] – [34] , it is likely that we will not get any improvement by using OAWDTW. Thus it is encouraging that our OAWDTW achieves a little better recognition accuracy for bad recording condition.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…Under quiet environment and bad recording condition (Speaker ), OAWDTW improves the accuracy by about 0.5% and acquires a 9.09% RRER. As DTW already does almost perfect speech recognition under quiet environment [27] [33] – [34] , it is likely that we will not get any improvement by using OAWDTW. Thus it is encouraging that our OAWDTW achieves a little better recognition accuracy for bad recording condition.…”
Section: Resultsmentioning
confidence: 99%
“…It is noted that these weighted DTW do not decrease time complexities. We have developed confidence index dynamic time warping (CIDTW) [27] and merge-weighted dynamic time warping (MWDTW) [28] methods of fast and accurate speech recognition for clean speech data. Both methods involve a merging step that merges adjacent similar time frames in one speech signal and then performs DTW on merged speech data.…”
Section: Introductionmentioning
confidence: 99%
“…The problems with the original DTW formulation are its high computational burden, the low performance in speaker independent tasks and the discrimination between in-domain and out-ofdomain sentences. In the literature, particular attention has been devoted to develop efficient versions of DTW for devices with limited computational resources [5][6][7].…”
Section: Introductionmentioning
confidence: 99%
“…That is, speeding up DTW process at the expense of accuracy. On the other hand, considering that DTW gives each time frame equal weight to align two time series, the authors of [13] and [14] introduced two weighted DTW methods to avoid potential misclassification caused by equal weight. It is noted that these weighted DTW do not decrease time complexities.…”
Section: Introductionmentioning
confidence: 99%