2014
DOI: 10.1371/journal.pone.0085458
|View full text |Cite
|
Sign up to set email alerts
|

One-against-All Weighted Dynamic Time Warping for Language-Independent and Speaker-Dependent Speech Recognition in Adverse Conditions

Abstract: Considering personal privacy and difficulty of obtaining training material for many seldom used English words and (often non-English) names, language-independent (LI) with lightweight speaker-dependent (SD) automatic speech recognition (ASR) is a promising option to solve the problem. The dynamic time warping (DTW) algorithm is the state-of-the-art algorithm for small foot-print SD ASR applications with limited storage space and small vocabulary, such as voice dialing on mobile devices, menu-driven recognition… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
10
0

Year Published

2014
2014
2023
2023

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 15 publications
(11 citation statements)
references
References 20 publications
0
10
0
Order By: Relevance
“…Then, the pre-emphasized speech signal is divided into short time segments called "frames" [5]. The length of the overlapping part between adjacent frames, as shown in Fig.…”
Section: B Frame Blockingmentioning
confidence: 99%
See 2 more Smart Citations
“…Then, the pre-emphasized speech signal is divided into short time segments called "frames" [5]. The length of the overlapping part between adjacent frames, as shown in Fig.…”
Section: B Frame Blockingmentioning
confidence: 99%
“…ICA), Linear Predictive Coding, Cepstral Analysis, Filter bank analysis, kernel-based methods [2] are the common feature extraction methods and have been widely employed in speech recognition research field. Among these methods, the MFCC method has been widely used and has achieved the high recognition accuracy in speech recognition system [3,4,5]. There are many works studying MFCC, especially in improving the recognition accuracy [3].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…In an effort to improve performance, several variations of DTW have been proposed since its inception. For example, a one-against-all index (OAI) for each time series under consideration is proposed in [4]. The OAI is subsequently used to weight the corresponding DTW alignment score in a speech recognition system.…”
Section: Introductionmentioning
confidence: 99%
“…DTW has proven to be very effective in speech recognition under source and ambient variations (Zhang et al (2014)) and ECG profile characterization (Huang and Kinsner (2002)), which are similar to the ENS EAP analysis at hand. We established a fastDTW method with automatic thresholding for ENS recordings with large spike variability for real-time applications.…”
Section: Introductionmentioning
confidence: 99%