2010 IEEE International Conference on Acoustics, Speech and Signal Processing 2010
DOI: 10.1109/icassp.2010.5495649
|View full text |Cite
|
Sign up to set email alerts
|

Sparse coding for speech recognition

Abstract: This paper proposes a novel feature extraction technique for speech recognition based on the principles of sparse coding. The idea is to express a spectro-temporal pattern of speech as a linear combination of an overcomplete set of basis functions such that the weights of the linear combination are sparse. These weights (features) are subsequently used for acoustic modeling. We learn a set of overcomplete basis functions (dictionary) from the training set by adopting a previously proposed algorithm which itera… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
48
0

Year Published

2010
2010
2023
2023

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 67 publications
(49 citation statements)
references
References 12 publications
0
48
0
Order By: Relevance
“…In recent years, sparse representation (SR) based features are used for speech recognition where, given a segment of speech signal (frame) and a dictionary, a sparse feature vector is computed for the classification/recognition task [1], [2]. The SR based signal processing is supported by an observation that signal can be written as linear combination of minimum number of atoms of a dictionary [2].…”
Section: Introductionmentioning
confidence: 99%
See 3 more Smart Citations
“…In recent years, sparse representation (SR) based features are used for speech recognition where, given a segment of speech signal (frame) and a dictionary, a sparse feature vector is computed for the classification/recognition task [1], [2]. The SR based signal processing is supported by an observation that signal can be written as linear combination of minimum number of atoms of a dictionary [2].…”
Section: Introductionmentioning
confidence: 99%
“…In particular, speech recognition in exemplar based approaches is performed either using the atom activations of the estimated sparse feature vector [3], [4], or using the minimum reconstruction error [5] between the test exemplar and its estimate. On the contrary, in feature based approaches, either the derived sparse vector [1] or the estimate of speech is used as a feature [6] for acoustic modeling. For computing the sparse feature vector, approaches in [3], [4] use a single overcomplete dictionary while [5] use multiple dictionaries corresponding to different speech units.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…However, compared with image processing, audio processing has payed less attention on sparse coding, which has been ever applied on speech recognition [10], speaker identification [11], speech enhancement [12] and so on. Furthermore, in [13], it proposed a novel algorithm for computing SISC (shift-invariant sparse coding) aimed to implement audio classification.…”
Section: Introductionmentioning
confidence: 99%