2010
DOI: 10.1109/lsp.2010.2079930
|View full text |Cite
|
Sign up to set email alerts
|

Data-Driven and Feedback Based Spectro-Temporal Features for Speech Recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
7
0

Year Published

2012
2012
2022
2022

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 14 publications
(7 citation statements)
references
References 16 publications
0
7
0
Order By: Relevance
“…This joint spectrotemporal analysis effectively performs a double wavelet transform of the auditory spectrogram revealing and capturing efficiently the distinctive patterns of modulations of speech, music, and other complex environmental sounds. For example, “cortical features” have been shown to be effective in numerous applications ranging from speech intelligibility assessment [26], to hearing impairments and hearing aids, to the detection of speech, speaker identification, speech recognition [27,28], and musical timbre classification [29]. Most enlightening, however, have been the mathematical analyses of closely related constructs employing 'cascaded wavelet transforms' that have explained the reasons underlying the effectiveness of these cortical features [30].…”
Section: Primary Auditory Cortical Processing and Representationsmentioning
confidence: 99%
“…This joint spectrotemporal analysis effectively performs a double wavelet transform of the auditory spectrogram revealing and capturing efficiently the distinctive patterns of modulations of speech, music, and other complex environmental sounds. For example, “cortical features” have been shown to be effective in numerous applications ranging from speech intelligibility assessment [26], to hearing impairments and hearing aids, to the detection of speech, speaker identification, speech recognition [27,28], and musical timbre classification [29]. Most enlightening, however, have been the mathematical analyses of closely related constructs employing 'cascaded wavelet transforms' that have explained the reasons underlying the effectiveness of these cortical features [30].…”
Section: Primary Auditory Cortical Processing and Representationsmentioning
confidence: 99%
“…In most of the research on speech processing, the Fourier transform and its variants are used to convert the speech from the time domain into the frequency domain. With this approach, some well known speech features were introduced such as spectral, cepstral [2] [3] [4], Mel cepstral [5] [6], MFCC [7], [8]. Because the human brain understands the properties of speech in the frequency domain, these features are widely used and gain many possible results.…”
Section: Introductionmentioning
confidence: 99%
“…Consequently, some of the research works have considered one of the South Asian languages like Marathi. However, there is no evidence for o ering e ective solutions while recognizing the Marathi language [3]. Moreover, the Marathi language model suffers from inadequate SC and small size vocabulary systems [4].…”
Section: Introductionmentioning
confidence: 99%