1999
DOI: 10.1109/97.789604
|View full text |Cite
|
Sign up to set email alerts
|

Teager energy based feature parameters for speech recognition in car noise

Abstract: In this letter, a new set of speech feature parameters based on multirate signal processing and the Teager energy operator is introduced. The speech signal is first divided into nonuniform subbands in mel-scale using a multirate filterbank, then the Teager energies of the subsignals are estimated. Finally, the feature vector is constructed by log-compression and inverse discrete cosine transform (DCT) computation. The new feature parameters have robust speech recognition performance in the presence of car engi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
39
0
4

Year Published

2004
2004
2016
2016

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 96 publications
(43 citation statements)
references
References 11 publications
0
39
0
4
Order By: Relevance
“…Different methods are used to analyze the audio signal in order to extract the features and use them in the fall detection approach. Such methods are analysis of the audio signal based on wavelet domain feature extraction or analysis based on Fourier domain feature extraction [11]. The problem with this type of detectors is that other sounds coming from the background may be mixed with falling events; so again, the reliability of the system is in question.…”
Section: Related Approachesmentioning
confidence: 99%
“…Different methods are used to analyze the audio signal in order to extract the features and use them in the fall detection approach. Such methods are analysis of the audio signal based on wavelet domain feature extraction or analysis based on Fourier domain feature extraction [11]. The problem with this type of detectors is that other sounds coming from the background may be mixed with falling events; so again, the reliability of the system is in question.…”
Section: Related Approachesmentioning
confidence: 99%
“…In practice, since corrupted noise is effectively suppressed by the TE operator, the TE operator can provide better ability to discriminate speech characteristics from noise. The TE operator is easily implemented through the time domain and is defined as given by [5], [6]:…”
Section: Review Of the Teager Energy Operatormentioning
confidence: 99%
“…In this letter, we propose a novel approach to the VAD algorithm in which global speech absence probability (GSAP) [3], based on Teager energy (TE) [5], [6], is derived to improve the performance of VAD in various noisy environments. Statistical model-based GSAP is one of the feature parameters which is widely adopted in the decision rule for VAD, and it is used as the smoothing parameter for updating the noise signal in the speech enhancement algorithm.…”
Section: Introductionmentioning
confidence: 99%
“…Wavelet filterbanks have been proposed for speech recognition (Jabloun et al, 1999) and speaker identification (Sarikaya et al, 1998). These applications use respectively 21 subbands (Fig.…”
Section: Inverse Transformationmentioning
confidence: 99%