2014
DOI: 10.1007/s11277-014-1752-9
|View full text |Cite
|
Sign up to set email alerts
|

Improvement of Speech Detection Using ERB Feature Extraction

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
4
0

Year Published

2014
2014
2024
2024

Publication Types

Select...
5
1
1

Relationship

1
6

Authors

Journals

citations
Cited by 14 publications
(4 citation statements)
references
References 22 publications
0
4
0
Order By: Relevance
“…Speech recognition performance in noisy car environments is demonstrated to be improved by combining microphone array processing techniques with a visual-audio Voice Activity Detector (VAD) [5]. The proposed localization framework combined with delay and beamforming yields a 7.1 A learning model using acoustic models to increase speech recognition rates is created [6]. In speech processing applications, noise processing for speech recognition systems is often expressed as a digital filtering process where noisy speech is passed through a linear filter to obtain clean speech predictions.…”
Section: Related Workmentioning
confidence: 99%
“…Speech recognition performance in noisy car environments is demonstrated to be improved by combining microphone array processing techniques with a visual-audio Voice Activity Detector (VAD) [5]. The proposed localization framework combined with delay and beamforming yields a 7.1 A learning model using acoustic models to increase speech recognition rates is created [6]. In speech processing applications, noise processing for speech recognition systems is often expressed as a digital filtering process where noisy speech is passed through a linear filter to obtain clean speech predictions.…”
Section: Related Workmentioning
confidence: 99%
“…The feature extraction procedure extracts some of the popular features like mel-frequency spectra and perceptual linear prediction (PLP) coefficients. The features can also be extracted using a rectangular equivalent bandwidth (ERB) to increase the rate of speech recognition (Oh & Chung, 2014). Amplitude modulation spectrogram (AMS) design has also been used for speech recognition for increasing its performance by noise attenuation (Ma & Zhou, 2008).…”
Section: Introductionmentioning
confidence: 99%
“…[22] proposed a novel feature extraction algorithm based on the double-combined Fourier transform and envelope line fitting is proposed. [23] extracted feature using an equivalent rectangular bandwidth (ERB) filter band cepstrum and constructed a learning model using the acoustic model to improve the speech detection and recognition, etc. Those methods take the advantages of various characteristic and intelligence algorithm, improve the detection effect under noise environment effectively, but their algorithmically complex increased sharply, and the detection effect remains unsatisfactory under low SNR, especially for non-stationary noise.…”
Section: Introductionmentioning
confidence: 99%