IEEE Workshop on Automatic Speech Recognition and Understanding, 2001. ASRU '01.
DOI: 10.1109/asru.2001.1034579
|View full text |Cite
|
Sign up to set email alerts
|

Histogram based normalization in the acoustic feature space

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
40
0

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 51 publications
(40 citation statements)
references
References 6 publications
0
40
0
Order By: Relevance
“…HQ was verified to be able to handle various noisy conditions including non-stationary noisy environments [5,6]. The Histogram Equalization (HEQ) has been proposed and popularly used to equalize the cumulative distributions (or histograms) of both the training and testing feature parameters, and shown to produce very robust features for recognition [8,9,10]. The HEQ can be viewed as the limiting case of HQ proposed here when the number of the HQ quantization levels becomes infinite.…”
Section: Basic Formulation Of Hqmentioning
confidence: 98%
“…HQ was verified to be able to handle various noisy conditions including non-stationary noisy environments [5,6]. The Histogram Equalization (HEQ) has been proposed and popularly used to equalize the cumulative distributions (or histograms) of both the training and testing feature parameters, and shown to produce very robust features for recognition [8,9,10]. The HEQ can be viewed as the limiting case of HQ proposed here when the number of the HQ quantization levels becomes infinite.…”
Section: Basic Formulation Of Hqmentioning
confidence: 98%
“…The rest of the ASR system can be seen from [9]. There are different possible positions for pHEQ in the front-end [4]. In this paper, pHEQ is applied before the Mel-filter bank, since this configuration consistently outperformed the alternatives in our preliminary experiments.…”
Section: Parametric Histogram Equalizationmentioning
confidence: 99%
“…Both during training and testing the observed data is transformed as to match the target CDF as good as possible. The observation and target probability density functions (PDFs) pX (P log X ) and pY (P log Y ) can be approximated reasonably well by a bimodal Gaussian process [4]. The bimodal Gaussian statistics form a simple Gaussian Mixture Models (GMM) for which the parameters can be efficiently estimated using Expectation Maximization (EM).…”
Section: Parametric Histogram Equalizationmentioning
confidence: 99%
See 2 more Smart Citations