2015 IEEE International Conference on Multimedia Big Data 2015
DOI: 10.1109/bigmm.2015.46
|View full text |Cite
|
Sign up to set email alerts
|

A Robust Acoustic Feature Extraction Approach Based on Stacked Denoising Autoencoder

Abstract: Acoustic feature extraction (AFE) is considered as one of the most challenging techniques for speech applications since the adverse environment noises always cause significant variation on the extracted acoustic features. In this paper, we propose a systematical AFE approach which based on stacked denoising autoencoder (SDAE) aiming at extracting acoustic features automatically. Denoising autoencoder (DAE), which is trained to reconstruct a clean "repaired" input from a corrupted version of it, works as the ba… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 10 publications
(5 citation statements)
references
References 9 publications
0
5
0
Order By: Relevance
“…Because of its success, a lot of effort has been made to improve the noise and duration robustness of the i-vector/PLDA framework in recent years. For example, attempts have been made to enhance and restore speech in the feature domain [5] using factor analysis and in the spectral domain [6,7] or ivector space [8,9] using denoising autoencoders (DAE) [10]. Improving noise robustness of PLDA models is another direction.…”
Section: Introductionmentioning
confidence: 99%
“…Because of its success, a lot of effort has been made to improve the noise and duration robustness of the i-vector/PLDA framework in recent years. For example, attempts have been made to enhance and restore speech in the feature domain [5] using factor analysis and in the spectral domain [6,7] or ivector space [8,9] using denoising autoencoders (DAE) [10]. Improving noise robustness of PLDA models is another direction.…”
Section: Introductionmentioning
confidence: 99%
“…In our framework, other sizes of convolution kernels are also allowed. Moreover, the type of noise added is not limited to Gaussian noise, and other noise is also possible [63]. Further literature [64] proposed a classification approach called denoising and spare auto-encoder by combining DAE and SAE.…”
Section: Discussionmentioning
confidence: 99%
“…The idea of SDAE is to stack multiple DAEs to form a new deep network structure. The specific number of stacked DAEs depends on the actual demand [18,19]. Each layer of the network can be regarded as a DAE, which is trained layer by layer through multiple iterations.…”
Section: Stacked Dae (Sdae)mentioning
confidence: 99%