2016
DOI: 10.1080/21655979.2016.1197617
|View full text |Cite
|
Sign up to set email alerts
|

Wavelet speech enhancement algorithm using exponential semi-soft mask filtering

Abstract: In this paper, we propose a new speech enhancement algorithm based on wavelet packet decomposition and mask filtering. In the traditional mask filtering such as ideal binary mask (IBM), the basic idea is to classify speech components as target signal and non-speech components as background noises. However, speech and non-speech components cannot be well separated in target signal and background noise. Therefore, the IBM has residual noise and signal loss. To overcome this problem, the proposed algorithm used s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2017
2017
2020
2020

Publication Types

Select...
4
1
1

Relationship

2
4

Authors

Journals

citations
Cited by 7 publications
(6 citation statements)
references
References 9 publications
0
6
0
Order By: Relevance
“…In addition, all audio signals used in the experiments are sampled at 8 kHz and a 128 samples Hamming window with a 50% overlap is used to compute the DFT coefficients. The proposed algorithm is also compared with the classic prior SNR‐based Wiener filter algorithm [2], the audible noise suppression algorithm [21], the acoustic mask threshold constrained algorithm proposed in [24], and the mask filtering‐based algorithm [26].…”
Section: Experiments and Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…In addition, all audio signals used in the experiments are sampled at 8 kHz and a 128 samples Hamming window with a 50% overlap is used to compute the DFT coefficients. The proposed algorithm is also compared with the classic prior SNR‐based Wiener filter algorithm [2], the audible noise suppression algorithm [21], the acoustic mask threshold constrained algorithm proposed in [24], and the mask filtering‐based algorithm [26].…”
Section: Experiments and Resultsmentioning
confidence: 99%
“…The higher score indicates the better intelligibility of the enhanced speech. After making the pairwise comparison through the scores of five algorithms ( [2,21,24,26] and the proposed algorithm) with the t-test technique, the final subjective intelligibility test results are obtained. When the input SNR ranges from 0 to 5 dB, the t-test results from Tables 3-5 are the same, the test statistic t of means between the proposed method and other comparing methods are >2.262.…”
Section: Subjective Evaluation Of the Enhanced Speechmentioning
confidence: 99%
See 1 more Smart Citation
“…It is possible to resolve high-frequency components within a small time window of the signal [ 32 ]. Generally, WT is employed to decompose a signal by transforming a wavelet packet into time–frequency wavelet coefficients of multiple sub-bands [ 33 , 34 ]. In this study, WT decomposition was designed to represent the time–frequency form of the fNIRS signal using the Daubechies6 wavelet basis.…”
Section: Methodsmentioning
confidence: 99%
“…22 In general, WPD decomposes the noisy signal into time-frequency wavelet coefficients of multiple sub-bands using the wavelet packet transform. 23 In this paper, WPD was designed to enhance an fNIRS signal based on Daubechies 6 wavelet, which is a very efficient algorithm to reconstruct at an analysis of time-frequency domain. 24 The structure of the critical bands in WPD is optimized to compartmentalize fNIRS signal bands.…”
Section: Theory and Methodsmentioning
confidence: 99%