2016
DOI: 10.1007/s10772-014-9240-x
|View full text |Cite
|
Sign up to set email alerts
|

Wavelet energy based voice activity detection and adaptive thresholding for efficient speech coding

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 8 publications
(3 citation statements)
references
References 8 publications
0
3
0
Order By: Relevance
“…In this paper, a VAD based on DWT is considered since this transform is already computed as part of the noise classification pipeline, thus limiting the computational burden on the overall system. DWT-based VADs have been explored in the literature such as [7] but in this work, it is demonstrated that a fairly robust VAD can be achieved from simple energy comparisons at the frame and sub-band levels.…”
Section: B Voice Activity Detectionmentioning
confidence: 99%
“…In this paper, a VAD based on DWT is considered since this transform is already computed as part of the noise classification pipeline, thus limiting the computational burden on the overall system. DWT-based VADs have been explored in the literature such as [7] but in this work, it is demonstrated that a fairly robust VAD can be achieved from simple energy comparisons at the frame and sub-band levels.…”
Section: B Voice Activity Detectionmentioning
confidence: 99%
“…For threshold processing function, the hard threshold is easy to appear the pseudo‐Gibbs effects and while the wavelet coefficient is larger than the threshold, soft threshold wavelet comes out a constant deviation [38, 39]. To improve the inherent defects of soft and hard threshold methods, Gao and Bruce [40, 41] jointly proposed a semi‐soft threshold function and the Minimax threshold processing method.…”
Section: Related Workmentioning
confidence: 99%
“…Voice activity detection (VAD), i.e., detecting the presence or absence of human speech, is crucial for several speech processing applications, such as noisy speech enhancement [1], speaker recognition [2], speech coding systems [3], echo cancellation and hands-free telephony [4]. VAD algorithms trade off noise sensitivity, precision, and computational complexity and consist of two consecutive phases, namely, speechrelated feature extraction and a discriminating model.…”
Section: Introductionmentioning
confidence: 99%