Robust Speech Recognition and Understanding 2007
DOI: 10.5772/4740

Voice Activity Detection. Fundamentals and Speech Recognition System Robustness

Cited by 186 publications (109 citation statements)
References 37 publications
“…Detection performance as a function of the SNR [7] was assessed in terms of the non-speech hit-rate (HR0) and the speech hit-rate (HR1). Most VAD algorithms [4] fail when the noise level increases and the noise completely masks the speech signal. A VAD module is used in speech recognition systems within the feature extraction process.…”
Section: Voice Activity Detector (VAD)
Citation type: mentioning
confidence: 99%
“…If the noise estimate is too high, speech will be distorted, possibly resulting in intelligibility loss. The simplest approach is to estimate and update the noise spectrum during the silent segments (pauses) of the signal using voice-activity detection (VAD) [4]. Although such an approach might work satisfactorily in stationary noise, it will not work well in more realistic environments where the spectral characteristics of the noise might be changing constantly.…”
Section: Noise Estimation Algorithms
Citation type: mentioning
confidence: 99%
“…Often, a voice activity detector (VAD) [38,39] is used to detect the speech and non-speech segments in the noisy signal and, then, noise is estimated from the latter segments. Other traditional noise estimation methods are based on tracking spectral minima in each frequency band [29], MMSE-based spectral tracking [21] or comb-filtering [30].…”
Section: Noise Model Estimation
Citation type: mentioning
confidence: 99%
“…VAD, also known as speech activity detection or speech detection, is a technique used in speech processing in which the presence or absence of human speech is detected [18]. The main applications of VAD are in speech coding, speech recognition and speech searching [25].…”
Section: Voice Activity Detector
Citation type: mentioning
confidence: 99%