2005
DOI: 10.1155/asp.2005.487
|View full text |Cite
|
Sign up to set email alerts
|

A Computationally Efficient Mel-Filter Bank VAD Algorithm for Distributed Speech Recognition Systems

Abstract: This paper presents a novel computationally efficient voice activity detection (VAD) algorithm and emphasizes the importance of such algorithms in distributed speech recognition (DSR) systems. When using VAD algorithms in telecommunication systems, the required capacity of the speech transmission channel can be reduced if only the speech parts of the signal are transmitted. A similar objective can be adopted in DSR systems, where the nonspeech parameters are not sent over the transmission channel. A novel appr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
20
0

Year Published

2009
2009
2022
2022

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 25 publications
(20 citation statements)
references
References 14 publications
0
20
0
Order By: Relevance
“…Results for the G.729, G.723.1 and MFB VAD methods are cited from [17], results for the LTSV and GMM-NLSM methods are from [27], and results for the DSR-AFE and variable frame rate (VFR) methods are from [3]. The comparison in this table is conducted in terms of frame error rate (FER) since results of LTSV and GMM-NSLM are only available in terms of FER.…”
Section: Comparison With Referenced Methods and Evaluation Of Differementioning
confidence: 99%
See 2 more Smart Citations
“…Results for the G.729, G.723.1 and MFB VAD methods are cited from [17], results for the LTSV and GMM-NLSM methods are from [27], and results for the DSR-AFE and variable frame rate (VFR) methods are from [3]. The comparison in this table is conducted in terms of frame error rate (FER) since results of LTSV and GMM-NSLM are only available in terms of FER.…”
Section: Comparison With Referenced Methods and Evaluation Of Differementioning
confidence: 99%
“…The comparison in this table is conducted in terms of frame error rate (FER) since results of LTSV and GMM-NSLM are only available in terms of FER. Note that the identical experimental settings and labels are used across [3,17,27] and the present work, so the comparison is valid.…”
Section: Comparison With Referenced Methods and Evaluation Of Differementioning
confidence: 99%
See 1 more Smart Citation
“…V AD is an important frontend in many speech-related applications, such as mobile communication system [1], echo cancellation [2], speech enhancement [3], speech coding [4], automatic speech recognition [5], etc. The accuracy of V AD is quite critical to the overall performance of those applications.…”
Section: Introductionmentioning
confidence: 99%
“…Second, automatic speech recognition systems no longer have to work on those parts of the signal, which contain only noise, the benefit of which is a faster speech recognition process. The second advantage is based on a frame dropping strategy [1,2,3]. In addition to a speed-up of the speech recognition process the frame dropping strategy can also increase its accuracy, if the VAD algorithm gives the right information, about which frame contains only noise and is, therefore, excluded from the further speech recognition process.…”
Section: Introductionmentioning
confidence: 99%