2014
DOI: 10.1109/tce.2014.7027350
|View full text |Cite
|
Sign up to set email alerts
|

Voice activity detection system for smart earphones

Abstract: This paper presents a real-time voice activity detection (VAD) algorithm implemented in a miniature Digital Signal Processor (DSP) for in-ear listening devices such as earphones or headphones. This system allows consumers to hear external speech signals such as public announcements or oral communication while listening to music without removing their listening devices. The proposed algorithm uses two normalized energy features that compare the energy in the frequency region containing speech information with t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
9
0

Year Published

2016
2016
2024
2024

Publication Types

Select...
6
2
1

Relationship

1
8

Authors

Journals

citations
Cited by 16 publications
(9 citation statements)
references
References 14 publications
0
9
0
Order By: Relevance
“…Sampling frequency and sample width are important design parameters for a digital VAD but either or both the parameters are not provided in [5], [15], [20]. Mixed-signal designs presented in [9], [19], [22]- [24] do not provide complete information about the bandwidth, sample frequency or sample width.…”
Section: A Bandwidth and Dynamic Rangementioning
confidence: 99%
See 1 more Smart Citation
“…Sampling frequency and sample width are important design parameters for a digital VAD but either or both the parameters are not provided in [5], [15], [20]. Mixed-signal designs presented in [9], [19], [22]- [24] do not provide complete information about the bandwidth, sample frequency or sample width.…”
Section: A Bandwidth and Dynamic Rangementioning
confidence: 99%
“…Covering the last 12 years (Jan 2010 -Dec 2021), we found several hardware implementations of VAD (either mea- sured or simulated at the device level using foundry PDKs) [5]- [25]. This includes digital ( [5], [7], [16], [18], [20], [21]), analog ( [8], [10]), and mixed-signal ( [6], [9], [17], [19], [22]- [25]) designs along with some implementations on Field Programmable Gate Array (FPGA) ( [13], [14]), Field Programmable Analog Array (FPAA) ( [12]), Digital Signal Processor (DSP) ( [15]), and Neuromorphic platforms (e.g. Intel Loihi [26]) ( [11]), see Fig.…”
mentioning
confidence: 99%
“…As far as real-time VADs are concerned, in [19], Lezzoum et al utilized normalized energy features along with a thresholding technique. The real-time VAD developed by Sehgal et al in [2] was implemented to run on smartphones as an app using the features developed in [1].…”
Section: Introductionmentioning
confidence: 99%
“…These AWDs are capable of protecting the ear from background noise while simultaneously transmitting warning signals to the wearer's ear [3] and enabling face-to-face communication by detecting and transmitting enhanced speech signals to the protected ear [4].…”
Section: Introductionmentioning
confidence: 99%