2009
DOI: 10.1109/tce.2009.5278015
|View full text |Cite
|
Sign up to set email alerts
|

Space-time voice activity detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3

Citation Types

0
3
0

Year Published

2016
2016
2023
2023

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 13 publications
0
3
0
Order By: Relevance
“…These VAD methods, however, cannot be used to differentiate between a target speaker and interfering speech sources as the proposed TAD does. In this case, multichannel methods are beneficial, which can be based, e.g., on localization methods like the Steered Response Power (SRP) method [7,8]. In a similar way, the cross-correlation function between two microphones can be exploited for TAD by looking for peaks at the time lag corresponding to the (known) target source position [9,10], and the Magnitude Squared Coherence (MSC) allows for differentiating between a dominant coherent point source and incoherent background noise [11].…”
Section: Introductionmentioning
confidence: 99%
“…These VAD methods, however, cannot be used to differentiate between a target speaker and interfering speech sources as the proposed TAD does. In this case, multichannel methods are beneficial, which can be based, e.g., on localization methods like the Steered Response Power (SRP) method [7,8]. In a similar way, the cross-correlation function between two microphones can be exploited for TAD by looking for peaks at the time lag corresponding to the (known) target source position [9,10], and the Magnitude Squared Coherence (MSC) allows for differentiating between a dominant coherent point source and incoherent background noise [11].…”
Section: Introductionmentioning
confidence: 99%
“…In order to be able to exploit spatial information, multi-microphone recordings are required. Conventional methods for acoustic source localization can be modified to allow a discrimination between multiple point sources [5] or between background noise (assumed to be incoherent) and point sources [6]. Similarly, the position of the null of an adaptive nullsteering beamformer can be tracked, indicating a dominant target source if the null is steered towards the target source position [7].…”
Section: Introductionmentioning
confidence: 99%
“…Conventional acoustic source localization techniques for multi-microphone arrays can be modified to provide information on target source activity. For instance, the Steered Response Power (SRP) method can be exploited to either distinguish between multiple point sources [4] or between point sources and incoherent background noise [5]. Similarly, the cross-correlation function between two microphones can be calculated, allowing for a detection of target activity when a peak is observed for the time lag corresponding to a target source position [6,7,8].…”
Section: Introductionmentioning
confidence: 99%