2017
DOI: 10.1007/978-3-319-53547-0_7
|View full text |Cite
|
Sign up to set email alerts
|

VAST: The Virtual Acoustic Space Traveler Dataset

Abstract: Abstract. This paper introduces a new paradigm for sound source localization referred to as virtual acoustic space traveling (VAST) and presents a first dataset designed for this purpose. Existing sound source localization methods are either based on an approximate physical model (physics-driven) or on a specific-purpose calibration set (data-driven). With VAST, the idea is to learn a mapping from audio features to desired audio properties using a massive dataset of simulated room impulse responses. This virtu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
22
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
5
1
1

Relationship

5
2

Authors

Journals

citations
Cited by 12 publications
(22 citation statements)
references
References 21 publications
0
22
0
Order By: Relevance
“…Moreover, we observed that incorporating random diffusion effects in simulations significantly increased the spatial richness of binaural features, improving estimation of all parameters. In a simultaneous companion paper [19], we validated the virtually-supervised learning framework by successfully localizing sources from real-room signals and showed the superiority of the approach over a traditional time-delay-based method. Extensions to speech [6], multiple sources [5] and additional partially-latent variables in GLLiM [20] will also be considered.…”
Section: Resultsmentioning
confidence: 94%
See 1 more Smart Citation
“…Moreover, we observed that incorporating random diffusion effects in simulations significantly increased the spatial richness of binaural features, improving estimation of all parameters. In a simultaneous companion paper [19], we validated the virtually-supervised learning framework by successfully localizing sources from real-room signals and showed the superiority of the approach over a traditional time-delay-based method. Extensions to speech [6], multiple sources [5] and additional partially-latent variables in GLLiM [20] will also be considered.…”
Section: Resultsmentioning
confidence: 94%
“…Overall, each pair of generated RIR depends on four parameters: the source's azimuth, elevation and range, and the mean absorption coefficient of walls. The complete dataset is available online at http://theVASTproject.inria.fr (see also [19]).…”
Section: Description Of Experimental Setupmentioning
confidence: 99%
“…To evaluate the embeddings in varying acoustic conditions, we used the VAST dataset [24] of simulated binaural room impulse responses of a KEMAR dummy head [25,26]. The training set consists of 15 rooms with reverberation time 0.1-0.4 s. Each room contains spherical grids of positions with radii 1, 1.5, and 2 meters, centered at 9 positions.…”
Section: Varying Acoustic Conditionsmentioning
confidence: 99%
“…This mapping is then reversed through Bayesian inversion, yielding an efficient estimator ofz givenτ(ξ). GLLiM was notably successfully applied to supervised binaural sound source localization using either real [16,12] or simulated [22] training sets. Here, a fixed value Q = 50, diagonal and equal noise covariance matrices and equal mixture weights are used in all experiments (see [13] for details on the GLLiM method).…”
Section: The Training Dataset Is Composed Ofmentioning
confidence: 99%