2022
DOI: 10.48550/arxiv.2204.01787
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

GWA: A Large High-Quality Acoustic Dataset for Audio Processing

Zhenyu Tang,
Rohith Aralikatti,
Anton Ratnarajah
et al.

Abstract: Figure 1: Our IR data generation pipeline starts from a 3D model of a complex scene and its visual material annotations (unstructured texts). We sample multiple collision-free source and receiver locations in the scene. We use a novel scheme to automatically assign acoustic material parameters by semantic matching from a large acoustic database. Our hybrid acoustic simulator generates accurate impulse responses (IRs), which become part of the large synthetic impulse response dataset after post-processing.

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
14
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(14 citation statements)
references
References 38 publications
0
14
0
Order By: Relevance
“…The publicly available IR datasets are either recorded in a realworld environments (recorded IR) [Carlo et al 2021;Hadad et al 2014;Koyama et al 2021; or generated using IR simulation tools (synthetic IR) Grondin et al 2020;Ko et al 2017;Tang et al 2022]. The recorded IR datasets are limited in size (fewer than 5000 IRs) and number of environments (fewer than 10 rooms), and no sufficient information on the recording conditions is provided to train a deep learning model.…”
Section: Indoor 3d Scenes and Ir Databasesmentioning
confidence: 99%
See 4 more Smart Citations
“…The publicly available IR datasets are either recorded in a realworld environments (recorded IR) [Carlo et al 2021;Hadad et al 2014;Koyama et al 2021; or generated using IR simulation tools (synthetic IR) Grondin et al 2020;Ko et al 2017;Tang et al 2022]. The recorded IR datasets are limited in size (fewer than 5000 IRs) and number of environments (fewer than 10 rooms), and no sufficient information on the recording conditions is provided to train a deep learning model.…”
Section: Indoor 3d Scenes and Ir Databasesmentioning
confidence: 99%
“…The recorded IR datasets are limited in size (fewer than 5000 IRs) and number of environments (fewer than 10 rooms), and no sufficient information on the recording conditions is provided to train a deep learning model. Synthetic IR datasets like SoundSpaces and GWA [Tang et al 2022] contain millions of IRs. The IRs in SoundSpaces is simulated using a geometric acoustic method [Schissler and Manocha 2011] and GWA dataset contains high-quality IRs computed using a hybrid method.…”
Section: Indoor 3d Scenes and Ir Databasesmentioning
confidence: 99%
See 3 more Smart Citations