2019
DOI: 10.1109/jstsp.2019.2917582
|View full text |Cite
|
Sign up to set email alerts
|

Building and evaluation of a real room impulse response dataset

Abstract: This paper presents BUT ReverbDB -a dataset of real room impulse responses (RIR), background noises and re-transmitted speech data. The retransmitted data includes LibriSpeech test-clean, 2000 HUB5 English evaluation and part of 2010 NIST Speaker Recognition Evaluation datasets. We provide a detailed description of RIR collection (hardware, software, post-processing) that can serve as a "cook-book" for similar efforts. We also validate BUT ReverbDB in two sets of automatic speech recognition (ASR) experiments … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
75
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 91 publications
(75 citation statements)
references
References 40 publications
0
75
0
Order By: Relevance
“…Therefore, these methods will not work well for low-frequency components of the sound, which inevitably introduces significant simulation error at low frequencies under 500Hz compared to accurate wave-based solvers [23]. Far-field ASR experiments that aim to compare the effectiveness of using simulated IRs against real IRs confirm that real IRs are superior in training better ASR systems [9]. The main drawback of geometric acoustic simulation is the absence of low-frequency wave effects such as diffraction [24] and room resonance [25], of which sound diffraction is a less noticeable phenomenon.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…Therefore, these methods will not work well for low-frequency components of the sound, which inevitably introduces significant simulation error at low frequencies under 500Hz compared to accurate wave-based solvers [23]. Far-field ASR experiments that aim to compare the effectiveness of using simulated IRs against real IRs confirm that real IRs are superior in training better ASR systems [9]. The main drawback of geometric acoustic simulation is the absence of low-frequency wave effects such as diffraction [24] and room resonance [25], of which sound diffraction is a less noticeable phenomenon.…”
Section: Related Workmentioning
confidence: 99%
“…Based on this representation, we calculate and extract sub-band EQs for a set of recorded IRs, collected from the BUT Reverb Database (ReverbDB) [9]. The BUT ReverbDB contains 1891 IRs and 9114 background noises (both with some repetitions), recorded in 9 different real-world environments.…”
Section: Equalization Matchingmentioning
confidence: 99%
See 2 more Smart Citations
“…The original utterances are sampled at 48kHz, which we down-sample to 16kHz for faster processing. We used noise signals from the BUTReverbDB database [19] to contaminate clean speech utterances. The noise files consist of recordings from silent office and conference rooms.…”
Section: Experimental Set-upmentioning
confidence: 99%