ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2020
DOI: 10.1109/icassp40776.2020.9052970
|View full text |Cite
|
Sign up to set email alerts
|

Impulse Response Data Augmentation and Deep Neural Networks for Blind Room Acoustic Parameter Estimation

Abstract: The reverberation time (T60) and the direct-to-reverberant ratio (DRR) are commonly used to characterize room acoustic environments. Both parameters can be measured from an acoustic impulse response (AIR) or using blind estimation methods that perform estimation directly from speech. When neural networks are used for blind estimation, however, a large realistic dataset is needed, which is expensive and time consuming to collect. To address this, we propose an AIR augmentation method that can parametrically con… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
21
0
3

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 34 publications
(24 citation statements)
references
References 9 publications
0
21
0
3
Order By: Relevance
“…Salamon et al used data augmentation to improve environmental sound classification [47]. Similarly, Bryan estimates the T 60 and the direct-toreverberant ratio (DRR) from a single speech recording via augmented datasets [5]. Tang et al trained CRNN models purely based on synthetic spatial IRs that generalize to real-world recordings [60].…”
Section: Related Workmentioning
confidence: 99%
See 4 more Smart Citations
“…Salamon et al used data augmentation to improve environmental sound classification [47]. Similarly, Bryan estimates the T 60 and the direct-toreverberant ratio (DRR) from a single speech recording via augmented datasets [5]. Tang et al trained CRNN models purely based on synthetic spatial IRs that generalize to real-world recordings [60].…”
Section: Related Workmentioning
confidence: 99%
“…Synthetic IRs are easy to obtain and can be used, but again lack wave-based effects as well as other simulation deficiencies. Recent work has addressed this issue by combining real-word IR measurements with augmentation to increase the diversity of existing real-world datasets [5]. This work, however, only addresses T 60 and DRR augmentation, and lacks a method to augment the frequencyequalization of existing IRs.…”
Section: Learning Reverberation and Equalizationmentioning
confidence: 99%
See 3 more Smart Citations