2016
DOI: 10.1109/taslp.2016.2577502
|View full text |Cite
|
Sign up to set email alerts
|

Estimation of Room Acoustic Parameters: The ACE Challenge

Abstract: Abstract-Reverberation Time (T60) and Direct-to-Reverberant Ratio (DRR) are important parameters which together can characterize sound captured by microphones in non-anechoic rooms. These parameters are important in speech processing applications such as speech recognition and dereverberation. The values of T60 and DRR can be estimated directly from the Acoustic Impulse Response (AIR) of the room. In practice, the AIR is not normally available, in which case these parameters must be estimated blindly from the … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
116
0
2

Year Published

2017
2017
2024
2024

Publication Types

Select...
3
3
2

Relationship

1
7

Authors

Journals

citations
Cited by 135 publications
(119 citation statements)
references
References 27 publications
1
116
0
2
Order By: Relevance
“…We compare our proposed CNN network trained on our data (Our CNN + AIRA) with previously published state-of-theart results from the ACE challenge [5], previously published GT-CNN results [10] for T60 only, and our reimplementation of the GT-CNN [10] estimator trained on our augmented dataset (GT-CNN + AIRA) for both T60 and DRR. Table 2 and Table 3 show T60 and DRR estimation bias, mean squared error, and Pearson correlation coefficient ρ results.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…We compare our proposed CNN network trained on our data (Our CNN + AIRA) with previously published state-of-theart results from the ACE challenge [5], previously published GT-CNN results [10] for T60 only, and our reimplementation of the GT-CNN [10] estimator trained on our augmented dataset (GT-CNN + AIRA) for both T60 and DRR. Table 2 and Table 3 show T60 and DRR estimation bias, mean squared error, and Pearson correlation coefficient ρ results.…”
Section: Discussionmentioning
confidence: 99%
“…where h(t) is an AIR, t is a discrete time index, h e (t) is the early response, h l (t) is the late-field response, t d is the time delay of the direct path, and t 0 is tolerance window set to 2.5 ms [5]. We identify the location of the direct path as the time of the maximum of the absolute value of h(t).…”
Section: Impulse Response Augmentationmentioning
confidence: 99%
“…1a appears to almost perfectly model the AIR when comparinĝ h(t) to h(t). The same process was repeated for a measured AIR, part of the Acoustic Characterization of Environments (ACE) Database [22]. The recording took place in a lecture room with dimensions [6.9, 9.7, 3.0] m and the receiver is one of the channels of a 32 microphone array.…”
Section: Methodsmentioning
confidence: 99%
“…For the second experiment the task was to model the 42 AIRs recorded using a 3 microphone mobile phone array in 7 rooms provided in the ACE database [22]. This corresponds to 2 sets of measurements, with the receiver positions varying between the two and the rest of the setup unchanged.…”
Section: Methodsmentioning
confidence: 99%
“…We follow the methodology of Karjalainen et al [25] when computing the T 60 from real IRs with a measurable noise floor. This method was found to be the most robust estimator when computing the T 60 from real IRs in recent work [14]. The final composition of our dataset is listed in Table 2.…”
Section: Data Augmentationmentioning
confidence: 99%