2016
DOI: 10.1121/1.4954254
|View full text |Cite
|
Sign up to set email alerts
|

Predicting binaural speech intelligibility using the signal-to-noise ratio in the envelope power spectrum domain

Abstract: This study proposes a binaural extension to the multi-resolution speech-based envelope power spectrum model (mr-sEPSM) [Jørgensen, Ewert, and Dau (2013). J. Acoust. Soc. Am. 134, 436-446]. It consists of a combination of better-ear (BE) and binaural unmasking processes, implemented as two monaural realizations of the mr-sEPSM combined with a short-term equalization-cancellation process, and uses the signal-to-noise ratio in the envelope domain (SNRenv) as the decision metric. The model requires only two parame… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
16
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 23 publications
(16 citation statements)
references
References 48 publications
0
16
0
Order By: Relevance
“…In the case of spatially-separated sources, speech intelligibility can be improved compared to collocated sources due to “better-ear” listening, where the sound at one ear, at a given moment, may provide an improved target-to-masker ratio, and/or due to the benefit of binaural unmasking, which improves the “internal” target-to-masker ratio (often conceptualized as an equalization-cancellation process; Durlach, 1972). Both strategies, better-ear listening and binaural unmasking, have been considered in various speech intelligibility modelling approaches (e.g., Beutelmann and Brand, 2006; Beutelmann et al , 2010; Lavandier and Culling, 2010; Wan et al , 2010; Rennies et al , 2011; Lavandier et al , 2012; Wan et al , 2014; Chabot-Leclerc et al , 2016) and are thought to reduce the effects of energetic masking (EM) of the target sound by the interferer(s).…”
Section: Introductionmentioning
confidence: 99%
“…In the case of spatially-separated sources, speech intelligibility can be improved compared to collocated sources due to “better-ear” listening, where the sound at one ear, at a given moment, may provide an improved target-to-masker ratio, and/or due to the benefit of binaural unmasking, which improves the “internal” target-to-masker ratio (often conceptualized as an equalization-cancellation process; Durlach, 1972). Both strategies, better-ear listening and binaural unmasking, have been considered in various speech intelligibility modelling approaches (e.g., Beutelmann and Brand, 2006; Beutelmann et al , 2010; Lavandier and Culling, 2010; Wan et al , 2010; Rennies et al , 2011; Lavandier et al , 2012; Wan et al , 2014; Chabot-Leclerc et al , 2016) and are thought to reduce the effects of energetic masking (EM) of the target sound by the interferer(s).…”
Section: Introductionmentioning
confidence: 99%
“…This reduction of binaural unmasking adds to the detrimental effect of reverberation and echoes on the speech signal (e.g., Steeneken and Houtgast, 1980;Arweiler and Buchholz, 2011;Rennies et al, 2011a;Warzybok et al, 2013), which is superimposed with delayed and attenuated versions of itself, resulting in temporal smearing and significantly reduced temporal modulations, which are considered crucial for good speech understanding (Steeneken and Houtgast, 1980). The studies of binaural advantages for speech intelligibility reviewed above, as well as many others reported over the years, have led to a relatively good understanding of the factors that most affect intelligibility in noise and reverberation, and quantitative models have been developed that can predict intelligibility in many binaural conditions with a very high degree of precision (e.g., Lavandier and Culling, 2010;Lavandier et al, 2012;Leclère et al, 2015;Chabot-Leclerc et al, 2016;Rennies et al, 2014b;Hauth and Brand, 2018). In contrast, binaural effects in listening effort have received very little attention.…”
Section: Introductionmentioning
confidence: 99%
“…[12], [13], [14], [15], and conditions with binaural listening, e.g. [16], [17], [8], [18], [9], [19], [20], [21]. Several works have, furthermore, proposed SIP algorithms that account for hearing loss, e.g.…”
Section: Introductionmentioning
confidence: 99%
“…[4], [22], [23], [24], [25]. On the theoretical level, efforts have been made to design algorithms that accurately model the auditory system in a physiological sense [26], [27], [10], [21], [24], [28], or which account for speech intelligibility from an information theoretical viewpoint [29], [30], [31], [32]. SIP algorithms can be classified according to the input signals required for making predictions.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation