2015
DOI: 10.1121/1.4916271
|View full text |Cite
|
Sign up to set email alerts
|

Structure in time-frequency binary masking errors and its impact on speech intelligibility

Abstract: Although requiring prior knowledge makes the ideal binary mask an impractical algorithm, substantial increases in measured intelligibility make it a desirable benchmark. While this benchmark has been studied extensively, many questions remain about the factors that influence the intelligibility of binary-masked speech with non-ideal masks. To date, researchers have used primarily uniformly random, uncorrelated mask errors and independently presented error types (i.e., false positives and negatives) to characte… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
12
0

Year Published

2016
2016
2018
2018

Publication Types

Select...
4
1

Relationship

1
4

Authors

Journals

citations
Cited by 9 publications
(13 citation statements)
references
References 25 publications
1
12
0
Order By: Relevance
“…However, H-FA fails to predict the impact of the distribution of errors, and instead, predicts that all masks with the same error rates yield the same intelligibility outcome. Thus, even though the correlation between mean H-FA and behavioral scores for conditions with c ¼ 2.0 (i.e., the conditions with an error distribution that most closely match the error distribution of the estimated masks of Kim et al;Kressner and Rozell, 2015) is high (r ¼ 0.97), H-FA is unable to account for the differences in the behavioral scores that arise when masks contain errors that are distributed differently. In contrast to H-FA, STOI is able to qualitatively predict the trends in the behavioral data when false negative errors are presented, as demonstrated by the similarities between Figs.…”
Section: Methodsmentioning
confidence: 93%
See 4 more Smart Citations
“…However, H-FA fails to predict the impact of the distribution of errors, and instead, predicts that all masks with the same error rates yield the same intelligibility outcome. Thus, even though the correlation between mean H-FA and behavioral scores for conditions with c ¼ 2.0 (i.e., the conditions with an error distribution that most closely match the error distribution of the estimated masks of Kim et al;Kressner and Rozell, 2015) is high (r ¼ 0.97), H-FA is unable to account for the differences in the behavioral scores that arise when masks contain errors that are distributed differently. In contrast to H-FA, STOI is able to qualitatively predict the trends in the behavioral data when false negative errors are presented, as demonstrated by the similarities between Figs.…”
Section: Methodsmentioning
confidence: 93%
“…RESULTS Figure 1 shows the behavioral results and predicted scores for the first two experiments of Kressner and Rozell (2015). Figure 1(a) shows the behavioral results when false positive errors are introduced (i.e., more energy from the interferer-dominated T-F units is erroneously retained), whereas Fig.…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations