2011
DOI: 10.1109/tasl.2011.2114881
|View full text |Cite
|
Sign up to set email alerts
|

An Algorithm for Intelligibility Prediction of Time–Frequency Weighted Noisy Speech

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

7
966
0
19

Year Published

2016
2016
2023
2023

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 1,831 publications
(992 citation statements)
references
References 28 publications
7
966
0
19
Order By: Relevance
“…Especially popular measures are STOI [9], PESQ [10], or the word error rates of speech recognition systems. PESQ was originally designed as a measure for speech quality rather than intelligibility, but was then found to also correlate reasonably well with subjective intelligibility [11].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Especially popular measures are STOI [9], PESQ [10], or the word error rates of speech recognition systems. PESQ was originally designed as a measure for speech quality rather than intelligibility, but was then found to also correlate reasonably well with subjective intelligibility [11].…”
Section: Introductionmentioning
confidence: 99%
“…PESQ was originally designed as a measure for speech quality rather than intelligibility, but was then found to also correlate reasonably well with subjective intelligibility [11]. None of today's objective measures of intelligibility can perfectly predict intelligibility to humans, and their correlation depends on the type of speech degradation present [9,12].…”
Section: Introductionmentioning
confidence: 99%
“…Although these measures are designed to model human hearing, many of the most successful ones are based on principles that apply equally well to machine speech recognition. For example, in recent years, a simple algorithm known as the Short-Time Objective Intelligibility (STOI) measure has been shown to be a good predictor of intelligibility in a wide range of applications including time-frequency weighted noisy speech (Taal et al, 2011). The STOI measure is based on the sum of the correlation between the envelopes of the clean speech signal and the corrupted speech measured with 15 1/3-octave frequency bands starting at 150 Hz.…”
Section: Objective Intelligibility Measuresmentioning
confidence: 99%
“…The same reconstruction procedures were used in [15] [17]. Table 2 summarizes the comparisons of the wide-matching method against five conventional speech enhancement methods on the Segmental SNR, PESQ and STOI [21] measures, respectively, as a function of the input test sentence SNR averaged over 1152 test sentences (i.e., 192 test sentences per noise type × 6 noise types) under each SNR condition. The wide-matching method did not use any noise estimation while the conventional methods, Log-MMSE [22], LogMMSE-SPU [23], Wiener filtering [24], KLT [25] and Perceptual KLT [26], each used an algorithm to estimate the noise.…”
Section: Experimental Studiesmentioning
confidence: 99%