2019
DOI: 10.1250/ast.40.84
|View full text |Cite
|
Sign up to set email alerts
|

Speech intelligibility prediction with the dynamic compressive gammachirp filterbank and modulation power spectrum

Abstract: The speech-based envelope power spectrum model (sEPSM) was developed to predict the speech intelligibility of sounds produced by nonlinear speech enhancement algorithms such as spectral subtraction. It is a linear model with a linear, level-independent gammatone (GT) filterbank as the front-end. Therefore, it seems difficult to evaluate speech sounds with low and high sound pressure levels (SPLs) consistently because the intelligibility of the speech is dependent on the SPL as well as the signal-to-noise ratio… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2019
2019
2020
2020

Publication Types

Select...
2
1

Relationship

2
1

Authors

Journals

citations
Cited by 3 publications
(10 citation statements)
references
References 24 publications
0
10
0
Order By: Relevance
“…The dcGC-FB was also used in models for speech intelligibility prediction [9][10][11][12][13]. A new model referred to as GEDI (the gammachirp envelope distortion index) [10,11] predicted the intelligibility of speech sounds processed with non-linear enhancement algorithms better than other recent indexes like STOI, CSII, and HASPI [12,13].…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…The dcGC-FB was also used in models for speech intelligibility prediction [9][10][11][12][13]. A new model referred to as GEDI (the gammachirp envelope distortion index) [10,11] predicted the intelligibility of speech sounds processed with non-linear enhancement algorithms better than other recent indexes like STOI, CSII, and HASPI [12,13].…”
Section: Resultsmentioning
confidence: 99%
“…The GC architecture enables us to construct a hearing impairment simulator which allows normal hearing listeners to experience the difficulties of hearing impaired listeners [5,6]. The GC has also been used to model speaker size perception [7,8] and speech intelligibility [9][10][11][12][13].…”
Section: Introductionmentioning
confidence: 99%
“…To incorporate characteristics of a human auditory filter, Yamamoto et al (2019) extended sEPSM using a dynamic compressive gammachirp filterbank (dcGC-FB) (Irino & Patterson, 2006), in which the level-dependent frequency selectivity and gain of the auditory filter were reasonably determined by the data obtained from psychoacoustic masking experiments (Patterson et al, 2003). For OIMs, it is important to introduce the appropriate level dependency to incorporate the well-known fundamental knowledge that speech intelligibility is lower as sound level decreases and that peripheral hearing loss decreases the intelligibility.…”
Section: Objective Intelligibility Measures For Speech Enhancementmentioning
confidence: 99%
“…A bank of modulation filters, defined in envelope frequency domain (f env ), is applied to the spectra. There are seven modulation filters whose power spectra are W f c env (f env ) for the modulation center frequency of f c env , as illustrated in Figure 2 and described in previous studies (Jørgensen & Dau, 2011;Yamamoto et al, 2019). The envelope power at the output of the modulation filter is calculated as…”
Section: Sdr In the Envelope Modulation Domainmentioning
confidence: 99%
See 1 more Smart Citation