2012
DOI: 10.1016/j.specom.2011.09.004
|View full text |Cite
|
Sign up to set email alerts
|

Speech intelligibility prediction using a Neurogram Similarity Index Measure

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
41
0

Year Published

2013
2013
2024
2024

Publication Types

Select...
7
1

Relationship

2
6

Authors

Journals

citations
Cited by 71 publications
(41 citation statements)
references
References 31 publications
0
41
0
Order By: Relevance
“…ViSQOL was inspired by prior work on speech intelligibility by two of the authors [32,33]. This work used a model of the auditory periphery [34] to produce auditory nerve discharge outputs by computationally simulating the middle and inner ear.…”
Section: Measuring Speech Quality Through Spectrogram Similaritymentioning
confidence: 99%
See 2 more Smart Citations
“…ViSQOL was inspired by prior work on speech intelligibility by two of the authors [32,33]. This work used a model of the auditory periphery [34] to produce auditory nerve discharge outputs by computationally simulating the middle and inner ear.…”
Section: Measuring Speech Quality Through Spectrogram Similaritymentioning
confidence: 99%
“…Each reference patch is aligned with the corresponding area from the test spectrogram. The Neurogram Similarity Index Measure (NSIM) [33] is used to measure the similarity between the reference patch and a test spectrogram patch frame by frame, thus identifying the maximum similarity point for each patch. This is shown in the bottom pane of Figure 5 where each line graphs the NSIM similarity score over time for each patch in the reference signal compared with the example signal.…”
Section: Time Alignmentmentioning
confidence: 99%
See 1 more Smart Citation
“…Each reference patch is aligned with the corresponding area from the test spectrogram. The Neurogram Similarity Index Measure (NSIM) [11] is used to align and measure the similarity between the reference and a test spectrogram patches, as in Figure 1. The NSIM score per patch is averaged over the patches to yield the similarity metric for the test signal.…”
Section: Visqol Modelmentioning
confidence: 99%
“…This work considers four metrics from this perspective. A Structural Similarity Metric (SSIM) originating from image compression analysis has been successfully applied recently to time-frequency representations of biological neurogram signals [7] and speech signals [8], prompting the current study into its use for aeroacoustics. For images, the optimal compression stores an acceptable representation of the image as compactly as possible, ignoring extraneous detail and retaining only the information most important for human interpretation (i.e.…”
Section: Introductionmentioning
confidence: 99%