2008
DOI: 10.1121/1.2935732
|View full text |Cite
|
Sign up to set email alerts
|

Estimation model for the speech-quality dimension “Noisiness.”

Abstract: State-of-the-art assessment method of speech-transmission quality (e.g., PESQ or TOSQA) predict the mean-opinion score (MOS) quite accurately, but cannot provide diagnostic information, which is, however, highly desirable for system developers. In our research project, we aim at the development of an attribute-based speech-quality measure, which provides estimates of different attributes of speech samples and then maps them to one integral-quality estimate. Three dominant, mutually orthogonal perceptual dimens… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
7
0

Year Published

2010
2010
2022
2022

Publication Types

Select...
3
1
1

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(7 citation statements)
references
References 5 publications
0
7
0
Order By: Relevance
“…Additive Artifacts sub-dimension (i) The "r A " indicator found in [3] quantifies the rate of perceived artifacts using the artifact estimator developed in [8]. This estimator uses a Weighted Spectral Slope (WSS) algorithm to detect distortions in the interpolated frames, mostly due to PLC techniques.…”
Section: A Interruptedness Sub-dimensionmentioning
confidence: 99%
See 1 more Smart Citation
“…Additive Artifacts sub-dimension (i) The "r A " indicator found in [3] quantifies the rate of perceived artifacts using the artifact estimator developed in [8]. This estimator uses a Weighted Spectral Slope (WSS) algorithm to detect distortions in the interpolated frames, mostly due to PLC techniques.…”
Section: A Interruptedness Sub-dimensionmentioning
confidence: 99%
“…The sub-dimensions of the "Continuity" dimension we used in what follows have been determined (see [8]) in a purely narrow-band context (but we believe that they apply also on larger audio bandwidths) from auditory test results on speech samples degraded by different levels of packet/frame loss, front-end clipping or musical noise. These sub-dimensions are:…”
Section: Introductionmentioning
confidence: 99%
“…In this section, we first report how we identified and/or estimated these QDIs in POLQA and compared them with the corresponding explicit indicators found in DIAL, for the four perceptual dimensions mentioned above. Furthermore, recent studies [18][19][20] have…”
Section: Comparative Analysis In Terms Of Perceptual Dimensionsmentioning
confidence: 99%
“…This set is composed of a DFC estimator developed by Scholz and Heute [15] and a Continuity estimator developed by Huo [16], both extended to S-WB transmissions. Two new estimators have been developed for the dimensions Noisiness and Loudness.…”
Section: A Cognitive Modelmentioning
confidence: 99%
“…including continuous variable delay. The estimator for the dimension Continuity has been developed by [16]. This estimator detects the discontinuities in the speech signal using Weighted Spectral Slope (WSS) distances [18] and the system gain variation (∆ gain).…”
Section: Continuitymentioning
confidence: 99%