2006 IEEE International Conference on Acoustics Speed and Signal Processing Proceedings
DOI: 10.1109/icassp.2006.1660151
|View full text |Cite
|
Sign up to set email alerts
|

Enhanced Non-Intrusive Speech Quality Measurement Using Degradation Models

Abstract: The speech quality estimation scheme in [1] is improved with the addition of a reference model of the behavior of speech degraded by different transmission and/or coding schemes. Moreover, via maximization of a mutual information measure, we validate the use of segmental SNR as a measure of the amount of multiplicative noise present in the test signal. These two additions result in an algorithm that is more accurate and more robust to certain distortion conditions. When tested on unseen data, the proposed algo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 8 publications
(7 citation statements)
references
References 7 publications
0
7
0
Order By: Relevance
“…Current speech quality measurement algorithms should handle such degradation conditions efficiently. In previous work [24], estimating multiplicative noise is shown to be beneficial for GMM-based speech quality measurement. A multiplicative noise estimator, similar to the one described in [16], was deployed and performance improvement was reported for MNRU degradations.…”
Section: Detecting and Estimating Multiplicative Noisementioning
confidence: 99%
See 1 more Smart Citation
“…Current speech quality measurement algorithms should handle such degradation conditions efficiently. In previous work [24], estimating multiplicative noise is shown to be beneficial for GMM-based speech quality measurement. A multiplicative noise estimator, similar to the one described in [16], was deployed and performance improvement was reported for MNRU degradations.…”
Section: Detecting and Estimating Multiplicative Noisementioning
confidence: 99%
“…Here, a modification to the GMM-based architecture described in [13] is proposed. It is found that accuracy can be enhanced if the algorithm is also equipped with information regarding the behavior of speech degraded by different transmission and coding schemes [24]. To this end, clean speech signals are used to train three different Gaussian mixture densities, .…”
Section: Consistency Calculation and Mos Mappingmentioning
confidence: 99%
“…The noise PSD matrix (estimated during silent periods) and the diffuse sound power were estimated by an autoregressive averaging procedure with a time constant of 50 ms. The VM signal was evaluated using the average fullband segmental signal-to-diffuse-plus-noise ratio (SDNR), the speech-toreverberation modulation ratio (SRMR) [11], and PESQ score [12]. Fig.…”
Section: Setup and Performance Measuresmentioning
confidence: 99%
“…Although there are many studies which shows that the performance provided by this metric in VoIP service can be improved [5], [6] and [7]. This is due to the intrinsic characteristics of the algorithm that implements the P.563 metric, which is mainly based on the analysis of the vocal tract [4], and is not planned to consider external factors, such as, packet losses in an IP network.…”
Section: Introductionmentioning
confidence: 99%