2006
DOI: 10.1109/lsp.2005.861598
|View full text |Cite
|
Sign up to set email alerts
|

Nonintrusive speech quality estimation using Gaussian mixture models

Abstract: We propose a novel method to estimate the quality of coded speech signals. The joint probability distribution of the subjective mean opinion score (MOS) and perceptual distortion feature variables is modelled using a Gaussian mixture density. The feature variables are sifted from a large pool of candidate features using statistical data mining techniques. We study what combinations of features and mixture model configuration are most effective. For our speech database, a five-feature, three-component GMM furni… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
34
0

Year Published

2008
2008
2023
2023

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 45 publications
(34 citation statements)
references
References 13 publications
0
34
0
Order By: Relevance
“…Falk et al [20] made an extensive use of GMMs. In their first works [20], GMMs were used to generate artificial reference models of speech behavior.…”
Section: Gaussian Mixture Modelsmentioning
confidence: 99%
See 1 more Smart Citation
“…Falk et al [20] made an extensive use of GMMs. In their first works [20], GMMs were used to generate artificial reference models of speech behavior.…”
Section: Gaussian Mixture Modelsmentioning
confidence: 99%
“…In their first works [20], GMMs were used to generate artificial reference models of speech behavior. This technique compares distortion features introduced in these reference models with those affecting the real signal stream.…”
Section: Gaussian Mixture Modelsmentioning
confidence: 99%
“…The current ITU-T industry standard algorithm for non-intrusive speech quality assessment is P.563 [15], which uses a number of features from the audio stream to estimate the quality score directly on the MOS scale. More recently a number of techniques that use machine learning methods such as Gaussian mixture models (GMMs) to model perceptual speech features have been proposed by Falk et al [16], [17], [18]. Additionally, speech quality metrics based on a data-mining approach using CART have also been developed [19], [20], [21].…”
Section: Introductionmentioning
confidence: 99%
“…GM models have been used extensively for speech processing and have also shown to be useful in intrusive measurement algorithms [10]. GM models are introduced here only for the sake of notation.…”
Section: Gmm-based Non-intrusive Speech Quality Estimationmentioning
confidence: 99%