2010 Second International Workshop on Quality of Multimedia Experience (QoMEX) 2010
DOI: 10.1109/qomex.2010.5516236
|View full text |Cite
|
Sign up to set email alerts
|

Probabilistic non-intrusive quality assessment of speech for bounded-scale preference scores

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
18
0

Year Published

2012
2012
2017
2017

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(18 citation statements)
references
References 13 publications
0
18
0
Order By: Relevance
“…The work is based on traditional PLP model, bark spectrum and MFCC. Additionally, a probabilistic, non-intrusive method for quality assessment of speech that takes into consideration the bounded character of the preference scores was proposed in [23].The special achievement of this approach is that by using the Maximum likelihood estimation to learn the model parameters, the learning stage is efficiently even from a small amount of training data.…”
Section: Recent Findingsmentioning
confidence: 99%
“…The work is based on traditional PLP model, bark spectrum and MFCC. Additionally, a probabilistic, non-intrusive method for quality assessment of speech that takes into consideration the bounded character of the preference scores was proposed in [23].The special achievement of this approach is that by using the Maximum likelihood estimation to learn the model parameters, the learning stage is efficiently even from a small amount of training data.…”
Section: Recent Findingsmentioning
confidence: 99%
“…Reviewing the results in Table 4.4 indicates that our model is competing with the model introduced in [62]. In the following, we provide a more detailed comparison of these competitive results for each iteration of the cross-validation and this will be statistically significance tested at the end of the section.…”
Section: Experiments With Quality Assessmentmentioning
confidence: 99%
“…These systems may differ in the considered features, the predicting function, or both. The work de-scribed in [60] makes use of a classifier to predict the discrete value of quality score while the works described in [23,24,25,26,27,28,29,31,61,62] apply regression methods (with shallow architectures) to estimate the subjective Mean Opinion Score (MOS) assigned to a speech file. On the other hand, approaches in [16,30] use a combination of classification and regression algorithms as the predicting function.…”
Section: Relevant Workmentioning
confidence: 99%
“…The labelled data that are publicly available for development and training of supervised learning based quality assessment is limited, and collecting more training data is usually expensive. The ITU-T coded speech data set, Supplement 23 [34], is the most well-known public labelled database that is commonly used for training or the evaluation of objective speech quality systems [14,23,24,26,28,29,44,8,46,61,62,64,66]. Supplement 23 database contains speech affected by noise, packet loss and various codecs and their corresponding subjective quality score.…”
Section: Relevant Workmentioning
confidence: 99%
See 1 more Smart Citation