Interspeech 2022 2022
DOI: 10.21437/interspeech.2022-10597
|View full text |Cite
|
Sign up to set email alerts
|

ConferencingSpeech 2022 Challenge: Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge for Online Conferencing Applications

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
4

Relationship

0
10

Authors

Journals

citations
Cited by 21 publications
(5 citation statements)
references
References 0 publications
0
5
0
Order By: Relevance
“…However, it was developed for narrow-band applications, and works on limited impairment types, but correlates poorly with human ratings [32]. More recently, DNN-based approaches have been proposed to estimate the speech quality scores [13], [14], [32]- [38]. Some of these learning-based approaches use other objective metrics as the ground truth to train their speech quality predictor.…”
Section: Related Workmentioning
confidence: 99%
“…However, it was developed for narrow-band applications, and works on limited impairment types, but correlates poorly with human ratings [32]. More recently, DNN-based approaches have been proposed to estimate the speech quality scores [13], [14], [32]- [38]. Some of these learning-based approaches use other objective metrics as the ground truth to train their speech quality predictor.…”
Section: Related Workmentioning
confidence: 99%
“…Non-intrusive objective speech quality assessment tools like ITU-T P.563 [15] do not require a reference, though it has a low correlation to subjective quality [16]. Newer neural net-based methods such as [16], [17], [18], [19] provide better correlations to subjective quality. NISQA [20] is an objective metric for P.804, though the correlation to subjective quality is not sufficient to use as a challenge metric (in the ConferencingSpeech 2022 Challenge [19] NISQA was used as a baseline model and achieved a Pearson Correlation Coefficient = 0.724 to MOS).…”
Section: Related Workmentioning
confidence: 99%
“…Another recent challenge [24] focused on predicting speech quality in speech conferencing applications, and also saw several submissions, e.g. [25,26], making use of SSL representations.…”
Section: Quality Prediction Using Ssl Modelsmentioning
confidence: 99%