Interspeech 2022 2022
DOI: 10.21437/interspeech.2022-799
|View full text |Cite
|
Sign up to set email alerts
|

Using Rater and System Metadata to Explain Variance in the VoiceMOS Challenge 2022 Dataset

Abstract: Non-reference speech quality models are important for a growing number of applications. The VoiceMOS 2022 challenge provided a dataset of synthetic voice conversion and text-tospeech samples with subjective labels. This study looks at the amount of variance that can be explained in subjective ratings of speech quality from metadata and the distribution imbalances of the dataset. Speech quality models were constructed using wav2vec 2.0 with additional metadata features that included rater groups and system iden… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 19 publications
0
1
0
Order By: Relevance
“…• Another analysis focused on the metadata of the BVCC dataset [144]. They used the SSL-MOS model and added metadata information.…”
Section: Team Approachesmentioning
confidence: 99%
“…• Another analysis focused on the metadata of the BVCC dataset [144]. They used the SSL-MOS model and added metadata information.…”
Section: Team Approachesmentioning
confidence: 99%