2024
DOI: 10.1121/10.0026358

A perceptual similarity space for speech based on self-supervised speech representations

Bronya R. Chernyak, Ann R. Bradlow, Joseph Keshet, et al.

Abstract: Speech recognition by both humans and machines frequently fails in non-optimal yet common situations. For example, word recognition error rates for second-language (L2) speech can be high, especially under conditions involving background noise. At the same time, both human and machine speech recognition sometimes shows remarkable robustness against signal- and noise-related degradation. Which acoustic features of speech explain this substantial variation in intelligibility? Current approaches align speech to t…

Cited by 1 publication
References 47 publications