2021
DOI: 10.48550/arxiv.2104.10747
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Accented Speech Recognition: A Survey

Arthur Hinsvark,
Natalie Delworth,
Miguel Del Rio
et al.

Abstract: Automatic Speech Recognition (ASR) systems generalize poorly on accented speech. The phonetic and linguistic variability of accents present hard challenges for ASR systems today in both data collection and modeling strategies. The resulting bias in ASR performance across accents comes at a cost to both users and providers of ASR.We present a survey of current promising approaches to accented speech recognition and highlight the key challenges in the space. Approaches mostly focus on single model generalization… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
6
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
2
1
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(7 citation statements)
references
References 30 publications
1
6
0
Order By: Relevance
“…Modeling approaches include accent conversion [14], where a transformation is applied to a non-native utterance to make it sound as if the speaker had a native accent. For a thorough overview of research on improving speech recognition for accents, see [15].…”
Section: Previous Workmentioning
confidence: 99%
“…Modeling approaches include accent conversion [14], where a transformation is applied to a non-native utterance to make it sound as if the speaker had a native accent. For a thorough overview of research on improving speech recognition for accents, see [15].…”
Section: Previous Workmentioning
confidence: 99%
“…These large datasets are usually derived from fairly heterogeneous groups (Koenecke et al 2020). Given this, we see gender (Alsharhan and Ramsay 2020;Tatman 2017), race (Blodgett and O'Connor 2017), disability status (Fok et al 2018), and native language/dialect (Hinsvark et al 2021) disparities in successful ASR, reducing technology accessibility for underrepresented groups. Contemporaneous work (Liu et al 2021) introduced a dataset to measure ASR performance across age, gender and skin type.…”
Section: Related Workmentioning
confidence: 99%
“…In this analysis, we split based on perceived gender, accent, muffled-ness, background noise, and volume. The accessibility hurdles faced by members of minority populations in learning systems are well documented (Hinsvark et al 2021;Tatman 2017;Koenecke et al 2020;Alsharhan and Ramsay 2020). These hurdles can be attributed in part to lack of representation of minority groups in large datasets, but other factors also come into play, especially with smaller datasets and feature-engineered methods.…”
Section: User Trait-based Analysismentioning
confidence: 99%
See 1 more Smart Citation
“…Similar issues exist within many other fields. For example for speech recognition, a big challenge is the low variability of dialects and accents in the available data 14 . An example with a lack of negative data is anomaly detection, where rare events by definition only occur infrequently 15 .…”
mentioning
confidence: 99%