2020
DOI: 10.48550/arxiv.2012.11896
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Adversarial Meta Sampling for Multilingual Low-Resource Speech Recognition

Abstract: Low-resource automatic speech recognition (ASR) is challenging, as the low-resource target language data cannot well train an ASR model. To solve this issue, meta-learning formulates ASR for each source language into many small ASR tasks and meta-learns a model initialization on all tasks from different source languages to access fast adaptation on unseen target languages. However, for different source languages, the quantity and difficulty vary greatly because of their different data scales and diverse phonol… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
2
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(5 citation statements)
references
References 38 publications
0
2
0
Order By: Relevance
“…Bilingual and multilingual recognition without CS is where the speech recognition system is not expected to switch from one language to another during the recognition process. These bilingual and multilingual systems usually use the existing databases for well-resourced languages [27,28,35,37,38,[44][45][46]57] and are self-developed for under-resourced languages [31,32,40].…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…Bilingual and multilingual recognition without CS is where the speech recognition system is not expected to switch from one language to another during the recognition process. These bilingual and multilingual systems usually use the existing databases for well-resourced languages [27,28,35,37,38,[44][45][46]57] and are self-developed for under-resourced languages [31,32,40].…”
Section: Discussionmentioning
confidence: 99%
“…Individuals that can speak more than one language (e.g., bilingual or multilingual) CS or mix their languages when communicating with others [3,5,[27][28][29][30][31][32][33][34][35][36][37][38][39]. CS is common in bilingual and multilingual communities (different cultures and language backgrounds) [40].…”
Section: Cs In Bilingual and Multilingual Speech Recognition Systemsmentioning
confidence: 99%
See 1 more Smart Citation
“…(4). This is probably due to the task-difficulty imbalance issue described in (Xiao et al, 2020), perhaps some speakers with special accents may be hard to separate. Thus, in the future, we will try to solve this problem with meta sampling methods mentioned in (Xiao et al, 2020).…”
Section: Discussionmentioning
confidence: 99%
“…(Winata et al, 2020) applied metatransfer learning on code-switched speech recognition. (Xiao et al, 2020; applied meta-learning to solve the multilingual lowresource speech recognition problem. (Winata et al, 2019) also used MAML to adapt models to unseen accents on speech recognition.…”
Section: Related Workmentioning
confidence: 99%