2022
DOI: 10.48550/arxiv.2210.01512
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Code-Switching without Switching: Language Agnostic End-to-End Speech Translation

Abstract: We propose a) a Language Agnostic end-to-end Speech Translation model (LAST), and b) a data augmentation strategy to increase code-switching (CS) performance.With increasing globalization, multiple languages are increasingly used interchangeably during fluent speech. Such CS complicates traditional speech recognition and translation, as we must recognize which language was spoken first and then apply a language-dependent recognizer and subsequent translation component to generate the desired target language ou… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
3
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 20 publications
0
3
0
Order By: Relevance
“…CS ST exbihit difficulties (Huber et al, 2022;Weller et al, 2022), exposing several limitations in this study: 1) Bengali and Sanskrit (another minority language) are treated without difference, as they originate from the same script and Sanskrit is not supported by the Google TTS service. 2)…”
Section: Limitationsmentioning
confidence: 99%
“…CS ST exbihit difficulties (Huber et al, 2022;Weller et al, 2022), exposing several limitations in this study: 1) Bengali and Sanskrit (another minority language) are treated without difference, as they originate from the same script and Sanskrit is not supported by the Google TTS service. 2)…”
Section: Limitationsmentioning
confidence: 99%
“…Finally, for the end-to-end ST system, we used the language-agnostic model from Huber et al (2022) that can decode en-de ST and de ASR.…”
Section: Transcription and Translation Modelsmentioning
confidence: 99%
“…CS ST exbihit difficulties (Huber et al, 2022;Weller et al, 2022), exposing several limitations in this study: 1) Bengali and Sanskrit (another minority language) are treated without difference, as they originate from the same script and Sanskrit is not supported by Google TTS. 2) We use a open-source language detection tool to calculate the oracle hyper-parameters in the dev set; yet, imperfection of the detector and the fact that source sentences are written in Latin regardless of the language deviate the scores from true values.…”
Section: Limitationsmentioning
confidence: 99%