2020 23rd Conference of the Oriental COCOSDA International Committee for the Co-Ordination and Standardisation of Speech Databa 2020
DOI: 10.1109/o-cocosda50338.2020.9295019

Formosa Speech Recognition Challenge 2020 and Taiwanese Across Taiwan Corpus

citations
Cited by 9 publications
(6 citation statements)
references
References 1 publication
“…The Hokkien ASR is pre-trained on 10k-hr Mandarin speech from WenetSpeech and 2k-hr Hokkien speech, which is a combination of TAT (480hr), Hokkien dramas (1k-hr) and SpeechOcean (597-hr), with Conformer wav2vec 2.0 LARGE model. We then finetuned the model with CTC loss on 480-hr Hokkien speech and Tâi-lô scripts from TAT (Liao et al, 2020), with each Tâi-lô syllable split into initial and final with tone as the finetuning target. To further improve the ASR accuracy, we apply another round of self-training by generating pseudo labels on the same set of Hokkien speech used in speech encoder pre-training.…”
Section: Discussion (citation type: mentioning)
confidence: 99%
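
The statement above says each Tâi-lô syllable is split into an initial and a final that carries the tone. A minimal sketch of that kind of split is shown below. It is not the cited authors' code: it assumes numeric-tone ASCII Tâi-lô input (for example `tai5-gi2`), uses the standard Tâi-lô initial inventory, and the names `INITIALS`, `split_syllable` and `to_targets` are hypothetical.

```python
import re

# Assumed inventory: the standard Tâi-lô initials, longest first so that
# "tsh" is tried before "ts" and "t". The cited paper does not list its own.
INITIALS = ["tsh", "ph", "th", "kh", "ng", "ts",
            "p", "b", "m", "t", "n", "l", "k", "g", "h", "s", "j"]

# Syllabic nasals form a whole syllable by themselves (no initial).
SYLLABIC_NASALS = {"m", "ng"}

# Assumed input shape: ASCII letters followed by an optional tone digit.
SYLLABLE = re.compile(r"([a-z]+)([1-9]?)")


def split_syllable(syllable):
    """Return (initial, final_with_tone) for one numeric-tone syllable."""
    match = SYLLABLE.fullmatch(syllable.lower())
    if match is None:
        raise ValueError(f"not a numeric-tone syllable: {syllable!r}")
    letters, tone = match.groups()
    if letters in SYLLABIC_NASALS:
        return "", letters + tone
    for initial in INITIALS:
        if letters.startswith(initial):
            return initial, letters[len(initial):] + tone
    # No consonant onset: zero initial, the whole syllable is the final.
    return "", letters + tone


def to_targets(text):
    """Flatten hyphen- or space-separated syllables into a token list."""
    tokens = []
    for syllable in re.split(r"[\s-]+", text.strip()):
        if not syllable:
            continue
        initial, final = split_syllable(syllable)
        if initial:
            tokens.append(initial)
        tokens.append(final)
    return tokens
```

Under these assumptions, `to_targets("tai5-gi2")` yields the tokens `t`, `ai5`, `g`, `i2`. Keeping the tone on the final means one output unit encodes both the rime and the tone, which matches the wording of the statement; a real system might choose a different inventory or tone notation.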
“…Since there are not many En↔Hokkien bilingual speakers who can directly translate between the two languages, we use Mandarin as a pivot language during the data creation process whenever possible. We sample from the following data sources and adopt different strategies to create human annotated parallel data: (1) Hokkien dramas, which include Hokkien speech and aligned Mandarin subtitles, (2) Taiwanese Across Taiwan (TAT) (Liao et al, 2020), a Hokkien read speech dataset containing transcripts in Tâi-lô and Hanji, and (3) MuST-C v1.2 En-Zh S2T data (Cattoni et al, 2021).…”
Section: Supervised Human Annotated Data (citation type: mentioning)
confidence: 99%
“…NSYSU-MITLab participated in the Formosa Speech Recognition Challenge 2020 (FSR-2020), which focused on the low-resource language Taiwanese (Taiwanese Hokkien) [99].…”
Section: DNN-based Approach to Build Acoustic Model (citation type: mentioning)
confidence: 99%
“…Taiwanese Hokkien, also known as Taiwanese, Hokkien, Taigi, Southern Min, or Min-Nan, is a branched-off variety of Southern Min dialects popular in Taiwan. Under the history background (Chen, 2008), the ability to use Taiwanese Hokkien declines by age (Chen, 2008; Liao et al, 2020; Tan, 2019; of Linguistics at Academia Sinica, 2007; Yang, 2021; Pan, 2016; Ho, 2020). Taiwanese Hokkien has always been the most widely spoken dialect in Taiwan, many people can have conversations in both Mandarin and Taiwanese Hokkien.…”
Section: Background of Taiwanese Hokkien (citation type: mentioning)
confidence: 99%
“…Although Mandarin is the dominant language in Taiwan, Taiwanese Hokkien has nearly as many speakers as Mandarin (Liao et al, 2020). Taiwanese tend to mix dialects and Mandarin in daily communication, creating code-mixed languages such as Taiwanese Hokkien-Mandarin or Hakka-Mandarin.…”
Section: Introduction (citation type: mentioning)
confidence: 99%