2019 International Symposium on Multimedia and Communication Technology (ISMAC) 2019
DOI: 10.1109/ismac.2019.8836146
|View full text |Cite
|
Sign up to set email alerts
|

Korean Grapheme Unit-based Speech Recognition Using Attention-CTC Ensemble Network

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
9
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
3
3

Relationship

0
6

Authors

Journals

citations
Cited by 9 publications
(9 citation statements)
references
References 5 publications
0
9
0
Order By: Relevance
“…End-to-end speech recognition is an active area of research. Many research and papers like [7,22] are announced, and multiple ways to improve efficiency and accuracy are proposed. One of them is an attention mechanism, that provides decoder RNN more information when it produces the output tokens [10].…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…End-to-end speech recognition is an active area of research. Many research and papers like [7,22] are announced, and multiple ways to improve efficiency and accuracy are proposed. One of them is an attention mechanism, that provides decoder RNN more information when it produces the output tokens [10].…”
Section: Related Workmentioning
confidence: 99%
“…These datasets are still influential as benchmark datasets these days [34,35,36]. Recently, however, Librispeech [18] is the most popular benchmark speech corpus on which the latest state-of-the-art ASR models are evaluated [22,37,38]. Although they are greatly useful and dependable, existing speech corpora mainly deal with English or non-Korean language.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…This unique writing system allows us to build a Korean ASR model that is based on either graphemes or syllable blocks. So far, most Korean ASR models (Kim et al, 2020;Bang et al, 2020;Ha et al, 2020) were developed with a high-level modeling unit, syllables, and only a few studies attempted to use graphemes (Park et al, 2019b;. According to previous research that investigated modeling units in Korean ASR tasks, syllable-based models outperform grapheme-based models on Zeroth-Korean dataset (51.6 hrs) in most cases (Wang et al, 2020).…”
Section: Introductionmentioning
confidence: 99%