2020
DOI: 10.48550/arxiv.2003.09024
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Techniques for Vocabulary Expansion in Hybrid Speech Recognition Systems

Nikolay Malkovsky,
Vladimir Bataev,
Dmitrii Sviridkin
et al.

Abstract: The problem of out of vocabulary words (OOV) is typical for any speech recognition system, hybrid systems are usually constructed to recognize a fixed set of words and rarely can include all the words that will be encountered during exploitation of the system. One of the popular approach to cover OOVs is to use subword units rather then words. Such system can potentially recognize any previously unseen word if the word can be constructed from present subword units, but also nonexisting words can be recognized.… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
1
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 23 publications
0
1
0
Order By: Relevance
“…If the system is a conventional hybrid (HMM-DNN-based acoustic model and word-based ngram language model), the OOV problem is often solved by dynamic expanding the system's vocabulary and/or adapting the language model (e.g. Khokhlov et al, 2017;Gandhe et al, 2018;Malkovsky et al, 2020). A less common approach is to use a subword-based n-gram language model (Smit et al, 2017).…”
mentioning
confidence: 99%
“…If the system is a conventional hybrid (HMM-DNN-based acoustic model and word-based ngram language model), the OOV problem is often solved by dynamic expanding the system's vocabulary and/or adapting the language model (e.g. Khokhlov et al, 2017;Gandhe et al, 2018;Malkovsky et al, 2020). A less common approach is to use a subword-based n-gram language model (Smit et al, 2017).…”
mentioning
confidence: 99%