2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2017
DOI: 10.1109/icassp.2017.7953264
|View full text |Cite
|
Sign up to set email alerts
|

Constructing sub-word units for spoken term detection

Abstract: Spoken term detection, especially of out-of-vocabulary (OOV) keywords, benefits from the use of sub-word systems. We experiment with different language-independent approaches to sub-word unit generation, generating both syllable-like and morpheme-like units, and demonstrate how the performance of syllable-like units can be improved by artificially increasing the number of unique units. The effect of unit choice is empirically evaluated using the eight languages from the 2016 IARPA BABEL evaluation.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 16 publications
(10 citation statements)
references
References 17 publications
0
10
0
Order By: Relevance
“…However, subword-based ASR [22,24,[60][61][62][63][64][65][66] is also being used, sometimes in combination with word-based ASR. One of the main challenges of using word-based ASR in this context is that, in principle, only in-vocabulary (INV) terms can be detected.…”
Section: Spoken Term Detectionmentioning
confidence: 99%
See 1 more Smart Citation
“…However, subword-based ASR [22,24,[60][61][62][63][64][65][66] is also being used, sometimes in combination with word-based ASR. One of the main challenges of using word-based ASR in this context is that, in principle, only in-vocabulary (INV) terms can be detected.…”
Section: Spoken Term Detectionmentioning
confidence: 99%
“…This program focused on building fully automatic and noise-robust speech recognition and search systems in a very limited amount of time (e.g., one week) and with limited amount of training data. The languages addressed in that program were low-resourced, such as Cantonese, Pashto, Tagalog, Turkish, Vietnamese, Swahili, Tamil and so on, and significant research has been carried out [13,61,[147][148][149][150][151][152][153][154][155][156][157][158][159].…”
Section: Comparison With Previous Std International Evaluationsmentioning
confidence: 99%
“…The novel problem of lexeme-set KWS is related to work on out-of-vocabulary KWS, which has been approached by handling sub-word units such as syllables and morphemes (Trmal et al, 2014;Narasimhan et al, 2014;van Heerden et al, 2017;He et al, 2016). In contrast to KWS with sub-word granularity, our approach is to generate likely full-word inflections given a lemma.…”
Section: Abstractearchmentioning
confidence: 99%
“…For the ASR stage, word-based speech recognition has been widely used [35,[48][49][50][51][52][53][54], since this typically yields better performance than subword-based ASR [55][56][57][58][59][60][61][62] due to the lexical and language model (LM) information employed by the word-based ASR. However, one of the main drawbacks of word-based ASR is that it can only detect in-vocabulary (INV) terms.…”
Section: Spoken Term Detection Overviewmentioning
confidence: 99%