2009 IEEE International Conference on Acoustics, Speech and Signal Processing 2009
DOI: 10.1109/icassp.2009.4960726
|View full text |Cite
|
Sign up to set email alerts
|

Efficient subword lattice retrieval for German spoken term detection

Abstract: We present a lattice-based STD method for German broadcast news data and compare it to a previously proposed fuzzy search. Due to the important out-of-vocabulary (OOV) problem in German, we evaluate suitable subword indexing units for lattice retrieval. Hybrid lattice retrieval of words and subwords is investigated because of the robust nature of words as an indexing unit. We show that by using efficient lattice graph and score pruning techniques, precision of subword retrieval is increased by 8% absolute with… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
13
0

Year Published

2009
2009
2015
2015

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 24 publications
(13 citation statements)
references
References 4 publications
0
13
0
Order By: Relevance
“…In [184], two hybrid approaches for STD that combine syllables and word ASR output are proposed. The first approach combines words with fuzzy search in a 1-best syllable transcript.…”
Section: Hybrid Approachesmentioning
confidence: 99%
“…In [184], two hybrid approaches for STD that combine syllables and word ASR output are proposed. The first approach combines words with fuzzy search in a 1-best syllable transcript.…”
Section: Hybrid Approachesmentioning
confidence: 99%
“…A typical STD system comprises an ASR subsystem for lattice generation and a STD subsystem for term detection, as illustrated in Figure 1. State-of-the-art STD systems include those reported in [2,3,4,5,6,7]. Fig.…”
Section: Introductionmentioning
confidence: 99%
“…By doing so, the OOV problem is mostly solved since subword dictionaries have much higher coverage than word-based dictionaries. Some typical subword units are phones [56][57][58][59][60][61], syllables [62][63][64], morpheme [65].…”
Section: Challenges and Existing Approaches For Stdmentioning
confidence: 99%
“…The first strategy, referred to as query expansion [66,67], transforms the keyword into other word/subword sequences that might be confused with the actual keyword, then use them additionally to perform retrieval. The second strategy, referred to as fuzzy matching [57,63,68], estimates the distance between the subword sequence of the keyword and a lattice, and produces a matching if the distance is lower than a specific threshold. In these strategies, a confusion model that encodes the insertion/deletion/substitution costs between subword units is required.…”
Section: Challenges and Existing Approaches For Stdmentioning
confidence: 99%
See 1 more Smart Citation