2011 9th International Workshop on Content-Based Multimedia Indexing (CBMI) 2011
DOI: 10.1109/cbmi.2011.5972521
|View full text |Cite
|
Sign up to set email alerts
|

Cross-site combination and evaluation of subword spoken term detection systems

Abstract: The design and evaluation of subword-based spoken term detection (STD) systems depends on various factors, such as language, type of the speech to be searched and application scenario. The choice of the subword unit and search approach, however, is oftentimes made regardless of these factors. Therefore, we evaluate two subword STD systems across two data sets with varying properties to investigate the influence of different subword units on STD performance when working with different data types. Results show t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2011
2011
2013
2013

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 15 publications
0
2
0
Order By: Relevance
“…This, in turn, consists of two parts: first, a pronunciation is hypothesized using phonemic subword units, and second, said pronunciation is converted to a spelling. Only generating a pronunciation for an unknown word is sufficient in applications such as Spoken Term Detection (STD) [6], where phonemic representations of speech are adequate for indexing and search. For transcription, however, an orthography needs to be estimated from a given phonemic subword sequence, as in [7], where phone transcriptions are converted to spellings using memory-based learning for Dutch OOV words.…”
Section: Introductionmentioning
confidence: 99%
“…This, in turn, consists of two parts: first, a pronunciation is hypothesized using phonemic subword units, and second, said pronunciation is converted to a spelling. Only generating a pronunciation for an unknown word is sufficient in applications such as Spoken Term Detection (STD) [6], where phonemic representations of speech are adequate for indexing and search. For transcription, however, an orthography needs to be estimated from a given phonemic subword sequence, as in [7], where phone transcriptions are converted to spellings using memory-based learning for Dutch OOV words.…”
Section: Introductionmentioning
confidence: 99%
“…Despite the bad performance exhibited by the configuration 4a corresponding to system 4, it must be noted that this was not optimized for the final metric (i.e., ATWV) but to get a predefined hit coverage, which greatly affects the final ATWV performance [62] and hence, a fair comparison with the rest of the systems cannot be made.…”
Section: Performance Analysis Of the Qbe Std Systems For Specific Quementioning
confidence: 99%