Proceedings of the Third Workshop on Searching Spontaneous Conversational Speech 2009
DOI: 10.1145/1631127.1631132
|View full text |Cite
|
Sign up to set email alerts
|

The effect of language models on phonetic decoding for spoken term detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2011
2011
2019
2019

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 8 publications
0
3
0
Order By: Relevance
“…Moreover, ASR errors replacing a word by a semantically dissimilar word were shown to have more impact on retrieval performance than a word with close meaning [109]. Another interesting observation is that although better language models were shown to reduce the ASR error rate, this did not always translate to better STD performance [39], [110], [111]. This is probably because language models tend to bias the decoding towards word sequences frequently appearing in the training data of the language models, but in the training data the terminologies or topic-specific terms often used in the queries are usually rare [111].…”
Section: Modified Speech Recognition For Retrieval Purposesmentioning
confidence: 99%
See 1 more Smart Citation
“…Moreover, ASR errors replacing a word by a semantically dissimilar word were shown to have more impact on retrieval performance than a word with close meaning [109]. Another interesting observation is that although better language models were shown to reduce the ASR error rate, this did not always translate to better STD performance [39], [110], [111]. This is probably because language models tend to bias the decoding towards word sequences frequently appearing in the training data of the language models, but in the training data the terminologies or topic-specific terms often used in the queries are usually rare [111].…”
Section: Modified Speech Recognition For Retrieval Purposesmentioning
confidence: 99%
“…Another interesting observation is that although better language models were shown to reduce the ASR error rate, this did not always translate to better STD performance [39], [110], [111]. This is probably because language models tend to bias the decoding towards word sequences frequently appearing in the training data of the language models, but in the training data the terminologies or topic-specific terms often used in the queries are usually rare [111]. In addition, because usually lattices instead of one-best transcripts are used in spoken content retrieval, expected error rate defined over the lattices should be in principle a better predictor of retrieval performance than the error rate of one-best transcriptions [112].…”
Section: Modified Speech Recognition For Retrieval Purposesmentioning
confidence: 99%
“…The deal with this problem, a combination of phoneme recognition and LVCSR output was proposed [15], [16]. In addition, Wallace et al [17] proposed a language modeling method for improving the phoneme recognition.…”
Section: Related Workmentioning
confidence: 99%