2010
DOI: 10.1145/1658377.1658379
|View full text |Cite
|
Sign up to set email alerts
|

Statistical lattice-based spoken document retrieval

Abstract: Recent research efforts on spoken document retrieval have tried to overcome the low quality of 1-best automatic speech recognition transcripts, especially in the case of conversational speech, by using statistics derived from speech lattices containing multiple transcription hypotheses as output by a speech recognizer. We present a method for lattice-based spoken document retrieval based on a statistical n-gram modeling approach to information retrieval. In this statistical lattice-based retrieval (SLBR) metho… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
23
0

Year Published

2010
2010
2023
2023

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 30 publications
(23 citation statements)
references
References 33 publications
(46 reference statements)
0
23
0
Order By: Relevance
“…Furthermore, the various WTM models have also been demonstrated to work effectively in the SDR task. As to future work, we envisage the following three directions: 1) utilizing speech summarization techniques to help better estimate the query and document models [26], 2) integrating the document model with other more elaborate representations of the speech recognition output [10], and 3) further confirming our observations on larger-scale experiments.…”
Section: Discussionmentioning
confidence: 80%
See 1 more Smart Citation
“…Furthermore, the various WTM models have also been demonstrated to work effectively in the SDR task. As to future work, we envisage the following three directions: 1) utilizing speech summarization techniques to help better estimate the query and document models [26], 2) integrating the document model with other more elaborate representations of the speech recognition output [10], and 3) further confirming our observations on larger-scale experiments.…”
Section: Discussionmentioning
confidence: 80%
“…More recently, language modeling (LM) for SDR has received great attention due to its inherent neat formulation and clear probabilistic meaning, as well as state-of-the-art performance [7]- [10]. In practice, the relevance measure for various LM approaches is usually computed by two distinct matching strategies, namely, literal term matching and concept matching [11].…”
Section: Introductionmentioning
confidence: 99%
“…The lack of information about term frequencies compromises the performance of the BIM and led to the development of the extension of the BIM referred to as the Okapi or BM25 model [227,228]. This extension introduces a sensitivity to term frequency and document length into the original BIM and has been demonstrated to yield good performance for many IR tasks including SCR [47,137].…”
Section: Information Retrieval For Scr 263mentioning
confidence: 99%
“…In Equation 2.7, the contributions from terms are represented in the log domain to prevent underflow and they are combined using a simple sum. SCR work that has made use of the language modeling framework includes [47].…”
Section: Information Retrieval For Scr 263mentioning
confidence: 99%
“…For machine translation, [11] used n-best list for reranking by optimizing interpolation weights for ASR and translation, and [12] used confusion network, but without much improvement over n-best. Lattices have been studied intensively for spoken document retrieval and indexing [13,14], with reported better performance than just using 1-best ASR output. For named entity recognition on speech data, [15] used n-best lists and obtained small improvement.…”
Section: Related Workmentioning
confidence: 99%