2009
DOI: 10.1145/1482343.1482345
|View full text |Cite
|
Sign up to set email alerts
|

Word Topic Models for Spoken Document Retrieval and Transcription

Abstract: Statistical language modeling (LM), which aims to capture the regularities in human natural language and quantify the acceptability of a given word sequence, has long been an interesting yet challenging research topic in the speech and language processing community. It also has been introduced to information retrieval (IR) problems, and provided an effective and theoretically attractive probabilistic framework for building IR systems. In this article, we propose a word topic model (WTM) to explore the co-occur… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
21
0

Year Published

2010
2010
2023
2023

Publication Types

Select...
5
1
1

Relationship

2
5

Authors

Journals

citations
Cited by 27 publications
(21 citation statements)
references
References 39 publications
0
21
0
Order By: Relevance
“…Apart from treating each document in the collection as a document topic model, we can regard each word w j of the language as a word topic model (WTM) [11], [17]. To get to this point, all words are assumed to share a common set of latent topic distributions but have different weights over these topics.…”
Section: Word Topic Model (Wtm)mentioning
confidence: 99%
See 2 more Smart Citations
“…Apart from treating each document in the collection as a document topic model, we can regard each word w j of the language as a word topic model (WTM) [11], [17]. To get to this point, all words are assumed to share a common set of latent topic distributions but have different weights over these topics.…”
Section: Word Topic Model (Wtm)mentioning
confidence: 99%
“…In practice, the relevance measure for various LM approaches is usually computed by two distinct matching strategies, namely, literal term matching and concept matching [11]. The unigram language model (ULM) is the most prominent example for literal term matching [7], [8].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…They have been employed for tackling text mining problems including document classification (Jameel and Lam 2013b;Rubin et al 2012;Li et al 2015) and document retrieval (Wei and Croft 2006;Wang et al 2007;Chen 2009;Yi and Allan 2009;Egozi et al 2011;Andrzejewski and Buttler 2011;Wang et al 2011Wang et al , 2013aLu et al 2011;Yi and Allan 2008;Cao et al 2007a;Park and Ramamohanarao 2009;Duan et al 2012). These models can achieve better performance via detecting the latent topic structure and establishing a relationship between the latent topic and the goal of the problem.…”
Section: Introductionmentioning
confidence: 99%
“…It has been applied unimodally to text or transcribed speech for language modeling [15], document clustering [16], spoken document retrieval [17], document summarization [18], etc. The objective of our current work is to apply LSM in capturing the latent semantics of the multimodal user inputs.…”
Section: Introductionmentioning
confidence: 99%