Proceedings of the ACM International Conference on Image and Video Retrieval 2009
DOI: 10.1145/1646396.1646408
|View full text |Cite
|
Sign up to set email alerts
|

Multilayer pLSA for multimodal image retrieval

Abstract: It is current state of knowledge that our neocortex consists of six layers [10]. We take this knowledge from neuroscience as an inspiration to extend the standard single-layer probabilistic Latent Semantic Analysis (pLSA) [13] to multiple layers. As multiple layers should naturally handle multiple modalities and a hierarchy of abstractions, we denote this new approach multilayer multimodal probabilistic Latent Semantic Analysis (mm-pLSA). We derive the training and inference rules for the smallest possible non… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

3
58
1
1

Year Published

2010
2010
2019
2019

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 74 publications
(63 citation statements)
references
References 22 publications
3
58
1
1
Order By: Relevance
“…This is similar to the methods in [15]. We also compare our multimodal pLSA with multilayer multi modal pLSA(mm-pLSA) proposed by [11]. From the comparative study on a variety of popular data sets, we find that the results obtained by direct method are superior to other methods.…”
Section: Inputsupporting
confidence: 57%
See 2 more Smart Citations
“…This is similar to the methods in [15]. We also compare our multimodal pLSA with multilayer multi modal pLSA(mm-pLSA) proposed by [11]. From the comparative study on a variety of popular data sets, we find that the results obtained by direct method are superior to other methods.…”
Section: Inputsupporting
confidence: 57%
“…Then a maximal margin classifier is applied for retrieval. Romberg et al [11] proposed a mmpLSA, with two separate leaf-pLSAs, and a single top level pLSA node merging the two leaf-pLSAs. Here, they apply pLSA to each mode, i.e., visual features and textual words separately, and then concatenate the derived topic vectors of each mode to learn another pLSA on top of that.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…This method aims at eliminating the most noisy words generated by the vocabulary building process, using multilayer pLSA. Lienhart et al [18] proposed a multilayer multi-modal probabilistic Latent Semantic Analysis (mm-pLSA). The proposed approach (mm-pLSA) has two modes: one mode for visual words and the other one for image tags.…”
Section: Filtering the Noisy Visual Wordsmentioning
confidence: 99%
“…Once the graph is constructed, standard graph mining techniques such as Pagerank [4] can be used to identify the "authority" vertices (images). Besides image visual features, it has been shown in previous work [5] that integrating other modalities can boost retrieval performance. In the work of [6], graphs have been used to convey multimodal information for search and retrieval.…”
Section: Introductionmentioning
confidence: 99%