Procedings of the British Machine Vision Conference 2011 2011
DOI: 10.5244/c.25.112
|View full text |Cite
|
Sign up to set email alerts
|

Max-margin Latent Dirichlet Allocation for Image Classification and Annotation

Abstract: We present the max-margin latent Dirichlet allocation, a max-margin variant of supervised topic models, for image classification and annotation. Our model for image classification (called MMLDA c ) integrates discriminative classification with generative topic models. Our model for image annotation (called MMLDA a ) extends MMLDA c to the case of multi-label problems, where each image can be associated with more than one annotation terms. We derive efficient learning algorithms for both models and demonstrate … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
39
0

Year Published

2012
2012
2018
2018

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 29 publications
(39 citation statements)
references
References 18 publications
0
39
0
Order By: Relevance
“…We performed extensive quantitative comparisons of SupDoc-NADE with the original DocNADE model and supervised LDA (sLDA) 1 [10,9]. We also provide some comparisons with MMLDA [12] and a Spatial Pyramid Matching (SPM) approach [17]. The code to download the datasets and for SupDocNADE is available at https://sites.…”
Section: Experiments and Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…We performed extensive quantitative comparisons of SupDoc-NADE with the original DocNADE model and supervised LDA (sLDA) 1 [10,9]. We also provide some comparisons with MMLDA [12] and a Spatial Pyramid Matching (SPM) approach [17]. The code to download the datasets and for SupDocNADE is available at https://sites.…”
Section: Experiments and Resultsmentioning
confidence: 99%
“…We also compare with MMLDA [12], which has been applied to image classification and annotation separately. The reported classification accuracy for MMLDA is less than SupDocNADE as shown in Table 1.…”
Section: Quantitative Comparisonmentioning
confidence: 99%
See 1 more Smart Citation
“…Meanwhile the annotation accuracy of our M 3 DA-RF model tends to a relatively stable value with the growing number of topics, which is mainly because the word-level features (also known as the soft label probability P SVM ) play a dominant role in the multi-level maxmargin discriminative feature space and the topic-level features are just helpful supplements. The most related work to ours is detailed in [16], where the authors only exploit different types of feature representation and do not make full use of the contextual information that may be beneficial to annotation tasks. Some other related studies [1,5,9] have investigated the application of topic model in satellite images annotation task.…”
Section: Discussionmentioning
confidence: 99%
“…A great deal of research efforts have been devoted to bridge the semantic gap. Both low level visual features and high level semantics are explored in previous literatures [1][2][3][4][5][6][7][8][9][10].…”
Section: Introductionmentioning
confidence: 99%