2012
DOI: 10.1109/tpami.2011.227
|View full text |Cite
|
Sign up to set email alerts
|

Learning Hybrid Image Templates (HIT) by Information Projection

Abstract: Abstract-This paper presents a novel framework for learning a generative image representation -the hybrid image template (HIT) from a small number (i.e, 3 ∼ 20) of image examples. Each learned template is composed of, typically, 50 ∼ 500 image patches whose geometric attributes (location, scale, orientation) may adapt in a local neighborhood for deformation, and whose appearances are characterized respectively by four types of descriptors: local sketch (edge or bar), texture gradients with orientations, flatne… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
76
0
6

Year Published

2013
2013
2023
2023

Publication Types

Select...
6
3

Relationship

3
6

Authors

Journals

citations
Cited by 80 publications
(82 citation statements)
references
References 40 publications
0
76
0
6
Order By: Relevance
“…Experiment 8: Multi-class classification. Our second set of experiments is on the LHI-Animal-Faces dataset [51], which consists of around 2200 images for 20 categories of animal or human faces. We randomly select half of the images per class for training and the rest for testing.…”
Section: Using Learned Codebooks For Object Classificationmentioning
confidence: 99%
See 1 more Smart Citation
“…Experiment 8: Multi-class classification. Our second set of experiments is on the LHI-Animal-Faces dataset [51], which consists of around 2200 images for 20 categories of animal or human faces. We randomly select half of the images per class for training and the rest for testing.…”
Section: Using Learned Codebooks For Object Classificationmentioning
confidence: 99%
“…Our classification rate is 79.4%. For comparison, Table 3 lists four published results [51] on this dataset obtained by other methods: (a) HoG feature trained with SVM, (b) Hybrid Image Template (HIT) [51], (c) multiple transformation invariant HITs (Mixture of HIT) [51], and (d) part-based HoG feature trained with latent SVM [16]. Our method outperforms the other methods in terms of classification accuracy on this dataset.…”
Section: Using Learned Codebooks For Object Classificationmentioning
confidence: 99%
“…Second experiment had a different scenario, it was done on a single category set of real images of cat faces from the LHI-Animal-Faces dataset [17]. The subset of cat faces consisted of 89 images roughly on a same scale.A small selection of images from this set is shown in the Figure 5(a).The images were preprocessed before entering the learning algorithm in order to extract oriented edge segments.…”
Section: Learning Cat Facesmentioning
confidence: 99%
“…The AOT includes: 1) hierarchical composition as "AND" nodes, 2) deformation and articulation of parts as geometric "OR" nodes, and 3) multiple ways of composition as structural "OR" nodes. The terminal nodes are hybrid image templates (HIT) [3] that are entirely creative to the pixels. Author shows that both the structures and parameters of the AOT model can be learned in an unsupervised way from images using an information projection principle.…”
Section: Zhangzhang Si Et Al [2]mentioning
confidence: 99%