Proceedings of the 6th ACM International Conference on Image and Video Retrieval 2007
DOI: 10.1145/1282280.1282352
|View full text |Cite
|
Sign up to set email alerts
|

Towards optimal bag-of-features for object categorization and semantic video retrieval

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
369
0
12

Year Published

2008
2008
2019
2019

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 538 publications
(383 citation statements)
references
References 15 publications
2
369
0
12
Order By: Relevance
“…In recent study, there has been an intense focus on applying term weighting schemes (like tf, idf) to the 'bag-of-visual-words' feature vectors [4,6] . Extensive study in [15] suggests that when the vocabulary size of visual words is around 1000, the tf-idf weighting performs best. Recall that in our approach, the vocabulary size of visual words for the data collection is D=1000, to make the experimental results comparable, we chose to compare our spatial weighting approach with the tf-idf weighted 'bag-of-visual-words' approach [4] .…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…In recent study, there has been an intense focus on applying term weighting schemes (like tf, idf) to the 'bag-of-visual-words' feature vectors [4,6] . Extensive study in [15] suggests that when the vocabulary size of visual words is around 1000, the tf-idf weighting performs best. Recall that in our approach, the vocabulary size of visual words for the data collection is D=1000, to make the experimental results comparable, we chose to compare our spatial weighting approach with the tf-idf weighted 'bag-of-visual-words' approach [4] .…”
Section: Resultsmentioning
confidence: 99%
“…Many effective text mining and information retrieval algorithms like tf-idf weighting, stop word removal and feature selection have been applied to the vector-space model of visual-words. Problems such as how vocabulary size and term weighting schemes affect the performance of 'bag-of-visualwords' representation are also studied in recent research works [4,15] .…”
Section: Introductionmentioning
confidence: 99%
“…In particular, so-called "bag-of-features" models have become very popular, where the computation of local image descriptors is initiated by either scale invariant or affine invariant interest points (Sivic et al [145], Nowak et al [33], Jiang et al [59]); see also Mikolajczyk et al [126] for an experimental evaluation of image descriptors at interest points, Moreels and Perona [127] and Aanaes et al [1] for evaluations of feature detectors and image descriptors on three-dimensional datasets and Kaneva et al [65] for evaluations using photorealistic virtual worlds. For all of these recognition approaches, the invariance properties of the recognition system rely heavily on the invariance properties of the interest points at which local image descriptors are computed.…”
Section: Related Workmentioning
confidence: 99%
“…The image signature then can be seen as a histogram of occurred visual words. The works ( [5], [6]) reflected the fact that the quantization effect provides a very coarse approximation to the actual distance between two features -zero if assigned to the same visual word and infinite otherwise. Such approach is called hard assignment.…”
Section: Visual Words Assignmentmentioning
confidence: 99%
“…The work [5] compares several detectors of local image features and evaluate their performance on visual codebooks with different sizes. The explored detectors are time expensive and the codebook sizes are only up to 10k.…”
Section: Introductionmentioning
confidence: 99%