Proceedings of the 29th Annual ACM Symposium on Applied Computing 2014
DOI: 10.1145/2554850.2555058
|View full text |Cite
|
Sign up to set email alerts
|

Representing local binary descriptors with BossaNova for visual recognition

Abstract: Binary descriptors have recently become very popular in visual recognition tasks. This popularity is largely due to their low complexity and for presenting similar performances when compared to non binary descriptors, like SIFT. In literature, many researchers have applied binary descriptors in conjunction with mid-level representations (e.g., Bag-ofWords). However, despite these works have demonstrated promising results, their main problems are due to use of a simple mid-level representation and the use of bi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
23
0

Year Published

2015
2015
2022
2022

Publication Types

Select...
5
1

Relationship

3
3

Authors

Journals

citations
Cited by 14 publications
(23 citation statements)
references
References 25 publications
0
23
0
Order By: Relevance
“…In Figure 4, we illustrate the overview of our experimental approach. Note that this methodology is adapted from [4,17], in which the binary image descriptors are computed followed by the computation of a mid-level representation for each keyframe. Our proposed video descriptors are computed according to Equation 3 for representing each video that will be classified.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…In Figure 4, we illustrate the overview of our experimental approach. Note that this methodology is adapted from [4,17], in which the binary image descriptors are computed followed by the computation of a mid-level representation for each keyframe. Our proposed video descriptors are computed according to Equation 3 for representing each video that will be classified.…”
Section: Methodsmentioning
confidence: 99%
“…FREAK binary descriptors are encoded by a BoW representation and a SVM classifier is applied to recognition. We also explored in [17] the benefit of using several binary descriptors and mid-level representation for videos to identify pornographic content with a SVM classifier and a majority voting scheme.…”
Section: Local Features and Bag-of-words Based Approachesmentioning
confidence: 99%
“…First, BossaNova Video Descriptor was adopted using SIFT to represent the video frames. We kept the BossaNova parameter values the same as in [4,5] (B = 2, λ min = 0.4, λ max = 2.0, s = 10 −3 , M = 1024) and we used two aggregating functions: max and median. For each video class, BNVDs generated are stored in a Slim-Tree [15] used for similarity searches.…”
Section: Methodsmentioning
confidence: 99%
“…To the best of our knowledge, this outperforms the best results published on this dataset for this type of features. For instance, Caetano et al [6] reports a performance of 36.2% mAP with a model [4] computed based on a vocabulary of 1024 codewords, spatial pyramids and non-linear SVMs.…”
Section: Pascal Voc 2007mentioning
confidence: 99%
“…More related to our work, Uchida and Sakazawa [42] derived a FV based on mixtures of Bernoulli pdfs which was shown to perform better than the BoBW in an object retrieval task. In [6], the authors propose a model that extends the BoVW by computing histograms of distances between the set of descriptors and each element in the codebook, learned using the k-medians algorithm and the Hamming distance.…”
Section: Introductionmentioning
confidence: 99%