2010
DOI: 10.1007/978-3-642-15561-1_11
|View full text |Cite
|
Sign up to set email alerts
|

Improving the Fisher Kernel for Large-Scale Image Classification

Abstract: Abstract. The Fisher kernel (FK) is a generic framework which combines the benefits of generative and discriminative approaches. In the context of image classification the FK was shown to extend the popular bag-of-visual-words (BOV) by going beyond count statistics. However, in practice, this enriched representation has not yet shown its superiority over the BOV. In the first part we show that with several well-motivated modifications over the original framework we can boost the accuracy of the FK. On PASCAL V… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

15
1,935
0
4

Year Published

2012
2012
2019
2019

Publication Types

Select...
4
3
1

Relationship

2
6

Authors

Journals

citations
Cited by 1,974 publications
(1,954 citation statements)
references
References 18 publications
15
1,935
0
4
Order By: Relevance
“…The image representation we used was the improved Fisher Vector (FV) but without spatial pyramids (cf. [12]). Before going into details regarding the experimental procedure, we give a brief overview of this state-of-the-art image signature.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…The image representation we used was the improved Fisher Vector (FV) but without spatial pyramids (cf. [12]). Before going into details regarding the experimental procedure, we give a brief overview of this state-of-the-art image signature.…”
Section: Methodsmentioning
confidence: 99%
“…Before going into details regarding the experimental procedure, we give a brief overview of this state-of-the-art image signature. Details can be found in [11,12].…”
Section: Methodsmentioning
confidence: 99%
“…Then LOAD feature is calculated on round patches densely extracted from the eigth images (the original plus the seven in different scales). The Improved Fisher Vector (IFV) by Perronnin et al (2010) is used to encode the features to preserve its discriminative power. Then, classification is performed through a linear SVM.…”
Section: Accepted Manuscriptmentioning
confidence: 99%
“…These patch statistics are then aggregated at an image level. We choose to compute the patch statistics using the Fisher Vector (FV) principle [19], since it obtained state-of-the-art results in image retrieval [5] and classification [2]. We assume that we have a generative model of patches (a Gaussian Mixture Model in our case) and measure the gradient of the log-likelihood of the descriptor with respect to the model parameters.…”
Section: Image Embeddingmentioning
confidence: 99%
“…To include spatial information about the word image into the signature, we can partition the image into regions, aggregate the per-patch statistics at a region level and then concatenate the region-level signatures as proposed for instance in [8]. See [19] for more details about the FV.…”
Section: Image Embeddingmentioning
confidence: 99%