Backpropagation Training for Fisher Vectors within Neural Networks

Wieschollek, Patrick; Groh, Fabian; Lensch, Hendrik P. A.

doi:10.48550/arxiv.1702.02549

Cited by 1 publication

(3 citation statements)

References 15 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Further details for the derivation of these gradients can be found in the supplementary material. Note that similar formulas are also provided by [36].…”

Section: New Fisher Vector Encoding Layer (Fve-layer)mentioning

confidence: 92%

“…In contrast to the aforementioned methods that compute an FVE of local features extracted separately from the image, Wieshollek et al [36] and Tang et al [31] deploy the FVE directly in a neural network. As a result, the features are learned jointly with the parameters for both the classification and the mixture model.…”

Section: Variants Of Deep Fisher Vector Encoding (Deep Fve)mentioning

confidence: 99%

“…In contrast to previous approaches that apply some kind of FVE together with CNNs [5,28,31,36], our method determines the underlying Gaussian mixture model (GMM) with an iterative EM algorithm using mini-batch updates of parameters. With our approach, the parameters of the GMM are estimated jointly with the parameters of the CNN in an end-to-end manner.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

End-to-end Learning of a Fisher Vector Encoding for Part Features in Fine-grained Recognition

Korsch,

Bodesheim,

Denzler

2020

Preprint

View full text Add to dashboard Cite

Part-based approaches for fine-grained recognition do not show the expected performance gain over global methods, although being able to explicitly focus on small details that are relevant for distinguishing highly similar classes. We assume that part-based methods suffer from a missing representation of local features, which is invariant to the order of parts and can handle a varying number of visible parts appropriately. The order of parts is artificial and often only given by ground-truth annotations, whereas viewpoint variations and occlusions result in parts that are not observable. Therefore, we propose integrating a Fisher vector encoding of part features into convolutional neural networks. The parameters for this encoding are estimated jointly with those of the neural network in an end-to-end manner. Our approach improves state-of-the-art accuracies for bird species classification on CUB-200-2011 from 90.40% to 90.95%, on NA-Birds from 89.20% to 90.30%, and on Birdsnap from 84.30% to 86.97%.

show abstract

“…Further details for the derivation of these gradients can be found in the supplementary material. Note that similar formulas are also provided by [36].…”

Section: New Fisher Vector Encoding Layer (Fve-layer)mentioning

confidence: 92%

Section: Variants Of Deep Fisher Vector Encoding (Deep Fve)mentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

End-to-end Learning of a Fisher Vector Encoding for Part Features in Fine-grained Recognition

Korsch,

Bodesheim,

Denzler

2020

Preprint

View full text Add to dashboard Cite

show abstract

Backpropagation Training for Fisher Vectors within Neural Networks

Cited by 1 publication

References 15 publications

End-to-end Learning of a Fisher Vector Encoding for Part Features in Fine-grained Recognition

End-to-end Learning of a Fisher Vector Encoding for Part Features in Fine-grained Recognition

Contact Info

Product

Resources

About