Proceedings of the 12th International Conference on Distributed Smart Cameras 2018
DOI: 10.1145/3243394.3243686
|View full text |Cite
|
Sign up to set email alerts
|

An accurate retrieval through R-MAC+ descriptors for landmark recognition

Abstract: The landmark recognition problem is far from being solved, but with the use of features extracted from intermediate layers of Convolutional Neural Networks (CNNs), excellent results have been obtained. In this work, we propose some improvements on the creation of R-MAC descriptors in order to make the newly-proposed R-MAC+ descriptors more representative than the previous ones. However, the main contribution of this paper is a novel retrieval technique, that exploits the fine representativeness of the MAC desc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
17
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
4
2
2

Relationship

4
4

Authors

Journals

citations
Cited by 16 publications
(17 citation statements)
references
References 33 publications
0
17
0
Order By: Relevance
“…This process is known as "transfer learning" and consists in tuning the parameters trained in one feature space in order to work in another feature space [21]. Some methods that use transfer learning are: Spatial pooling [25], MOP-CNN [7], Neural codes [3], Ng et al [34], CCS [33], OC [26], R-MAC [31], Gordo et al [8] and Magliani et al [18]. Also, fine-tuning global descriptors [9] on a similar image dataset, allows to highly improve accuracy results, but with an extra time effort due to the training phase on the new dataset.…”
Section: Related Workmentioning
confidence: 99%
“…This process is known as "transfer learning" and consists in tuning the parameters trained in one feature space in order to work in another feature space [21]. Some methods that use transfer learning are: Spatial pooling [25], MOP-CNN [7], Neural codes [3], Ng et al [34], CCS [33], OC [26], R-MAC [31], Gordo et al [8] and Magliani et al [18]. Also, fine-tuning global descriptors [9] on a similar image dataset, allows to highly improve accuracy results, but with an extra time effort due to the training phase on the new dataset.…”
Section: Related Workmentioning
confidence: 99%
“…[53] proposes R-mac, also a pooling strategy, that was later trained end-to-end in [17] using the Siamese architecture. [28] improves upon R-mac [53] specifically for landmark recognition. In the same spirit, [46] uses R-mac in order to train CNN feature predictors for image retrieval.…”
Section: Related Workmentioning
confidence: 99%
“…Traditionally, vision-based localization [59] is tackled with structure-based methods, such as Structurefrom-Motion (SfM) [20,24,44,42,62,9] and Simultaneous Localization and Mapping (SLAM) [33,11,7,5,16], or with retrieval-based approaches [23,49,4,3,15,28,43,19]. Structure based methods usually focus on accurate relative pose estimation, while retrieval-based approaches prioritize absolute re-localization.…”
Section: Introductionmentioning
confidence: 99%
“…Additionally, several papers have considered the application of embedding strategies on the feature maps extracted from the CNN (the most frequently used being, at present, VGG16 and ResNet101) in order to create a global descriptor. In fact, recently, several strategies have been proposed that are aimed at making the descriptors less sensitive to scale changes, rotation, occlusions, and so on [ 3 , 4 , 5 , 6 ].…”
Section: Introductionmentioning
confidence: 99%