2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA) 2017
DOI: 10.1109/ipta.2017.8310121
|View full text |Cite
|
Sign up to set email alerts
|

Visual place recognition with CNNs: From global to partial

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
12
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(12 citation statements)
references
References 14 publications
0
12
0
Order By: Relevance
“…An initial set of reference image candidates is obtained based on nearest neighbor distances of imagewise global descriptors (Camara et al 2020;B. Liu et al 2021;Xin et al 2017), while local features are used for obtaining a more accurate estimation based on spatial matching (Camara et al 2020;Xin et al 2017) Given that feature maps can extract different types of features depending on the deepness of the respective layers, J. Zhu et al 2018 extracts features from three layers (conv3-3, conv4-4, conv5-3 ) of a VGG16 network and concatenates these to form a global descriptor for an image.…”
Section: Convolutional Neural Network (Cnn)mentioning
confidence: 99%
See 2 more Smart Citations
“…An initial set of reference image candidates is obtained based on nearest neighbor distances of imagewise global descriptors (Camara et al 2020;B. Liu et al 2021;Xin et al 2017), while local features are used for obtaining a more accurate estimation based on spatial matching (Camara et al 2020;Xin et al 2017) Given that feature maps can extract different types of features depending on the deepness of the respective layers, J. Zhu et al 2018 extracts features from three layers (conv3-3, conv4-4, conv5-3 ) of a VGG16 network and concatenates these to form a global descriptor for an image.…”
Section: Convolutional Neural Network (Cnn)mentioning
confidence: 99%
“…Furthermore, the random selection of descriptor components to reduce its dimensions is employed in Naseer, Oliveira, et al 2017 andXin et al 2017 due to not requiring a learning phase, unlike PCA. Naseer, Oliveira, et al 2017 uses Sparse Random Projection for embedding its FastNet-based descriptor with approximately 130000 dimensions into a reduced 4096-dim descriptor.…”
Section: Descriptor Dimension Reductionmentioning
confidence: 99%
See 1 more Smart Citation
“…The task of coarsely estimating the place where a photo was taken based on a set of previously visited locations is called Visual (Image) Geo-localization (VG) [37,42,86] or Visual Place Recognition (VPR) [21,44] and it is addressed using image matching and retrieval methods on a database of images of known locations. We are witnessing a rapid growth of this field of research, as demonstrated by the increasing number of publications [2,10,14,[22][23][24]28,30,36,37,42,44,46,57,60,72,73,[76][77][78][79]81,87], but this expansion is accompanied by two major limitations: i) A focus on single metric optimization, as it is common practice to compare results solely based on the recall on chosen datasets and ignoring other factors such as execution time, hardware requirements, and scalability. All these aspects are important constraints in the design of a real-world VG system.…”
Section: Introductionmentioning
confidence: 99%
“…It can generally be extended to broader areas, including topological mapping, loop closure and drift removal in geometric mapping and learning scene dynamics for long-term localization and mapping. Long-term operations in environments can cause significant image variations including illumination changes, occlusion and scene dynamics [2].…”
Section: Introductionmentioning
confidence: 99%