2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2017
DOI: 10.1109/iros.2017.8202131
|View full text |Cite
|
Sign up to set email alerts
|

Only look once, mining distinctive landmarks from ConvNet for visual place recognition

Abstract: -Recently, image representations derived from Convolutional Neural Networks (CNNs) have been demonstrated to achieve impressive performance on a wide variety of tasks, including place recognition. In this paper, we take a step deeper into the internal structure of CNNs and propose novel CNN-based image features for place recognition by identifying salient regions and creating their regional representations directly from the convolutional layer activations. A range of experiments is conducted on challenging dat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
213
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
4
2
2

Relationship

1
7

Authors

Journals

citations
Cited by 147 publications
(213 citation statements)
references
References 24 publications
0
213
0
Order By: Relevance
“…Similarly, in [26] the attention mechanism was explored to identify salient landmarks for mobile robot localization. Approaches in [8,9] identify salient regions using an external landmark detector. However, their landmark detectors are based on networks that are trained on tasks different in nature from place recognition.…”
Section: B Attention Model For Place Recognitionmentioning
confidence: 99%
See 2 more Smart Citations
“…Similarly, in [26] the attention mechanism was explored to identify salient landmarks for mobile robot localization. Approaches in [8,9] identify salient regions using an external landmark detector. However, their landmark detectors are based on networks that are trained on tasks different in nature from place recognition.…”
Section: B Attention Model For Place Recognitionmentioning
confidence: 99%
“…Similar to [9], we treat activations at a certain convolutional layer as a tensor of size H × W × C, which is considered as a set of C-dimensional local features in H × W spatial location. Formally, the convolutional layer activations X ∈ × × can be expressed as:…”
Section: A Local Feature Extraction From Convolutional Activationsmentioning
confidence: 99%
See 1 more Smart Citation
“…[22] proposes a global image representation by the regional maximum activation of convolutional layers (R-MAC) well-suited for place recognition. [2] proposes novel CNN-based features designed for place recognition by detecting salient regions and extracting regional representations as descriptors. NetVLAD [1] introduces a novel triplet ranking loss together with a VLAD aggregation layer that can learn powerful representations for the VPR task in an end-to end manner.…”
Section: Related Workmentioning
confidence: 99%
“…Their results on some very challenging datasets were remarkable, significantly outperforming state-of-the-art works based on pre-trained CNNs. Recently, following the success of region-based approaches, Chen et al [23] created a system that used a late convolutional layer as a landmark detector and an earlier one to create local descriptors to match the detected landmarks. Their system showed improved recognition under strong viewpoint and condition variations.…”
Section: Related Workmentioning
confidence: 99%