2020
DOI: 10.48550/arxiv.2006.05220
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Rethinking Localization Map: Towards Accurate Object Perception with Self-Enhancement Maps

Abstract: Recently, remarkable progress has been made in weakly supervised object localization (WSOL) to promote object localization maps. The common practice of evaluating these maps applies an indirect and coarse way, i.e., obtaining tight bounding boxes which can cover high-activation regions and calculating intersection-over-union (IoU) scores between the predicted and ground-truth boxes. This measurement can evaluate the ability of localization maps to some extent, but we argue that the maps should be measured dire… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
21
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
1
1

Relationship

2
4

Authors

Journals

citations
Cited by 10 publications
(21 citation statements)
references
References 40 publications
0
21
0
Order By: Relevance
“…DANet [43] applied a divergent activation to learn complementary visual cues for WSOL. SEM [50] refined the localization maps by using the point-wise similarity within seed regions. GC-Net [19] took geometric shapes into account and proposed a multi-task loss function for WSOL.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…DANet [43] applied a divergent activation to learn complementary visual cues for WSOL. SEM [50] refined the localization maps by using the point-wise similarity within seed regions. GC-Net [19] took geometric shapes into account and proposed a multi-task loss function for WSOL.…”
Section: Related Workmentioning
confidence: 99%
“…To alleviate such a limitation, one solution is to utilize pixel similarity and global cues to refine activation maps [38,39,49,50]. Cao et al [4] found that the global contexts modeled by non-local networks are almost the same for query positions and thereby proposed NLNet [38] with SENet [12] for global context modeling.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…For a similar reason, [46] train two classifiers, where one covers the feature maps of the other. [49] improves the representation of the classifier by employing quantization. In contrast, our method employs a standard pre-trained classifier and obtains state-of-the-art results out-of-the-box.…”
Section: Discussionmentioning
confidence: 99%
“…The divergent activation approach (DA) [42] aggregates and shares information from different spatial layers of the backbone. A similarity score that contrasts high-activation regions with other regions was proposed by [49].…”
Section: Related Workmentioning
confidence: 99%