Object representation enhancement for self‐supervised colocalization

Li, Huifang; Li, Yidong; Jin, Yi; Wang, Tao

doi:10.1002/int.22938

Cited by 2 publications

(5 citation statements)

References 32 publications


“…We report the performance of our method and the state-of-the-art methods in terms of GT-known Loc on five commonly used datasets in Table 1. We compare our method with recent self-supervised object localization methods including ORE [29], PsyNet [1], Ki et al. [23], JGP [38] and C²AM [48], as well as object localization methods without finetuning including DDT [45], MO [54], LOST [37], and TokenCut [42]. As shown in Table 1, our method significantly surpasses other methods on all benchmark datasets.…”

Section: Results (mentioning)

confidence: 93%
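The statement above reports results as GT-known Loc. As a minimal sketch, assuming the conventional definition (a predicted box counts as correct when its intersection-over-union with the ground-truth box is at least 0.5), the metric could be computed as follows. The function names, the `(x0, y0, x1, y1)` box layout, and the 0.5 default are illustrative assumptions, not taken from the cited paper.

```python
def iou(box_a, box_b):
    """Intersection-over-union of two boxes given as (x0, y0, x1, y1)."""
    # Corners of the overlapping rectangle (may be empty).
    ix0, iy0 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    ix1, iy1 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0.0, ix1 - ix0) * max(0.0, iy1 - iy0)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def gt_known_loc(pred_boxes, gt_boxes, threshold=0.5):
    """Fraction of images whose predicted box reaches the IoU threshold.

    Assumes one predicted box and one ground-truth box per image.
    """
    hits = sum(iou(p, g) >= threshold for p, g in zip(pred_boxes, gt_boxes))
    return hits / len(pred_boxes)
```

Because the ground-truth class is assumed known, only the box quality is scored; no classification step enters the metric.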

“…These features are extracted from regions of the image that contain rich information to distinguish them from other instances, and these regions are more likely to be associated with significant objects. Likewise, prior self-supervised colocalization studies [1,23,29,38] also use the magnitude of feature vectors as a clue to discover object regions.…”

Section: Representer Point Selection for UOL (mentioning)

confidence: 99%

“…Beyond learning with less supervision, most recent works [45,44,54,1,23,29,38,48,37,42] focus on the object localization task using self-supervised or unsupervised learning that does not require any human annotated labels. These works address the problem of identifying which regions are more likely to contain the foreground object, which is a salient object in an image.…”

Section: Introduction (mentioning)

confidence: 99%

“…These works address the problem of identifying which regions are more likely to contain the foreground object, which is a salient object in an image. In order to discover foreground object regions, several methods [1,23,29,38] attempt to use the magnitude of feature vectors as a clue for class-agnostic activation maps (CAAM) [1]. Most of these methods rely heavily on pre-trained models designed for the image classification task.…”

Section: Introduction (mentioning)

confidence: 99%
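The idea quoted above, using the magnitude of feature vectors as a clue for a class-agnostic activation map, can be illustrated with a short sketch. It assumes a feature map stored as C channels of H x W values; the L2 norm, the relative threshold of 0.5, and both function names are illustrative choices rather than details of any of the cited methods.

```python
import math


def magnitude_map(features):
    """Per-location L2 norm across channels.

    features: list of C channels, each an H x W nested list of floats.
    Returns an H x W nested list of magnitudes.
    """
    channels = len(features)
    height, width = len(features[0]), len(features[0][0])
    return [
        [math.sqrt(sum(features[c][y][x] ** 2 for c in range(channels)))
         for x in range(width)]
        for y in range(height)
    ]


def bounding_box(mag, rel_threshold=0.5):
    """Tightest box (x0, y0, x1, y1), in cell indices, around locations
    whose magnitude is at least rel_threshold times the peak magnitude."""
    peak = max(max(row) for row in mag)
    cells = [(x, y) for y, row in enumerate(mag)
             for x, value in enumerate(row) if value >= rel_threshold * peak]
    xs = [x for x, _ in cells]
    ys = [y for _, y in cells]
    return min(xs), min(ys), max(xs), max(ys)
```

Locations with large feature norms are treated as likely foreground, and the box around them serves as the localization output.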

“…Class activation maps (CAM) [59] are frequently used to provide visually interpretable information about a specific class the model has learned, but a detailed explanation of how the model makes its predictions remains unclear. This limitation becomes even more severe for CAAM, which is commonly used in self-supervised object localization methods [1,23,29,38].…”

Section: Introduction (mentioning)

confidence: 99%


A Task of Convergent AI Ethics Education in School Curriculum - with emphasis on the major of AI Convergent Education in graduate schools of education and the class of ‘AI Ethics’ -

Song

2022

Journal of Ethics Education Studies


We propose a novel unsupervised object localization method that allows us to explain the predictions of the model by utilizing self-supervised pre-trained models without additional finetuning. Existing unsupervised and self-supervised object localization methods often utilize class-agnostic activation maps or self-similarity maps of a pre-trained model. Although these maps can offer valuable information for localization, their limited ability to explain how the model makes predictions remains challenging. In this paper, we propose a simple yet effective unsupervised object localization method based on representer point selection, where the predictions of the model can be represented as a linear combination of representer values of training points. By selecting representer points, which are the most important examples for the model predictions, our model can provide insights into how the model predicts the foreground object by providing relevant examples as well as their importance. Our method outperforms the state-of-the-art unsupervised and self-supervised object localization methods on various datasets with significant margins and even outperforms recent weakly supervised and few-shot methods. Our code is available at: https://github.com/yeonghwansong/UOLwRPS
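The abstract describes predictions as a linear combination of representer values of training points. A minimal sketch of that decomposition follows, assuming each training point contributes its weight times the inner product of its feature vector with the test feature. The weights `alphas` are taken as given here, since the abstract does not say how they are obtained, and all function names are hypothetical.

```python
def representer_values(train_feats, alphas, test_feat):
    """Contribution of each training point to the test prediction.

    Assumed form: value_i = alpha_i * <f_i, f_test>.
    """
    return [
        alpha * sum(u * v for u, v in zip(feat, test_feat))
        for feat, alpha in zip(train_feats, alphas)
    ]


def predict(train_feats, alphas, test_feat):
    """Prediction as the sum of all representer values."""
    return sum(representer_values(train_feats, alphas, test_feat))


def top_representers(train_feats, alphas, test_feat, k=1):
    """Indices and values of the k training points with the largest
    absolute contribution, i.e. the most influential examples."""
    values = representer_values(train_feats, alphas, test_feat)
    order = sorted(range(len(values)), key=lambda i: abs(values[i]), reverse=True)
    return [(i, values[i]) for i in order[:k]]
```

Ranking training points by the size of their contribution is what makes the prediction explainable in this sketch: the top entries are the examples that most influenced the output, together with their importance.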
