Visual attention selects the data that humans consider “interesting”; in engineering, it is modeled by feature-engineered methods that detect contrasted, surprising, or unusual image data. Deep learning has drastically improved model performance on the main benchmark datasets. However, Deep Neural Network-based (DNN-based) models are counterintuitive: surprising or unusual data are by definition difficult to learn because of their low occurrence probability. In practice, DNN-based models mainly learn top-down features such as faces, text, people, or animals, which usually attract human attention, but they are inefficient at extracting surprising or unusual data from images. In this article, we propose a new family of visual attention models called DeepRare, and in particular DeepRare2021 (DR21), which combines the power of DNN feature extraction with the genericity of feature-engineered algorithms. DR21 is an evolution of a previous version, DeepRare2019 (DR19), built on the same framework. DR21 (1) needs no training other than the default ImageNet training, (2) is fast even on CPU, and (3) is tested on four very different eye-tracking datasets, showing that it is generic and always ranks among the top models on all datasets and metrics, whereas no other model exhibits such consistency and genericity. Finally, DR21 (4) is tested with several network architectures, such as VGG16 (V16), VGG19 (V19), and MobileNetV2 (MN2), and (5) provides explanation and transparency about which parts of the image are the most surprising at different levels, despite the use of a DNN-based feature extractor.
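
The following is a minimal sketch, not the authors' implementation, of the general idea described above: intermediate feature maps are taken from a VGG16 backbone trained only on ImageNet, a simple histogram-based rarity (self-information) measure highlights low-probability activations at each level, and the per-level maps are fused into a single saliency map. The layer selection, the rarity measure, and the averaging fusion are illustrative assumptions, not DR21's exact pipeline.

```python
# Minimal sketch (not the authors' code): multi-level VGG16 features pretrained
# on ImageNet only, a histogram-based rarity (self-information) map per channel,
# and a simple average fusion. Layer indices, the rarity measure, and the fusion
# scheme are illustrative assumptions, not DR21's exact pipeline.
import numpy as np
import torch
import torchvision.models as models
import torchvision.transforms as T
from PIL import Image

vgg = models.vgg16(weights=models.VGG16_Weights.IMAGENET1K_V1).features.eval()

preprocess = T.Compose([
    T.Resize((224, 224)),
    T.ToTensor(),
    T.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
])

def channel_rarity(fmap, bins=16):
    """Self-information of each activation within its channel:
    rare (low-probability) activations receive high rarity values."""
    hist, edges = np.histogram(fmap, bins=bins)
    p = hist / max(hist.sum(), 1)
    idx = np.clip(np.digitize(fmap, edges[1:-1]), 0, bins - 1)
    return -np.log2(p[idx] + 1e-12)

def saliency(img_path, layers=(4, 9, 16, 23, 30)):  # ends of the 5 VGG16 conv blocks
    x = preprocess(Image.open(img_path).convert("RGB")).unsqueeze(0)
    level_maps = []
    with torch.no_grad():
        for i, layer in enumerate(vgg):
            x = layer(x)
            if i in layers:
                feats = x.squeeze(0).numpy()               # (C, H, W)
                r = sum(channel_rarity(c) for c in feats)  # fuse channels per level
                r = (r - r.min()) / (r.max() - r.min() + 1e-12)
                r = Image.fromarray((r * 255).astype(np.uint8)).resize((224, 224))
                level_maps.append(np.asarray(r, dtype=np.float32) / 255.0)
    return np.mean(level_maps, axis=0)  # fused saliency map, no extra training
```

As in the abstract, nothing beyond the default ImageNet weights is trained here, and swapping the backbone for VGG19 or MobileNetV2 would only require changing the feature extractor and the layer indices.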