2020
DOI: 10.48550/arxiv.2007.03262
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

RGBT Salient Object Detection: A Large-scale Dataset and Benchmark

Abstract: Salient object detection in complex scenes and environments is a challenging research topic. Most works focus on RGB-based salient object detection, which limits its performance of real-life applications when confronted with adverse conditions such as dark environments and complex backgrounds. Taking advantage of RGB and thermal infrared images becomes a new research direction for detecting salient object in complex scenes recently, as thermal infrared spectrum imaging provides the complementary information an… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
16
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(16 citation statements)
references
References 44 publications
0
16
0
Order By: Relevance
“…VT1000 [58] contains 1,000 RGB-T image pairs captured with highly aligned RGB and thermal cameras. VT5000 [59] contains 5,000 pairs of high-resolution, high-diversity and low-deviation RGB-T images. For the sake of fair comparison, we use the same training dataset as in [62], [82], [11], which consists of 2,500 image pairs in VT5000.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…VT1000 [58] contains 1,000 RGB-T image pairs captured with highly aligned RGB and thermal cameras. VT5000 [59] contains 5,000 pairs of high-resolution, high-diversity and low-deviation RGB-T images. For the sake of fair comparison, we use the same training dataset as in [62], [82], [11], which consists of 2,500 image pairs in VT5000.…”
Section: Methodsmentioning
confidence: 99%
“…In earlier years, RGB-T SOD adopts machine learning methods, for example, SVM [54], ranking models [55], [56], [57] and graph learning [58]. With the development of CNN, Tu et al [59] propose a baseline model which combines CNN with attention mechanism. Zhang et al [60], [61] propose two end-to-end CNN based RGB-T SOD models to achieve multi-scale, multimodality and multi-level fusion.…”
Section: B Rgb-t Salient Object Detectionmentioning
confidence: 99%
See 1 more Smart Citation
“…Most SOD methods [37,9,38,39,11,12] mainly focus on solving the integration of multi-level features to generate a salient map with accurate location and internal consistency. Other works are devoted to studying how to use multi-modal information, such as depth cues [40,41,42] or thermal infrared cues [43,44], as auxiliary inputs of the models to alleviate the defects of individual RGB sources, such as low-light environments and similar texture scenes. It is noted that, some methods [13,17,10,33] also introduce the edge detection as a joint or auxiliary task for SOD, which make the models pay more attention to the object structure and thus improve the localization of salient objects.…”
Section: Fully Supervised Salient Object Detectionmentioning
confidence: 99%
“…Therefore, thermal images can provide supplementary information to improve SOD performance when salient objects suffer from varying light, reflective light, or shadows. Some RGB-T models [185]- [193] and datasets (VT821 [187], VT1000 [191] and VT5000 [193]) have already been proposed over the past few years. Similar to RGB-D SOD, the key aim of RGB-T SOD is to fuse RGB and thermal infrared images and exploit the correlations between the two modalities.…”
Section: F Extension To Rgb-t Sodmentioning
confidence: 99%