Proceedings of the 28th ACM International Conference on Multimedia 2020
DOI: 10.1145/3394171.3413969
|View full text |Cite
|
Sign up to set email alerts
|

Feature Reintegration over Differential Treatment: A Top-down and Adaptive Fusion Network for RGB-D Salient Object Detection

Abstract: Most methods for RGB-D salient object detection (SOD) utilize the same fusion strategy to explore the cross-modal complementary information at each level. However, this may ignore di erent feature contributions from two modalities on di erent levels towards prediction. In this paper, we propose a novel top-down multi-level fusion structure where di erent fusion strategies are utilized to e ectively explore the low-level and high-level features. This is achieved by designing the interweave fusion module (IFM) t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
15
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 43 publications
(15 citation statements)
references
References 39 publications
0
15
0
Order By: Relevance
“…We compare our CDINet with 15 state-of-the-art CNN-based RGB-D SOD methods, including MMCI [3], TAN [2], CPFP [44], DMRA [32], FRDT [41], SSF [40], S2MA [28], A2dele [33], JL-DCF [18], DANet [45], PGAR [4], cmMS [26], BiANet [43], D3Net [15], and ASIFNet [25].…”
Section: Comparisons With Sota Methodsmentioning
confidence: 99%
“…We compare our CDINet with 15 state-of-the-art CNN-based RGB-D SOD methods, including MMCI [3], TAN [2], CPFP [44], DMRA [32], FRDT [41], SSF [40], S2MA [28], A2dele [33], JL-DCF [18], DANet [45], PGAR [4], cmMS [26], BiANet [43], D3Net [15], and ASIFNet [25].…”
Section: Comparisons With Sota Methodsmentioning
confidence: 99%
“…We compare our CDINet with 15 state-of-the-art CNN-based RGB-D SOD methods, including MMCI [3], TAN [2], CPFP [44], DMRA [32], FRDT [41], SSF [40], S2MA [28], A2dele [33], JL-DCF [18], DANet [45], PGAR [4], cmMS [26], BiANet [43], D3Net [15], and ASIFNet [25]. For fair comparisons, we test these methods with the released codes under the default settings to obtain the saliency maps.…”
Section: Comparisons With Sota Methodsmentioning
confidence: 99%
“…We compare our CDINet with 15 state-of-the-art CNN-based RGB-D SOD methods, including MMCI [3], TAN [2], CPFP [44], DMRA [32], FRDT [41], SSF [40], S2MA [28], A2dele [33], JL-DCF [18], DANet [45], PGAR [4], cmMS [26], BiANet [43], D3Net [15] and ASIFNet [25] on five benchmark datasets (i.e., NLPR [31], NJUD [22], DUT [32], STEREO [29], and LFSD [27]). In this supplementary materials, we provide the more visual comparisons and P-R curves in Figure 5 and Figure 6, respectively.…”
Section: B Experimentsmentioning
confidence: 99%
“…Multi-scale modeling. Modeling multi-scale features is an important procedure in deep learning-based computer vision [23,3,25,60,22,64,36,15,50] because it is able to enlarge the receptive field and increase resolution. In 3D representation, modeling multi-scale features is also popular and important.…”
Section: Related Workmentioning
confidence: 99%