Proceedings of the 29th ACM International Conference on Multimedia 2021
DOI: 10.1145/3474085.3475494
|View full text |Cite
|
Sign up to set email alerts
|

Complementary Trilateral Decoder for Fast and Accurate Salient Object Detection

Abstract: Salient object detection (SOD) has made great progress, but most of existing SOD methods focus more on performance than efficiency. Besides, the U-shape structure exists some drawbacks and there is still a lot of room for improvement. Therefore, we propose a novel framework to treat semantic context, spatial detail and boundary information separately in the decoder part. Specifically, we propose an efficient and effective Complementary Trilateral Decoder (CTD) for saliency detection with three branches: Semant… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
40
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2
2

Relationship

2
7

Authors

Journals

citations
Cited by 80 publications
(40 citation statements)
references
References 42 publications
0
40
0
Order By: Relevance
“…Large-scale 2D ISOD datasets such as DUTS [25] and XPIE [24] have extensively promoted the development of CNN-based ISOD methods [43]- [51]. However, 360 • panoramic SOD models are minimal due to insufficient objectlevel pixel-wise annotation [7], [8], [52].…”
Section: B 360 • Panoramic Modelsmentioning
confidence: 99%
“…Large-scale 2D ISOD datasets such as DUTS [25] and XPIE [24] have extensively promoted the development of CNN-based ISOD methods [43]- [51]. However, 360 • panoramic SOD models are minimal due to insufficient objectlevel pixel-wise annotation [7], [8], [52].…”
Section: B 360 • Panoramic Modelsmentioning
confidence: 99%
“…The currently available deep learning-based methods for BAS [11] tend to adopt the U-shape structure proposed by UNet [34]. However, the U-shape structure suffers from two problems [20]: 1) Dilution of semantic information in the process of layer feature fusion; and 2) Weak capacity for capturing global contextual information. These problems may lead to inaccurate locations of salient objects and incomplete segmentation results.…”
Section: B Positioning Modulementioning
confidence: 99%
“…These networks first acquire a coarse saliency map, then refine it by gradually combining spatialdetail-rich features at the lower layers, and finally output the final saliency map at the top layer [12]. Zhou et al [20] created CTDNet comprising three branches with three fusion modules from coarse to fine to improve the localization of a region and the quality of its boundary. They proposed a boundary refinement module (BRM) to utilize the salient boundary information provided by the boundary path to further refine the boundary.…”
mentioning
confidence: 99%
“…We compare our proposed PGNet with 11 SOTA methods, including CPD [34], SCRN [35], DASNet [43], F3Net [32], GCPA [4], ITSD [46], LDF [33], CTD [45], PFS [23], HRSOD [40], DHQSOD [30], where HRSOD and DHQ-SOD are designed for high-resolution salient object detection. All of the above methods use Resnet-50 [10] as the backbone except for HRSOD which uses VGG16 [28].…”
Section: Comparison With the State-of-the-artsmentioning
confidence: 99%