2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021
DOI: 10.1109/cvpr46437.2021.00265
|View full text |Cite
|
Sign up to set email alerts
|

Non-Salient Region Object Mining for Weakly Supervised Semantic Segmentation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
72
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 170 publications
(72 citation statements)
references
References 47 publications
0
72
0
Order By: Relevance
“…Specifically, Ours-L achieves 69.2% and 70.6% mIoU on the PASCAL VOC val set with DeepLabV2 initialized with ImageNet and MS COCO pre-trained weights, respectively, which recover 90.7% and 91.0% of the upper bound of their fully-supervised counterparts. Our methods also achieve comparable performance with recent state-of-the-art WSSS methods us-ing extra saliency maps, such as NSROM (Yao et al, 2021), DRS (Kim et al, 2021), EPS (Lee et al, 2021c), AuxSegNet (Xu et al, 2021), and EDAM (Wu et al, 2021). Our method also outperforms recent methods with superior backbone networks, such as PMM (Li et al, 2021b), which uses Res2Net101 (Gao et al, 2021) as the backbone for semantic segmentation.…”
Section: Imagementioning
confidence: 57%
See 1 more Smart Citation
“…Specifically, Ours-L achieves 69.2% and 70.6% mIoU on the PASCAL VOC val set with DeepLabV2 initialized with ImageNet and MS COCO pre-trained weights, respectively, which recover 90.7% and 91.0% of the upper bound of their fully-supervised counterparts. Our methods also achieve comparable performance with recent state-of-the-art WSSS methods us-ing extra saliency maps, such as NSROM (Yao et al, 2021), DRS (Kim et al, 2021), EPS (Lee et al, 2021c), AuxSegNet (Xu et al, 2021), and EDAM (Wu et al, 2021). Our method also outperforms recent methods with superior backbone networks, such as PMM (Li et al, 2021b), which uses Res2Net101 (Gao et al, 2021) as the backbone for semantic segmentation.…”
Section: Imagementioning
confidence: 57%
“…proposed OAA, which accumulated the activated regions during the different training stage. In (Yao et al, 2021), Yao et al . proposed a graph reasoning and non-salient region mining module to capture more object extents from non-salient regions, since the saliency prior used in OAA did not always correspond to the foreground objects.…”
Section: Wsss With Image-level Labelsmentioning
confidence: 99%
“…As the basic task of scene understanding, semantic segmentation technology based on pixel-by-pixel classification has been widely studied [22][23][24]. Many semantic segmentation methods based on deep learning have been proposed [25][26][27][28]. Currently, there are four main types of networks, namely, the fully convolutional network (FCN) [29], the convolutional neural network (CNN) [30], the recurrent neural network (RNN) [31], and the generative adversarial network (GAN) [32].…”
Section: Semantic Segmentation Based On Deep Learningmentioning
confidence: 99%
“…As an additional guidance for network to pay attention to the entire region of objects, some existing works attempt to devise auxiliary tasks such as sub-category classification [3], self-equivariant regularization with scale variance minimization [49], class-wise co-attention extraction [33,42], anti-adversarial attack [28], and complementary patch loss [60]. Many WSSS methods [15,16,21,30,33,42,55,56] have been proposed to employ the pre-trained saliency detection module, which distinguishes dominant foreground object from its background, as a complementary source of information for enhancing CAMs and generating precise pseudo-pixel labels.…”
Section: Related Workmentioning
confidence: 99%