SegSort: Segmentation by Discriminative Sorting of Segments

Hwang, Juen-Haur; Yu, Stella X.; Shi, Jianbo; Collins, Maxwell D.; Yang, Tien-Ju; Zhang, Xiao; Chen, Liang-Chieh

doi:10.1109/iccv.2019.00743

Cited by 114 publications

(87 citation statements)

References 70 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Although a few prior methods address the idea of metric learning in semantic segmentation, they only account for the local content from objects [29] or instances [16,1,22,42]. It is worth noting [37] also explores cross-image information of training data, i.e., leverage perceptual pixel groups for nonparametric pixel classification. Due to its clustering based metric learning strategy, [37] needs to retrieve extra labeled data for inference.…”

Section: Related Workmentioning

confidence: 99%

“…It is worth noting [37] also explores cross-image information of training data, i.e., leverage perceptual pixel groups for nonparametric pixel classification. Due to its clustering based metric learning strategy, [37] needs to retrieve extra labeled data for inference. Differently, our core idea, i.e., exploit inter-image pixel-to-pixel similarity to enforce global constraints on the embedding space, is conceptually novel and rarely explored before.…”

Section: Related Workmentioning

confidence: 99%

“…Basically, these segmentation models (excluding [37]) utilize deep architectures to project image pixels into a highly non-linear embedding space (Fig. 1(c)).…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Exploring Cross-Image Pixel Contrast for Semantic Segmentation

Yang

Zhou

et al. 2021

2021 IEEE/CVF International Conference on Computer Vision (ICCV)

438

135

View full text Add to dashboard Cite

Current semantic segmentation methods focus only on mining "local" context, i.e., dependencies between pixels within individual images, by context-aggregation modules (e.g., dilated convolution, neural attention) or structureaware optimization criteria (e.g., IoU-like loss). However, they ignore "global" context of the training data, i.e., rich semantic relations between pixels across different images. Inspired by recent advance in unsupervised contrastive representation learning, we propose a pixel-wise contrastive algorithm for semantic segmentation in the fully supervised setting. The core idea is to enforce pixel embeddings belonging to a same semantic class to be more similar than embeddings from different classes. It raises a pixel-wise metric learning paradigm for semantic segmentation, by explicitly exploring the structures of labeled pixels, which were rarely explored before. Our method can be effortlessly incorporated into existing segmentation frameworks without extra overhead during testing. We experimentally show that, with famous segmentation models (i.e., DeepLabV3, HRNet, OCR) and backbones (i.e., ResNet, HRNet), our method brings performance improvements across diverse datasets (i.e., Cityscapes, PASCAL-Context, COCO-Stuff, CamVid). We expect this work will encourage our community to rethink the current de facto training paradigm in semantic segmentation.

show abstract

Section: Related Workmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

See 1 more Smart Citation

Exploring Cross-Image Pixel Contrast for Semantic Segmentation

Yang

Zhou

et al. 2021

2021 IEEE/CVF International Conference on Computer Vision (ICCV)

438

135

View full text Add to dashboard Cite

show abstract

“…SSL [4,41] concentrates on efficiently utilizing scarce labeled and enormous unlabeled data of the same label space. USL [13,31] rid of heavy annotation cost. WSSS [40] [36] further improve the pipeline by dual ranking statistics and mutual knowledge distillation.…”

Section: Related Workmentioning

confidence: 99%

“…Such data can be easily collected but difficult to annotate. USL [13,31] is also introduced to mitigate the annotation cost. Due to the complexity of unlabeled data, USL cannot achieve satisfactory results without any prior knowledge.…”

Section: Introductionmentioning

confidence: 99%

Novel Class Discovery in Semantic Segmentation

Zhao¹,

Zhang²,

Sebe³

et al. 2021

Preprint

View full text Add to dashboard Cite

We introduce a new setting of Novel Class Discovery in Semantic Segmentation (NCDSS), which aims at segmenting unlabeled images containing new classes given prior knowledge from a labeled set of disjoint classes. In contrast to existing approaches that look at novel class discovery in image classification, we focus on the more challenging semantic segmentation. In NCDSS, we need to distinguish the objects and background, and to handle the existence of multiple classes within an image, which increases the difficulty in using the unlabeled data. To tackle this new setting, we leverage the labeled base data and a saliency model to coarsely cluster novel classes for model training in our basic framework. Additionally, we propose the Entropy-based Uncertainty Modeling and Self-training (EUMS) framework to overcome noisy pseudo-labels, further improving the model performance on the novel classes. Our EUMS utilizes an entropy ranking technique and a dynamic reassignment to distill clean labels, thereby making full use of the noisy data via self-supervised learning. We build the NCDSS benchmark on the PASCAL-5 i dataset.Extensive experiments demonstrate the feasibility of the basic framework (achieving an average mIoU of 49.81%) and the effectiveness of EUMS framework (outperforming the basic framework by 9.28% mIoU).

show abstract