Semi-Supervised Semantic Segmentation Using Unreliable Pseudo-Labels

Wang, Yuchao; Wang, Haochen; Shen, Yujun; Fei, Jingjing; Li, Wei; Jin, Guoqiang; Wu, Liwei; Zhao, Rui; Le, Xinyi

doi:10.1109/cvpr52688.2022.00421

Cited by 264 publications

(112 citation statements)

References 116 publications

Supporting

Mentioning

111

Contrasting

Order By: Relevance

“…This idea is extended later to semi-supervised semantic segmentation, which trains the student model with high-confident hard pseudo-labels predicted by the teacher. On this basis, extensive attempts improve semi-supervised semantic segmentation by CutMix augmentation [18], class-balanced training [80,30,23] and contrastive learning [80,1,40,64]. A closely relevant topic to self-training in SSL is consistency regularization, which believes that enforcing semantic or distribution consistency between various perturbations, such as image augmentation [32] and network perturbation [72], can improve the robustness and generalization of the model.…”

Section: Related Workmentioning

confidence: 99%

“…Self-training provides a unified solution and achieves state-of-the-art performance on both settings [29,64]. One of the most common and widely used forms of self-training in semantic segmentation is a variant of mean teacher, which is shown in Fig.…”

Section: Introductionmentioning

confidence: 99%

“…Comparison with state-of-the-art semi-supervised semantic segmentation methods on the validation set. We use FST-D with K = 3 and † means results reported by[64].…”

mentioning

confidence: 99%

See 2 more Smart Citations

Learning from Future: A Novel Self-Training Framework for Semantic Segmentation

Du¹,

Shen²,

Wang³

et al. 2022

Preprint

Self Cite

View full text Add to dashboard Cite

Self-training has shown great potential in semi-supervised learning. Its core idea is to use the model learned on labeled data to generate pseudo-labels for unlabeled samples, and in turn teach itself. To obtain valid supervision, active attempts typically employ a momentum teacher for pseudo-label prediction yet observe the confirmation bias issue, where the incorrect predictions may provide wrong supervision signals and get accumulated in the training process. The primary cause of such a drawback is that the prevailing self-training framework acts as guiding the current state with previous knowledge, because the teacher is updated with the past student only. To alleviate this problem, we propose a novel self-training strategy, which allows the model to learn from the future. Concretely, at each training step, we first virtually optimize the student (i.e., caching the gradients without applying them to the model weights), then update the teacher with the virtual future student, and finally ask the teacher to produce pseudo-labels for the current student as the guidance. In this way, we manage to improve the quality of pseudo-labels and thus boost the performance. We also develop two variants of our future-self-training (FST) framework through peeping at the future both deeply (FST-D) and widely (FST-W). Taking the tasks of unsupervised domain adaptive semantic segmentation and semi-supervised semantic segmentation as the instances, we experimentally demonstrate the effectiveness and superiority of our approach under a wide range of settings. Code is available at https://github.com/usr922/FST.

show abstract

Section: Related Workmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Learning from Future: A Novel Self-Training Framework for Semantic Segmentation

Du¹,

Shen²,

Wang³

et al. 2022

Preprint

Self Cite

View full text Add to dashboard Cite

show abstract

“…Compared to collecting ordinary labels in PL, it would be less laborious for collecting complementary labels in NL [10]. Therefore, NL can not only be easily combined with ordinary classification [5,10], but also assist various vision applications, e.g., [12] dealing with noisy labels by applying NL, [35] using unreliable pixels for semantic segmentation with NL, etc. In this paper, we attempt to leverage NL to augment the few-shot labeled set by predicting negative pseudo-labels from unlabeled data, and thus obtain more accurate pseudo labels to assist classifier modeling under label-constrained scenarios.…”

Section: Related Workmentioning

confidence: 99%

An Embarrassingly Simple Approach to Semi-Supervised Few-Shot Learning

Wei¹,

Xu²,

Zhang³

et al. 2022

Preprint

View full text Add to dashboard Cite

Semi-supervised few-shot learning consists in training a classifier to adapt to new tasks with limited labeled data and a fixed quantity of unlabeled data. Many sophisticated methods have been developed to address the challenges this problem comprises. In this paper, we propose a simple but quite effective approach to predict accurate negative pseudo-labels of unlabeled data from an indirect learning perspective, and then augment the extremely label-constrained support set in fewshot classification tasks. Our approach can be implemented in just few lines of code by only using off-the-shelf operations, yet it is able to outperform state-of-the-art methods on four benchmark datasets.

show abstract

“…Moreover, their method is divided into two stages, first using contrastive learning to pre-train the backbone network, and second stage adding segmentation head to the backbone network to calculate pixel-level crossentropy loss. Inspired by Wang et al [38] and Liu et al [31], our method employs a fine-grained pixel-level instance discrimination task. We select pixel-level sample features for each class of seismic facies, the features of each class of seismic facies and the central feature of the current class form a positive sample pair, and the features of other classes can be regarded as negative samples of the current class.…”

mentioning

confidence: 99%

Contrastive Learning Approach for Semi-Supervised Seismic Facies Identification Using High-Confidence Representations

Li¹,

Liu²,

Dou³

et al. 2022

Preprint

View full text Add to dashboard Cite

The manual seismic facies annotation relies heavily on the experience of seismic interpreters, and the distribution of seismic facies in adjacent locations is very similar, which means that much of the labeling is costly repetitive work. However, we found that training the model with only a few evenly sampled labeled slices still suffers from severe classification confusion, that is, misidentifying one class of seismic facies as another. To address this issue, we propose a semi-supervised seismic facies identification method using features from unlabeled data for contrastive learning. We sample features in regions with high identification confidence, and use a pixel-level instance discrimination task to narrow the intra-class distance and increase the inter-class distance. Instance discrimination encourages the latent space to produce more distinguishable decision boundaries and reduces the bias in the features of the same class. Our method only needs to extend one branch to compute the contrastive loss without extensive changes to the network structure. We have conducted experiments on two public seismic surveys, SEAM AI and Netherlands F3, and the proposed model achieves an IOU score of more than 90 using only 1% of the annotations in the F3 survey. We have made our codes and pre-trained models publicly available: www.github.com/upcliuwenlong/CONSS.

show abstract

Semi-Supervised Semantic Segmentation Using Unreliable Pseudo-Labels

Cited by 264 publications

References 116 publications

Learning from Future: A Novel Self-Training Framework for Semantic Segmentation

Learning from Future: A Novel Self-Training Framework for Semantic Segmentation

An Embarrassingly Simple Approach to Semi-Supervised Few-Shot Learning

Contrastive Learning Approach for Semi-Supervised Seismic Facies Identification Using High-Confidence Representations

Contact Info

Product

Resources

About