“…Recent state-ofthe-art methods, e.g., MeanTeacher [4], FixMatch [7] and Noisy student [8], share a similar philosophy and enjoy the merit of both pseudo labeling [9,8,10] and consistency regularization [2,3,4,1,11]: the teacher model generates pseudo labels for weakly-augmented unlabeled data whereas the student model trains on the strong-augmented counterparts, as shown in Figure 1a. Such strategy can also be applied to semantic segmentation, and the resulting approaches [12,13,14,15,16,17,18,19] have demonstrated outstanding performance. However, pseudo labeling methods are plagued with confirmation bias [20], i.e., the student model is prone to overfit the erroneous pseudo labels.…”