“…Semantic segmentation architectures are typically trained on huge datasets with pixelwise annotations (e.g., the Cityscapes [5] or CamVid [1] datasets), which are highly expensive, time-consuming and error-prone to generate. To overcome this issue, semisupervised methods are emerging, trying to exploit weakly annotated data (e.g., with only image labels or only bounding boxes) [25,31,37,39,13,6,14,32] or completely unlabeled [24,29,15,31,19] data after a first stage of supervised training. In particular the works of [22,31] have paved the way respectively to adversarial learning approaches for the semantic segmentation task and to their application to semi-supervised learning.…”