Deep learning techniques have been widely used in autonomous driving systems for the semantic understanding of urban scenes; however, they require a huge amount of labeled data for training, which is difficult and expensive to acquire. A recently proposed workaround is to train deep networks using synthetic data, but the domain shift between synthetic and real-world representations limits the performance. In this work, a novel unsupervised domain adaptation strategy is introduced to solve this issue. The proposed learning strategy is driven by three components: a standard supervised learning loss on labeled synthetic data, an adversarial learning module that exploits both labeled synthetic data and unlabeled real data, and finally a self-teaching strategy applied to unlabeled data. The last component exploits a region-growing framework guided by the segmentation confidence. Furthermore, we weight this component on the basis of the class frequencies to enhance the performance on less common classes. Experimental results prove the effectiveness of the proposed strategy in adapting a segmentation network trained on synthetic datasets, such as GTA5 and SYNTHIA, to real-world datasets such as Cityscapes and Mapillary.
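To make the three-component objective concrete, the following is a minimal PyTorch sketch of how such a combined loss could be assembled. All names, weighting factors, and the discriminator output `d_out_real` are illustrative assumptions, and the region-growing selection of pseudo-labels is not shown (unselected pixels are simply assumed to carry the ignore index); this is not the paper's actual implementation.

```python
import torch
import torch.nn.functional as F

def combined_loss(pred_syn, label_syn, pred_real, d_out_real,
                  pseudo_label_real, class_weights,
                  lambda_adv=0.001, lambda_self=0.1):
    """Hypothetical combination of the three training terms described above."""
    # (1) Standard supervised cross-entropy on labeled synthetic data.
    loss_sup = F.cross_entropy(pred_syn, label_syn, ignore_index=255)

    # (2) Adversarial term: push the discriminator's output on real-domain
    #     predictions toward the "source" label so the two domains align.
    loss_adv = F.binary_cross_entropy_with_logits(
        d_out_real, torch.zeros_like(d_out_real))

    # (3) Self-teaching on unlabeled real data: cross-entropy against
    #     confidence-selected pseudo-labels, weighted by class frequency
    #     to favor less common classes (unselected pixels = ignore index).
    loss_self = F.cross_entropy(pred_real, pseudo_label_real,
                                weight=class_weights, ignore_index=255)

    return loss_sup + lambda_adv * loss_adv + lambda_self * loss_self
```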