2018
DOI: 10.1007/978-3-030-01216-8_6
|View full text |Cite
|
Sign up to set email alerts
|

Effective Use of Synthetic Data for Urban Scene Semantic Segmentation

Abstract: Training a deep network to perform semantic segmentation requires large amounts of labeled data. To alleviate the manual effort of annotating real images, researchers have investigated the use of synthetic data, which can be labeled automatically. Unfortunately, a network trained on synthetic data performs relatively poorly on real images. While this can be addressed by domain adaptation, existing methods all require having access to real images during training. In this paper, we introduce a drastically differ… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
57
0

Year Published

2019
2019
2020
2020

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 127 publications
(58 citation statements)
references
References 51 publications
(140 reference statements)
1
57
0
Order By: Relevance
“…Pan et al show that a careful balance between the instance normalization and batch normalization could enhance a neural network's cross-domain generalization. EUSD [100]. Arguing that object detectors have better generalization capacity in detecting foreground objects (e.g., car, pedestrian, etc.)…”
Section: Other Methodsmentioning
confidence: 99%
“…Pan et al show that a careful balance between the instance normalization and batch normalization could enhance a neural network's cross-domain generalization. EUSD [100]. Arguing that object detectors have better generalization capacity in detecting foreground objects (e.g., car, pedestrian, etc.)…”
Section: Other Methodsmentioning
confidence: 99%
“…Then the noisy labels were used to guide the training for road scene segmentation. Another increasingly popular way to overcome the lack of large-scale dataset is explored by the usage of synthetic data, such as VEIS [28], SYNTHIA [29], Virtual KITTI [30], and GTA-V [31]. Synthetic data is usually used to augment real training data [29], [32].…”
Section: Related Workmentioning
confidence: 99%
“…The SYNTHIA dataset is generated by rendering a virtual city created with the Unity development platform for semantic segmentation of driving scenes. Saleh et al [28] proposed VEIS environment to generate the VEIS dataset which has richer foreground classes of real traffic environments. Our previous works [15] can also be regarded as a synthetic dataset which is transformed from a real largescale conventional image dataset.…”
Section: Related Workmentioning
confidence: 99%
“…In the past few years, synthesizing images with 3D models using graphics engines have made a figure and attracted much attention in several fields, including human pose estimations [34], indoor scene understanding [25,24], outdoor/urbane scene understanding [30,32], and object detection [26,33,7]. The use of 3D models falls into one of the following categories: (1) Rendering 3D objects on top of static background real-world images [26,34]; (2) Randomly arranging scenes filled with objects [25,24,30,7]; (3) Using commercial game engine, such as Grand Theft Auto V (GTA V) [28,23,32] and the UnrealCV Project [27,3,33].…”
Section: Image Synthesis In 3d Virtual Worldsmentioning
confidence: 99%