2020
DOI: 10.48550/arxiv.2006.07864
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Cityscapes 3D: Dataset and Benchmark for 9 DoF Vehicle Detection

Abstract: Detecting vehicles and representing their position and orientation in the three dimensional space is a key technology for autonomous driving. Recently, methods for 3D vehicle detection solely based on monocular RGB images gained popularity. In order to facilitate this task as well as to compare and drive state-of-the-art methods, several new datasets and benchmarks have been published. Ground truth annotations of vehicles are usually obtained using lidar point clouds, which often induces errors due to imperfec… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 8 publications
(8 citation statements)
references
References 23 publications
0
8
0
Order By: Relevance
“…Images are collected from a fixed camera and obtained through consecutive frames. To obtain more pedestrian images in traffic scenes, we filter and crop out pedestrian images from Cityscapes [45][46][47], which is a very large data set system, so there are more types of pedestrians. Our experiment will be conducted on these data sets and compared with the current state-of-the-art methods.…”
Section: Methodsmentioning
confidence: 99%
“…Images are collected from a fixed camera and obtained through consecutive frames. To obtain more pedestrian images in traffic scenes, we filter and crop out pedestrian images from Cityscapes [45][46][47], which is a very large data set system, so there are more types of pedestrians. Our experiment will be conducted on these data sets and compared with the current state-of-the-art methods.…”
Section: Methodsmentioning
confidence: 99%
“…We collected and analyzed commonly used datasets for 3D reconstruction in Tables 1-3. [302] 24 megapixels images, 3D point cloud / / Semantic3D [303] 4 billion points images, 3D point cloud 30 8 classes Paris-Lille-3D [304] 57.79 million images, 3D point cloud 2 50 classes ApolloCar3D [305] 5277 images / 60k Cityscapes 3D [306] 5000 images, 3D point cloud / 8 classes BlendedMVS [307] 17k images, 3D meshes 113 / CSPC-Dataset [308] 68 million points images, 3D point cloud 5 6 classes Toronto-3D [309] 78.3 million points images, 3D point cloud / 8 classes STPLS3D [310] 16 km 2 images, 3D point cloud / / KITTI-360 [311] 300k, 1 billon points images, 3D point cloud / / DiTer [312] / images, 3D point cloud / / SubT-MRS [313] 30 scenes images, 3D point cloud 30 /…”
Section: Datasetsmentioning
confidence: 99%
“…Cityscape has high-quality pixel-level annotations of 5 k frames and is intended for the evaluation of semantic urban scene understanding tasks. Cityscapes 3D [ 46 ] is a new extension of the original dataset with 3D bounding box annotations for 3D object detection, for example. The Pascal Visual Object Classes (PascalVOC) [ 31 ] challenge is not only a dataset but also an annual competition and workshop.…”
Section: Related Workmentioning
confidence: 99%