2017 IEEE International Conference on Robotics and Automation (ICRA) 2017
DOI: 10.1109/icra.2017.7989089
|View full text |Cite
|
Sign up to set email alerts
|

Reconstructing vehicles from a single image: Shape priors for road scene understanding

Abstract: We present an approach for reconstructing vehicles from a single (RGB) image, in the context of autonomous driving. Though the problem appears to be ill-posed, we demonstrate that prior knowledge about how 3D shapes of vehicles project to an image can be used to reason about the reverse process, i.e., how shapes (back-)project from 2D to 3D. We encode this knowledge in shape priors, which are learnt over a small keypoint-annotated dataset. We then formulate a shapeaware adjustment problem that uses the learnt … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
83
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
3
3
2

Relationship

1
7

Authors

Journals

citations
Cited by 73 publications
(83 citation statements)
references
References 23 publications
(74 reference statements)
0
83
0
Order By: Relevance
“…[21] proposes to estimate 3D box using the geometry relations between 2D box edges and 3D box corners. [30,1,22] explicitly utilize sparse information by predicting series of keypoints of regular-shape vehicles. The 3D object pose can be constrained by wireframe template fitting.…”
Section: Related Workmentioning
confidence: 99%
“…[21] proposes to estimate 3D box using the geometry relations between 2D box edges and 3D box corners. [30,1,22] explicitly utilize sparse information by predicting series of keypoints of regular-shape vehicles. The 3D object pose can be constrained by wireframe template fitting.…”
Section: Related Workmentioning
confidence: 99%
“…Deep-MANTA [3] uses 3D CAD models and annotated 3D parts in a coarseto-fine localization process. The work in [26] encodes shape priors using keypoints for recovering the 3D pose and shape of a query object. In Mono3D++ [11], the 3D shape and pose for cars is provided by using a morphable wireframe, and it optimizes projection consistency between generated 3D hypotheses and corresponding, 2D pseudomeasurements.…”
Section: Related Workmentioning
confidence: 99%
“…We also demonstrate that our method is independent of the road plane profile on which vehicles are to be reconstructed. In other words, unlike others (such as [1], [3], [13]) we do not make any assumptions that the ego car and the car to be reconstructed are on the same road plane.…”
Section: Experiments and Resultsmentioning
confidence: 99%
“…From top to bottom -(i) Illustrating how co-planarity assumption results in incorrect initialization in existing approaches (ii) Relying only on minimizing the reprojection error, leaves the optimizer free to rigidly transform the mean car (iii) Joint optimization constrains the car to be on ground while minimizing the reprojection error, resulting in more accurate reconstruction and localization (nc and ng are car base and road plane normals respectively) (iv) Failure of co-planarity assumption for steep roads on SYNTHIA-SF [8]. Notice the incorrect initialization of the car on slopes via method of [1] shown in red. Our method is not bound by this co-planarity assumption and initializes the vehicle correctly, shown in black.…”
Section: Joint Optimization For Ground Plane and Vehicle Pose And Shamentioning
confidence: 99%
See 1 more Smart Citation