2018
DOI: 10.48550/arxiv.1811.06152
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Depth Prediction Without the Sensors: Leveraging Structure for Unsupervised Learning from Monocular Videos

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
18
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 8 publications
(18 citation statements)
references
References 0 publications
0
18
0
Order By: Relevance
“…We proposes a self-supervised learning framework inspired from their work, with significant modifications summarized as following: (1) Our system estimates the combined transformation, encapsulating both the camera ego-motion and the object motion. By contrast, Casser et al [3] estimate the object motion on top of the camera ego-motion, predicted by the Pose-net. Thus the accuracy of their object motion prediction is dependent on the performance of their Pose-net.…”
Section: Supervised Depth Estimationmentioning
confidence: 99%
See 2 more Smart Citations
“…We proposes a self-supervised learning framework inspired from their work, with significant modifications summarized as following: (1) Our system estimates the combined transformation, encapsulating both the camera ego-motion and the object motion. By contrast, Casser et al [3] estimate the object motion on top of the camera ego-motion, predicted by the Pose-net. Thus the accuracy of their object motion prediction is dependent on the performance of their Pose-net.…”
Section: Supervised Depth Estimationmentioning
confidence: 99%
“…In this work we try to solve the object motion by modelling it as a rigid-body transform. Similar idea is proposed in [3], where the pre-computed instance segmentation masks are utilized for individual object-motion prediction. We proposes a self-supervised learning framework inspired from their work, with significant modifications summarized as following: (1) Our system estimates the combined transformation, encapsulating both the camera ego-motion and the object motion.…”
Section: Supervised Depth Estimationmentioning
confidence: 99%
See 1 more Smart Citation
“…In these systems depth information can be used to decide whether to accelerate, brake or steer. Sonar, radar, and lidar 1 are examples of technologies that can be used to measure this information directly. As a complementary source of information or as a costeffective alternative, depth can be predicted from camera data.…”
Section: Introductionmentioning
confidence: 99%
“…[11] showed that it is possible to train depth prediction models on video data using image reconstruction as a supervision signal, by adding a parallel network that predicts the image-pair camera transformation that is required for image reconstruction. The reconstruc- 1 Radar that uses laser instead of radio waves tion computation will be discussed in further detail in the methods section.…”
Section: Introductionmentioning
confidence: 99%