2021 IEEE International Conference on Multimedia and Expo (ICME) 2021
DOI: 10.1109/icme51207.2021.9428385
|View full text |Cite
|
Sign up to set email alerts
|

Distortion-Tolerant Monocular Depth Estimation on Omnidirectional Images Using Dual-Cubemap

Abstract: Estimating the depth of omnidirectional images is more challenging than that of normal field-of-view (NFoV) images because the varying distortion can significantly twist an object's shape. The existing methods suffer from troublesome distortion while estimating the depth of omnidirectional images, leading to inferior performance. To reduce the negative impact of the distortion influence, we propose a distortiontolerant omnidirectional depth estimation algorithm using a dual-cubemap. It comprises two modules: D… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
3
1

Relationship

2
5

Authors

Journals

citations
Cited by 14 publications
(6 citation statements)
references
References 15 publications
0
6
0
Order By: Relevance
“…Motivated by previous works [7,16], we first polymerize the most relevant features via a virtue tangent plane on the panoramic sphere projection. Based on the projection formula [16,17,4] (we discuss it in detail in the supplementary material), we can get the distortion-free sampling coordinates p ∈ R H×W ×9×2 (height, width, number of points, number of coordinates) in the 2D feature maps. The learnable offsets ∆p are employed to adjust the sampling locations adaptively.…”
Section: Cross-scale Distortion Awarenessmentioning
confidence: 99%
“…Motivated by previous works [7,16], we first polymerize the most relevant features via a virtue tangent plane on the panoramic sphere projection. Based on the projection formula [16,17,4] (we discuss it in detail in the supplementary material), we can get the distortion-free sampling coordinates p ∈ R H×W ×9×2 (height, width, number of points, number of coordinates) in the 2D feature maps. The learnable offsets ∆p are employed to adjust the sampling locations adaptively.…”
Section: Cross-scale Distortion Awarenessmentioning
confidence: 99%
“…From a lightweight perspective, Jiang et al [17] improve Bifuse by designing an efficient fusion block and propose Unifuse. Besides, Shen et al [7] utilize a dual-cube fusion strategy to get rid of distortion. Compared with cubemap, there are less distortion and pixel loss in the tangent images, and Eder et al [18] propose leveraging tangent images to mitigate the spherical distortion.…”
Section: A Monocular 360°depth Estimationmentioning
confidence: 99%
“…For better supervision, we combine reverse Huber (or Berhu) [8], [16], [17], [39] loss and gradient loss [7] to keep the maximum value in the window, and set others to 0 4: end for 5: Set a non-overlapping sliding window with size of 1×7, 6: for each position in HF s f ct do 7:…”
Section: Objective Functionmentioning
confidence: 99%
See 2 more Smart Citations