2022 International Conference on 3D Vision (3DV) 2022
DOI: 10.1109/3dv57658.2022.00028
|View full text |Cite
|
Sign up to set email alerts
|

The 8-Point Algorithm as an Inductive Bias for Relative Pose Prediction by ViTs

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 17 publications
(5 citation statements)
references
References 58 publications
0
5
0
Order By: Relevance
“…Applying a negative log-likelihood loss for this distribution allows for improved rotation estimation and was successfully used for head pose estimation by Liu et al [32]. Rockwell et al [42] present a baseline to directly estimate the relative pose between two images by training a Vision Transformer (ViT) to bring its computations close to the eight-point algorithm. They achieve competitive results in multiple settings.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Applying a negative log-likelihood loss for this distribution allows for improved rotation estimation and was successfully used for head pose estimation by Liu et al [32]. Rockwell et al [42] present a baseline to directly estimate the relative pose between two images by training a Vision Transformer (ViT) to bring its computations close to the eight-point algorithm. They achieve competitive results in multiple settings.…”
Section: Related Workmentioning
confidence: 99%
“…Attention based methods. We also compare to a recent work by Rockwell et al [42] (8PointVit) using a Vision Transformer (ViT) to estimate the relative pose. Although Rockwell et al achieve competitive results in multiple settings, their approach is less suited for extreme view changes.…”
Section: Comparative Baselinesmentioning
confidence: 99%
See 1 more Smart Citation
“…Neural pose prediction from RGB images. A series of methods (Lin et al, 2023a;Rockwell et al, 2022;Cai et al, 2021) have sought to address this issue by directly regressing camera poses through network predictions. Notably, these methods do not incorporate 3D shape information during the camera pose prediction process.…”
Section: Related Workmentioning
confidence: 99%
“…Jiang et al [26] embed epipolar geometry constraints into a self-supervised learning framework through the joint optimization of camera poses and optical flow. In [27,28], they use the Eight-Point Algorithm as a neural network inductive bias to regress fundamental or essential matrices. Wang et al [29] employ scale-invariant loss functions to train their model.…”
Section: Two-view Camera Pose Estimationmentioning
confidence: 99%