Monocular scene flow estimation recovers 3D structure and 3D motion from consecutive monocular images. Previous monocular scene flow methods have typically focused on enhancing image features and motion features directly, while neglecting how these features are exploited in the decoder, which is equally crucial for accurate scene flow estimation. In this paper, we propose a global feature perception module (GFPM) based on cross-covariance attention and apply it to the decoder, enabling the decoder to effectively exploit the motion and image features of the current layer as well as the coarse scene flow estimate from the previous layer, thus enhancing the decoder's recovery of 3D motion information. In addition, we propose a parallel architecture of self-attention and convolution (PCSA) for feature extraction, which strengthens the global representation capability of the extracted image features. Our proposed method demonstrates strong performance on the KITTI 2015 dataset, achieving a relative improvement of 17.6\% over the baseline approach, and achieves competitive results compared to other recent methods.
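
Since the GFPM is built on cross-covariance attention, the sketch below illustrates that underlying mechanism (as introduced in XCiT: attention computed over feature channels rather than tokens, so cost scales linearly with token count). This is a minimal PyTorch illustration of the generic operation, not the paper's GFPM implementation; the class name, head count, and tensor shapes are assumptions chosen for clarity.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class CrossCovarianceAttention(nn.Module):
    """Minimal sketch of cross-covariance attention (XCA).

    The attention map is (C/h x C/h) per head, i.e. channel-by-channel,
    instead of the (N x N) token-by-token map of standard self-attention.
    Hyperparameters are illustrative, not taken from the paper.
    """

    def __init__(self, dim: int, num_heads: int = 8):
        super().__init__()
        self.num_heads = num_heads
        self.qkv = nn.Linear(dim, dim * 3, bias=False)
        # Learnable per-head temperature, as in XCiT.
        self.temperature = nn.Parameter(torch.ones(num_heads, 1, 1))
        self.proj = nn.Linear(dim, dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (B, N, C) -- N tokens, e.g. flattened feature-map locations.
        B, N, C = x.shape
        qkv = self.qkv(x).reshape(B, N, 3, self.num_heads, C // self.num_heads)
        q, k, v = qkv.permute(2, 0, 3, 4, 1)  # each: (B, heads, C/heads, N)

        # L2-normalize along the token dimension so the channel-to-channel
        # map is a cosine (cross-covariance) similarity.
        q = F.normalize(q, dim=-1)
        k = F.normalize(k, dim=-1)

        attn = (q @ k.transpose(-2, -1)) * self.temperature  # (B, heads, C/h, C/h)
        attn = attn.softmax(dim=-1)

        out = attn @ v                        # (B, heads, C/heads, N)
        out = out.permute(0, 3, 1, 2).reshape(B, N, C)
        return self.proj(out)

# Usage: output shape matches input, regardless of token count N.
x = torch.randn(2, 1024, 128)
y = CrossCovarianceAttention(dim=128)(x)
assert y.shape == x.shape
```

Because the channel dimension is fixed while N grows with image resolution, this formulation keeps attention affordable on dense feature maps, which is one reason it suits decoder-side fusion of motion and image features.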