2022 IEEE International Conference on Image Processing (ICIP) 2022
DOI: 10.1109/icip46576.2022.9898048
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Scale Raft: Combining Hierarchical Concepts for Learning-Based Optical Flow Estimation

Abstract: Many classical and learning-based optical flow methods rely on hierarchical concepts to improve both accuracy and robustness. However, one of the currently most successful approaches -RAFT -hardly exploits such concepts. In this work, we show that multi-scale ideas are still valuable. More precisely, using RAFT as a baseline, we propose a novel multi-scale neural network that combines several hierarchical concepts within a single estimation framework. These concepts include (i) a partially shared coarse-to-fin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 11 publications
(6 citation statements)
references
References 26 publications
0
6
0
Order By: Relevance
“…Compared to the results of our MS-RAFT method [1], our new MS-RAFT+ approach leads to a significant improvement in accuracy for KITTI 2015 and Sintel (clean). Largest improvements can be observed in non-occluded regions (non-occluded Fl-all, EPE matched) as well as for small displacements (s0-10).…”
Section: Ms-raft+ Vs Ms-raftmentioning
confidence: 85%
See 4 more Smart Citations
“…Compared to the results of our MS-RAFT method [1], our new MS-RAFT+ approach leads to a significant improvement in accuracy for KITTI 2015 and Sintel (clean). Largest improvements can be observed in non-occluded regions (non-occluded Fl-all, EPE matched) as well as for small displacements (s0-10).…”
Section: Ms-raft+ Vs Ms-raftmentioning
confidence: 85%
“…Our method for the Robust Vision Challenge 2022 is based on our recent MS-RAFT approach, which combines four hierarchical concepts in one estimation framework: (i) a coarse-to-fine computation scheme that provides useful initialization from coarser scales, (ii) a U-Net-style feature extractor which relies on semantically enhanced multiscale features, (iii) RAFT's original correlation pyramid that introduces non-local cost information in the matching process, and (iv) a multi-scale multi-iteration loss that considers a sample-wise robust loss function for fine-tuning the network. We refer the reader to [1] for more details. For the Robust Vision Challenge, we extended the architecture of MS-RAFT by an additional finer scale, which is realized by an on-demand computation of the matching costs, and an adjusted fine-tuning step combining MS-RAFT's sample-wise robust loss function with a different mixed training strategy for better generalization.…”
Section: Approachmentioning
confidence: 99%
See 3 more Smart Citations