2020 IEEE Winter Conference on Applications of Computer Vision (WACV) 2020
DOI: 10.1109/wacv45572.2020.9093302
|View full text |Cite
|
Sign up to set email alerts
|

FlowNet3D++: Geometric Losses For Deep Scene Flow Estimation

Abstract: We present FlowNet3D++, a deep scene flow estimation network. Inspired by classical methods, FlowNet3D++ incorporates geometric constraints in the form of point-toplane distance and angular alignment between individual vectors in the flow field, into FlowNet3D [21]. We demonstrate that the addition of these geometric loss terms improves the previous state-of-art FlowNet3D accuracy from 57.85% to 63.43%. To further demonstrate the effectiveness of our geometric constraints, we propose a benchmark for flow estim… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
72
1

Year Published

2020
2020
2022
2022

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 119 publications
(73 citation statements)
references
References 37 publications
0
72
1
Order By: Relevance
“…There is a rich literature of building learned models for scene flow using end-to-end learned architectures [30], [31], [8], [7], [14], [13], [32], [25] as well as hybrid architectures [33], [34], [35]. We discuss these in Section V in conjunction with building a scalable baseline model that operates in real time.…”
Section: Models For Learning Scene Flowmentioning
confidence: 99%
See 1 more Smart Citation
“…There is a rich literature of building learned models for scene flow using end-to-end learned architectures [30], [31], [8], [7], [14], [13], [32], [25] as well as hybrid architectures [33], [34], [35]. We discuss these in Section V in conjunction with building a scalable baseline model that operates in real time.…”
Section: Models For Learning Scene Flowmentioning
confidence: 99%
“…Furthermore, previous datasets cover a smaller area, e.g., the KITTI scene flow dataset covers 1/5th the area of our proposed dataest. This allows for different subsampling tradeoffs and inspired a class of models that are not able to tractably scale training and inference beyond ∼10K points [7], [8], [9], [13], [14], making the usage of such models impractical in real world AV scenes which often contain 100K -1000K points.…”
Section: Introductionmentioning
confidence: 99%
“…Un-/self-supervised approaches [31,32,69] aim to overcome the dependency on accurate, diverse labeled data, which is not easy to obtain. Approaches using sequences of RGB-D images [16,19,21,37,50,51] or 3D point clouds [4,17,35,45,70,72] have been also proposed, exploiting an already given 3D sparse point input.…”
Section: Related Workmentioning
confidence: 99%
“…As suggested in [19] for the task of scene flow estimation we investigate regularization by adding the cosine distance Lcos dist between the ground truth y = [t, r] T and predictionŷ = [t,r] T , according to eq. (5), as a regularization term to the overall loss function.…”
Section: Regularizationmentioning
confidence: 99%