2020
DOI: 10.1609/aaai.v34i07.6613
|View full text |Cite
|
Sign up to set email alerts
|

Learning End-to-End Scene Flow by Distilling Single Tasks Knowledge

Abstract: Scene flow is a challenging task aimed at jointly estimating the 3D structure and motion of the sensed environment. Although deep learning solutions achieve outstanding performance in terms of accuracy, these approaches divide the whole problem into standalone tasks (stereo and optical flow) addressing them with independent networks. Such a strategy dramatically increases the complexity of the training procedure and requires power-hungry GPUs to infer scene flow barely at 1 FPS. Conversely, we propose DWARF, a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
26
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 31 publications
(26 citation statements)
references
References 49 publications
0
26
0
Order By: Relevance
“…As shown in table 1, the refinement model improves over the feedforward baseline in all three metrics, thus supporting the effectiveness of the consistency guided refinement process. Our model also outperforms DWARF [1] by a large margin. Again, this shows the benefit of geometrically modelling the consistency of scene flow outputs.…”
Section: Evaluation On Synthetic Datamentioning
confidence: 72%
See 3 more Smart Citations
“…As shown in table 1, the refinement model improves over the feedforward baseline in all three metrics, thus supporting the effectiveness of the consistency guided refinement process. Our model also outperforms DWARF [1] by a large margin. Again, this shows the benefit of geometrically modelling the consistency of scene flow outputs.…”
Section: Evaluation On Synthetic Datamentioning
confidence: 72%
“…To leverage correlation between tasks, Jiang et al [15] propose an encoder architecture shared among the tasks of disparity, optical flow, and segmentation. Similarly, Aleotti et al [1] propose a lightweight architecture to share information between tasks. Complementary to previous work, our main objective is to improve generalization of the scene flow by means of additional constraints, self-supervised losses, and learnt refinement schemes.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…The benefit of our split decoder is that competitive accuracy is achieved more stably and in fewer training iterations (at 56% of the full training schedule), with a lighter network (∼ 10% fewer parameters). 1 Figure 2. Decoder configuration: (a) A single joint decoder [23], (b) removing the context network, and (c) our split decoder design.…”
Section: Refined Backbone Architecturementioning
confidence: 99%