2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019
DOI: 10.1109/cvpr.2019.00294
|View full text |Cite
|
Sign up to set email alerts
|

Student Becoming the Master: Knowledge Amalgamation for Joint Scene Parsing, Depth Estimation, and More

Abstract: In this paper, we investigate a novel deep-model reusing task. Our goal is to train a lightweight and versatile student model, without human-labelled annotations, that amalgamates the knowledge and masters the expertise of two pretrained teacher models working on heterogeneous problems, one on scene parsing and the other on depth estimation. To this end, we propose an innovative training strategy that learns the parameters of the student intertwined with the teachers, achieved by "projecting" its amalgamated f… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
29
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
5
3
2

Relationship

1
9

Authors

Journals

citations
Cited by 70 publications
(29 citation statements)
references
References 27 publications
0
29
0
Order By: Relevance
“…However, determining the focal distance solely from a single two-dimensional image is not a straight-forward operation. One potential solution is to use a neural network estimate of the depth map from a two-dimensional image [27][28][29][30]. From our framework we extract the location of the strongest edges in the frame; therefore, we can map the edge strengths to the predicted depth map to obtain the focus distance.…”
Section: Depth Of Fieldmentioning
confidence: 99%
“…However, determining the focal distance solely from a single two-dimensional image is not a straight-forward operation. One potential solution is to use a neural network estimate of the depth map from a two-dimensional image [27][28][29][30]. From our framework we extract the location of the strongest edges in the frame; therefore, we can map the edge strengths to the predicted depth map to obtain the focus distance.…”
Section: Depth Of Fieldmentioning
confidence: 99%
“…Knowledge distillation methods have been widely used in many vision tasks, including object detection [30,6,13], line detection [20], semantic segmentation [62,18,34] and human pose estimation [66,40,56,58]. DOPE [58] proposes to distill the 2D and 3D poses from three independent body part expert models to the single whole-body pose detection model.…”
Section: Related Workmentioning
confidence: 99%
“…Recently, image segmentation has been used in a variety of computer vision tasks, such as depth prediction [37], virtual try-on [20], scene understanding [38], [39] and image generation [40]. The use of SLIC can obtain color parsing to extend the line hint to the local area, thereby providing more color information to the interactive colorization.…”
Section: ) Global Hintmentioning
confidence: 99%