2022
DOI: 10.48550/arxiv.2205.01915
Preprint

Generalized Knowledge Distillation via Relationship Matching

Abstract: The knowledge of a well-trained deep neural network (a.k.a. the "teacher") is valuable for learning similar tasks. Knowledge distillation extracts knowledge from the teacher and integrates it into the target model (a.k.a. the "student"), which expands the student's knowledge and improves its learning efficacy. Instead of requiring the teacher to work on the same task as the student, we borrow the knowledge from a teacher trained on a general label space; in this "Generalized Knowledge Distillation" (GKD), th…
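For reference, the standard teacher-student distillation objective the abstract alludes to can be sketched in a few lines of PyTorch. This is a minimal illustration of vanilla knowledge distillation (Hinton et al., 2015), not the paper's relationship-matching method; the function name and the temperature and alpha hyperparameters are illustrative assumptions, not values from the paper.

import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=4.0, alpha=0.5):
    # Vanilla KD sketch (not the paper's GKD): weighted sum of the
    # cross-entropy on ground-truth labels and the KL divergence between
    # temperature-softened teacher and student distributions.
    soft_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    log_soft_student = F.log_softmax(student_logits / temperature, dim=-1)
    # The KL term is scaled by T^2 so its gradient magnitude stays
    # comparable as the temperature changes.
    kd = F.kl_div(log_soft_student, soft_teacher,
                  reduction="batchmean") * temperature ** 2
    ce = F.cross_entropy(student_logits, labels)
    return alpha * kd + (1.0 - alpha) * ce

Note that this sketch assumes teacher and student share one label space; the paper's GKD setting relaxes exactly that assumption by letting the teacher come from a more general label space.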

Cited by 0 publications
References 65 publications
