2021
DOI: 10.48550/arxiv.2101.08471
Preprint

Collaborative Teacher-Student Learning via Multiple Knowledge Transfer

Abstract: Knowledge distillation (KD), as an efficient and effective model compression technique, has been receiving considerable attention in deep learning. The key to its success is to transfer knowledge from a large teacher network to a small student one. However, most of the existing knowledge distillation methods consider only one type of knowledge learned from either instance features or instance relations via a specific distillation strategy in teacher-student learning. There are few works that explore the idea o…
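For orientation only, the sketch below illustrates the standard logit-based teacher-student distillation loss (soft targets via KL divergence plus cross-entropy on ground-truth labels) that the abstract's "knowledge transfer from a large teacher to a small student" builds on. This is a generic baseline, not the multiple-knowledge-transfer method proposed in the preprint; the temperature T and weight alpha are illustrative hyperparameters.

```python
import torch
import torch.nn.functional as F

def kd_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.5):
    """Generic teacher-student distillation loss (baseline sketch, not the paper's method)."""
    # Soft targets: KL divergence between temperature-softened teacher and student distributions.
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=1),
        F.softmax(teacher_logits / T, dim=1),
        reduction="batchmean",
    ) * (T * T)
    # Hard targets: standard cross-entropy against the ground-truth labels.
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard
```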

Cited by 1 publication
References 44 publications