Knowledge Distillation Based on Fitting Ground-Truth Distribution of Images

Knowledge distillation based on the features from the penultimate layer allows the student (lightweight model) to efficiently mimic the internal feature outputs of the teacher (high-capacity model). However, the training data may not conform to the ground-truth distribution of images in terms of cla...

Full description

Bibliographic Details
Main Authors: Jianze Li, Zhenhua Tang, Kai Chen, Zhenlei Cui
Format: Article
Language:English
Published: MDPI AG 2024-04-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/14/8/3284