While depth tends to improve network performance, it also makes gradient-based training more difficult, since deeper networks tend to be more non-linear. The recently proposed knowledge distillation approach is aimed at obtaining small and fast-to-execute models, and it has shown that a student network can imitate the soft output of a larger teacher network or ensemble of networks. In this paper, we extend this idea to allow the training of a student that is deeper and thinner than the teacher, using not only the outputs but also the intermediate representations learned by the teacher as hints to improve the training process and final performance of the student. Because the student intermediate hidden layer will generally be smaller than the teacher's intermediate hidden layer, additional parameters are introduced to map the student hidden layer to the prediction of the teacher hidden layer. This allows one to train deeper students that can generalize better or run faster, a trade-off that is controlled by the chosen student capacity. For example, on CIFAR-10, a deep student network with almost 10.4 times fewer parameters outperforms a larger, state-of-the-art teacher network.
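The hint-training idea described above can be sketched in plain PyTorch: the student's intermediate ("guided") layer is passed through a small learned regressor, here a 1x1 convolution (one common choice; the paper also discusses convolutional and fully connected regressors), and its output is compared to the teacher's hint layer with an L2 loss. The module name, channel counts, and the optional spatial resizing below are illustrative assumptions, not this repository's implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class HintRegressor(nn.Module):
    """Maps a (smaller) student feature map to the teacher's channel width
    so the two can be compared with an L2 hint loss."""

    def __init__(self, student_channels: int, teacher_channels: int):
        super().__init__()
        # A 1x1 convolution is a lightweight regressor choice (an assumption here,
        # not necessarily the regressor used by this repo's config).
        self.regressor = nn.Conv2d(student_channels, teacher_channels, kernel_size=1)

    def forward(self, student_feat: torch.Tensor, teacher_feat: torch.Tensor) -> torch.Tensor:
        # Project student features into the teacher's feature space ...
        projected = self.regressor(student_feat)
        # ... resize if the spatial resolutions differ ...
        if projected.shape[-2:] != teacher_feat.shape[-2:]:
            projected = F.interpolate(
                projected, size=teacher_feat.shape[-2:],
                mode="bilinear", align_corners=False)
        # ... and penalize the squared distance to the teacher's hint layer.
        return F.mse_loss(projected, teacher_feat)
```

Detaching the teacher features when calling this module keeps gradients out of the frozen teacher; in the original two-stage FitNets scheme, the hint loss is used to pre-train the student up to the guided layer before the full knowledge-distillation stage.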
| Location | Dataset | Teacher | Student | Acc | Acc(T) | Acc(S) | Config | Download |
| :---------------- | :------- | :------- | :------- | :---- | :----- | :----- | :----- | :----------------------- |
| backbone & logits | ImageNet | resnet50 | resnet18 | 70.58 | 76.55 | 69.90 | config | teacher \| model \| log |
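The "backbone & logits" location in the table above means the distillation signal comes from both the teacher's backbone features (the hint loss sketched earlier) and its logits (temperature-scaled soft targets). A minimal sketch of such a combined objective is below; the temperature `T` and the weights `alpha`/`beta` are placeholder values and do not correspond to the configuration behind the reported numbers.

```python
import torch.nn.functional as F


def distillation_loss(student_logits, teacher_logits, labels,
                      student_feat, teacher_feat, regressor,
                      T: float = 4.0, alpha: float = 0.5, beta: float = 1.0):
    """Combined 'backbone & logits' objective: cross-entropy on labels,
    temperature-scaled KL on logits, and a hint loss on backbone features.
    T, alpha, and beta are illustrative, not the settings used above."""
    # Standard supervised loss on the ground-truth labels.
    ce = F.cross_entropy(student_logits, labels)
    # Soft-target loss on the logits (Hinton-style knowledge distillation).
    kd = F.kl_div(
        F.log_softmax(student_logits / T, dim=1),
        F.softmax(teacher_logits.detach() / T, dim=1),
        reduction="batchmean",
    ) * (T * T)
    # Hint loss on backbone features; `regressor` is e.g. the HintRegressor
    # sketched earlier in this README.
    hint = regressor(student_feat, teacher_feat.detach())
    return ce + alpha * kd + beta * hint
```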
@inproceedings{DBLP:journals/corr/RomeroBKCGB14,
  author    = {Adriana Romero and Nicolas Ballas and Samira Ebrahimi Kahou and Antoine Chassang and Carlo Gatta and Yoshua Bengio},
editor = {Yoshua Bengio and Yann LeCun},
title = {FitNets: Hints for Thin Deep Nets},
booktitle = {3rd International Conference on Learning Representations, {ICLR} 2015,
San Diego, CA, USA, May 7-9, 2015, Conference Track Proceedings},
year = {2015},
url = {http://arxiv.org/abs/1412.6550},
timestamp = {Thu, 25 Jul 2019 14:25:38 +0200},
biburl = {https://dblp.org/rec/journals/corr/RomeroBKCGB14.bib},
bibsource = {dblp computer science bibliography, https://dblp.org}
}