系列论文阅读之知识蒸馏（二）《FitNets : Hints for Thin Deep Nets》

其他 2020-03-15 14:11:17 阅读次数: 0

本文成果：

从一个wide and deep的网路蒸馏成一个thin and deeper的网络。

主要的方法如下图所示：

实际上是在KD的基础上，增加了一个中间层的知识蒸馏。

以下是KD的主要方法：

训练要点：

两个loss function:

（1）Teacher网络的某一中间层的权值为Wt=Whint，Student网络的某一中间层的权值为Ws=Wguided。使用一个映射函数Wr来使得Wguided的维度匹配Whint，得到Ws'。其中对于Wr的训练使用MSEloss：

(2) 另外一个是改造的softmax loss（具体见Hinton的论文）:

liqiming100

发布了61 篇原创文章 · 获赞 12 · 访问量 6万+

私信关注

猜你喜欢

转载自blog.csdn.net/liqiming100/article/details/88935353

系列论文阅读之知识蒸馏（二）《FitNets : Hints for Thin Deep Nets》

知识蒸馏（Distillation）相关论文阅读（3）—— FitNets : Hints for Thin Deep Nets

FitNets: Hints for thin deep nets论文笔记

FitNets: Hints for Thin Deep Nets 原理与代码解析

Distillation论文总结（1）Do Deep Nets Really Need to be Deep?

A Fast Learning Algorithm for Deep Belief Nets - 论文学习

Do Deep Nets Really Need to be Deep?

LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks 论文阅读

文章阅读：Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs

深度学习论文（九）---DeepLabV2-Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution,

深度学习论文（八）---DeepLabV1-SEMANTIC IMAGE SEGMENTATION WITH DEEP CONVOLUTIONAL NETS AND FULLY CONNECTED C

Evaluation of Deep Convolutional Nets for Document Image Classification and Retrieval 论文笔记

论文：SE3-Pose-Nets: Structured Deep Dynamics Models for Visuomotor Planning and Control

论文笔记-DeepLung: Deep 3D Dual Path Nets for Automated Pulmonary Nodule Detection and Classification

论文笔记：DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution,and......

SEMANTIC IMAGE SEGMENTATION WITH DEEP CONVOLUTIONAL NETS AND FULLY CONNECTED CRFS 论文精读

Training Deep Nets with Sublinear Memory Cost

A Taxonomy of Deep Convolutional Neural Nets for Computer Vision

论文阅读-位姿估计-SE3-Nets Learning Rigid Body Motion using Deep Neural Networks

论文阅读笔记十：DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs (DeepLabv2)

Hints

论文阅读——《Generative Adversarial Nets》

Deep Belief Nets in C++ and CUDA C: Volume 3: Convolutional Nets 免积分下载

【deeplab】Semantic Image Segmentation with Deep Convolutional Nets and Fully

【Deep Learning】SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient

Oracle 表连接之Hints

论文阅读——《Conditional Generative Adversarial Nets》

Python type hints 之 Optional，Union

Generative Adversarial Nets (GAN) 阅读笔记

CGAN论文解读：Conditional Generative Adversarial Nets

今日推荐

周排行

【转】mongodb中删除数组内嵌对象文档

php数字金额转换成中文大写显示

枫神之路--Java 的继承机制

四、Spring中使用@Conditional按照条件注册Bean

tomcat中直接使用第3放jar包

进程的创建fork vs vfork

结构体和组合体

“无任何网络提供程序接受指定的网络路径”的解决办法

webpack配置vue项目引入和部分引入

Oracle在不同windows系统中的迁移

每日归档

2024-06-14(0)

2024-06-13(0)

2024-06-12(0)

2024-06-11(0)

2024-06-10(0)

2024-06-09(0)

2024-06-08(0)

2024-06-07(0)

2024-06-06(0)

2024-06-05(0)