KDGAN: Knowledge distillation with generative adversarial networks论文笔记 - 代码天地

KDGAN: Knowledge distillation with generative adversarial networks论文笔记

业界资讯 2023-07-29 02:50:00 阅读次数: 0

论文地址：http://papers.nips.cc/paper/7358-kdgan-knowledge-distillation-with-generative-adversarial-networks.pdf
github地址：https://github.com/xiaojiew1/KDGAN/

Motivation

在训练轻量级分类器时，知识蒸馏虽只需少量样本和训练次数就能收敛，但难以从teacher那里学习到真实的数据分布（real data），而另一种方法，通过GAN对分类器进行对抗性训练学习数据的真实分布，却由于高方差的梯度更新，需要很长时间才能达到平衡。为了解决上述限制，本文提出KDGAN的框架，该框架由一个分类器（student net）、一个teacher net和一个discriminator组成。分类器和教师通过蒸馏损失相互学习，并通过对抗性损失对分类器进行对抗性训练。

Method

文章以多标签分类任务为例展开研究。KDGAN的框架如图所示。除了KD中的teacher net到分类器的蒸馏损失以及NaGAN（naive gan）中的分类器和discriminator的对抗损失外，作者还定义了从分类器到teacher net的蒸馏损失以及teacher net与discriminator之间的对抗性损失。即分类器与teacher net均作为generator，生成的标签均被discriminator视为假。同时，分类器和teacher net通过互相蒸馏软标签的方式互相学习彼此的知识，从而就生成什么伪标签达成一致。

KDGAN

为了加快KDGAN的训练，作者一方面经验性地认为分类器接收到的梯度中来自teacher的梯度的方差会小于discriminator的梯度的方差，因此加权平均后小于原来只用GAN训练的梯度方差，从而能够快速收敛。两一方面，由于分类器和teacher生成的离散样本是不可微的，因此作者使用Gumbel-Max技巧将离散样本的分布转化为连续的分布。从而能够传递梯度值。

模型的具体算法步骤如下：其中，三个部分都需要经过预训练，接着在每个epoch中依次更新三个部分数次。
training

Experiment

KD loss：L2loss、KL Divergence
实验次数：10次
应用场景：模型压缩，图像标签推荐
数据集：MNIST，CIFAR-10，YFCC100M

Results

MNIST
Hyperparameters
YFCC100M

Thoughts

和我想象的KDGAN不太一样，有必要复现一下。

猜你喜欢

转载自blog.csdn.net/qq_43812519/article/details/105474815

KDGAN: Knowledge distillation with generative adversarial networks论文笔记

PKDGAN: Private Knowledge Distillation with Generative Adversarial Networks

【论文笔记】Generative Adversarial Networks

Feature-map-level Online Adversarial Knowledge Distillation论文笔记

论文笔记：Eye In-Painting with Exemplar Generative Adversarial Networks

论文笔记：Self-Attention Generative Adversarial Networks

论文笔记：Generative Adversarial Imitation Learning

论文阅读 - FedACK: Federated Adversarial Contrastive Knowledge Distillation for Cross-Lingual

Generative Adversarial Networks

Triangle Generative Adversarial Networks

UNROLLED GENERATIVE ADVERSARIAL NETWORKS

[论文解读]Explaining Knowledge Distillation by Quantifying the Knowledge

Residual Knowledge Distillation论文精度

论文解读：Decoupled Knowledge Distillation

Private Model Compression via Knowledge Distillation 论文笔记

Learning efficient object detection models with knowledge distillation论文笔记

Preparing Lessons: Improve Knowledge Distillation with Better Supervision论文笔记

Generative Adversarial Nets笔记

论文笔记：Conditional Coupled Generative Adversarial Networks for Zero-Shot Domain Adaptation

【GAN】【论文笔记】A Style-Based Generator Architecture for Generative Adversarial Networks

[论文理解] On the "steerability" of generative adversarial networks

Understanding Generative Adversarial Networks (GAN)

CONSISTENCY REGULARIZATION FOR GENERATIVE ADVERSARIAL NETWORKS

【论文笔记】生成对抗网络Generative adversarial nets

论文阅读——《Generative Adversarial Nets》

Generative Adversarial Networks: An Overview文献阅读笔记

Conditional Generative Adversarial Networks（CGAN）笔记

知识蒸馏（Knowledge distillation）必读论文合集

【CVPR2020 论文翻译】 | Explaining Knowledge Distillation by Quantifying the Knowledge

论文笔记|CVPR2023:Supervised Masked Knowledge Distillation for Few-Shot Transformers

今日推荐

《美国对全球网络空间安全与发展的威胁和破坏》报告发布

火速冲上 GitHub 热榜 —— 开源编程语言、框架哪有这么可爱？

北京人形机器人创新中心发布全球首个纯电驱拟人奔跑的全尺寸人形机器人“天工”

LFOSSA 源来如此公开课 | 掌握云原生未来：CNCF 认证全面攻略与备考秘籍

周排行

让自己的头脑极度开放

CentOS 6.5(x64) 和Redhat6.5操作系误删libc

高可用注册中心

【日记】12.28/【题解】AtCoder AGC041

XML（5）_XML 约束_DTD

Java集合Map（四）

树梅派安装桌面环境教程

pipenv 的使用和安装

小程序白屏问题和内存研究

C语言简单选择排序

每日归档

更多

2024-05-02(0)

2024-05-01(4)

2024-04-30(1)

2024-04-29(40)

2024-04-28(0)

2024-04-27(56)

2024-04-26(39)

2024-04-25(22)

2024-04-24(36)

2024-04-23(26)