Paper Walkthrough: Self-Distillation from the Last Mini-Batch for Consistency Regularization

Posted: 2023-04-08 08:32:32

1. Basic Paper Information

Paper: Self-Distillation from the Last Mini-Batch for Consistency Regularization
Link: https://arxiv.org/pdf/2203.16172.pdf
Code: https://github.com/Meta-knowledge-Lab/DLB
Venue: CVPR 2022

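2. Background and Abstract

Knowledge distillation has already been studied extensively. In essence it is a regularization method: in image classification, adding distillation generally lowers training accuracy, while evaluation accuracy generally improves when the hyperparameters are chosen well.

Distilling from a teacher model generally demands substantial compute, and the procedure is cumbersome. Earlier self-distillation strategies usually require changing the model architecture, for example by adding attention blocks or dropout. This paper improves on those strategies by distilling on the consistency of the model's predictions for the same batch of data, and the resulting method, DLB, reaches SOTA.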
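To make the idea of distilling on prediction consistency concrete, here is a minimal sketch in plain Python. It assumes the loss is the usual cross-entropy plus a temperature-scaled KL term between the logits the network produced for the same samples in the previous iteration and the logits it produces now. The function names, the weight `alpha`, and the temperature `tau` are illustrative assumptions, not values from this post, and the official repository may organize things differently.

```python
import math

def softmax(logits, tau=1.0):
    """Temperature-scaled softmax over one list of logits."""
    scaled = [z / tau for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(logits, label):
    """Standard cross-entropy for one sample with an integer class label."""
    return -math.log(softmax(logits)[label])

def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions given as lists."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0.0)

def consistency_loss(cur_logits, prev_logits, labels, alpha=1.0, tau=3.0):
    """Mean over samples of CE + alpha * tau^2 * KL(prev || cur).

    prev_logits are the logits the same network produced for the same
    samples in the previous iteration; they are treated as constants.
    alpha and tau are assumed hyperparameters (hypothetical defaults).
    """
    total = 0.0
    for cur, prev, y in zip(cur_logits, prev_logits, labels):
        ce = cross_entropy(cur, y)
        kl = kl_divergence(softmax(prev, tau), softmax(cur, tau))
        total += ce + alpha * tau ** 2 * kl
    return total / len(labels)

# Toy example: two samples, three classes.
labels = [0, 1]
prev = [[2.0, 0.5, -1.0], [0.1, 1.5, 0.3]]  # logits saved from the last iteration
cur = [[1.0, 1.0, -0.5], [0.0, 2.0, 0.1]]   # logits from the current iteration
loss = consistency_loss(cur, prev, labels)
```

When the current predictions match the saved ones, the KL term vanishes and the loss reduces to plain cross-entropy, so the extra term only penalizes drift between consecutive iterations. In a real training loop the saved logits would be detached from the computation graph so that gradients flow only through the current predictions.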

3. DLB Method Flowchart

The DLB flowchart is shown below. At each iteration, each batch of data contains $b_t$


Reposted from blog.csdn.net/u012526003/article/details/124560997
