Knowledge Distillation(KD) 知识蒸馏 Pytorch实现 - 代码天地

Knowledge Distillation(KD) 知识蒸馏 Pytorch实现

其他 2021-04-06 18:27:08 阅读次数: 0

简单实现，主要为了理解其原理

import torch
import torch.nn as nn
import numpy as np

from torch.nn import CrossEntropyLoss
from torch.utils.data import TensorDataset,DataLoader,SequentialSampler

class model(nn.Module):
	def __init__(self,input_dim,hidden_dim,output_dim):
		super(model,self).__init__()
		self.layer1 = nn.LSTM(input_dim,hidden_dim,output_dim,batch_first = True)
		self.layer2 = nn.Linear(hidden_dim,output_dim)
	def forward(self,inputs):
		layer1_output,layer1_hidden = self.layer1(inputs)
		layer2_output = self.layer2(layer1_output)
		layer2_output = layer2_output[:,-1,:]#取出一个batch中每个句子最后一个单词的输出向量即该句子的语义向量！！！！！！！!！
		return layer2_output

#建立小模型
model_student = model(input_dim = 2,hidden_dim = 8,output_dim = 4)

#建立大模型（此处仍然使用LSTM代替，可以使用训练好的BERT等复杂模型）
model_teacher = model(input_dim = 2,hidden_dim = 16,output_dim = 4)

#设置输入数据，此处只使用随机生成的数据代替
inputs = torch.randn(4,6,2)
true_label = torch.tensor([0,1,0,0])

#生成dataset
dataset = TensorDataset(inputs,true_label)

#生成dataloader
sampler = SequentialSampler(inputs)
dataloader = DataLoader(dataset = dataset,sampler = sampler,batch_size = 2)

loss_fun = CrossEntropyLoss()
criterion  = nn.KLDivLoss()#KL散度
optimizer = torch.optim.SGD(model_student.parameters(),lr = 0.1,momentum = 0.9)#优化器，优化器中只传入了学生模型的参数，因此此处只对学生模型进行参数更新，正好实现了教师模型参数不更新的目的

for step,batch in enumerate(dataloader):
	inputs = batch[0]
	labels = batch[1]
	
	#分别使用学生模型和教师模型对输入数据进行计算
	output_student = model_student(inputs)
	output_teacher = model_teacher(inputs)
	
	#计算学生模型预测结果和教师模型预测结果之间的KL散度
	loss_soft = criterion(output_student,output_teacher)

	#计算学生模型和真实标签之间的交叉熵损失函数值
	loss_hard = loss_fun(output_student,labels)
		
	loss = 0.9*loss_soft + 0.1*loss_hard
	print(loss)
	optimizer.zero_grad()
	loss.backward()
	optimizer.step()

猜你喜欢

转载自blog.csdn.net/hxxjxw/article/details/115294112

Knowledge Distillation(KD) 知识蒸馏 Pytorch实现

Knowledge Distillation(KD) 知识蒸馏

知识蒸馏是什么？（Knowledge Distillation）KD

知识蒸馏（Knowledge Distillation）的Pytorch实现以及分析

知识蒸馏（Knowledge Distillation）

知识蒸馏Knowledge Distillation

Knowledge Distillation 知识蒸馏详解

知识蒸馏简介（Knowledge Distillation）

【知识蒸馏】 Knowledge Distillation from A Stronger Teacher

【知识蒸馏】Knowledge Distillation with the Reused Teacher Classifier

知识蒸馏综述 Knowledge Distillation: A Survey

知识蒸馏（Knowledge distillation）必读论文合集

概念解析 | 知识蒸馏(Knowledge Distillation)

【知识蒸馏】知识蒸馏（Knowledge Distillation）技术详解

一文搞懂【知识蒸馏】【Knowledge Distillation】算法原理

【经典简读】知识蒸馏(Knowledge Distillation) 经典之作

通俗易懂的知识蒸馏 Knowledge Distillation（上）——理论分析

知识蒸馏之Focal and Global Knowledge Distillation for Detectors

[知识蒸馏] Data Efficient Stagewise Knowledge Distillation模型简介

Knowledge Distillation 知识蒸馏之 Hint layer & self-knowledge distillation

【KD】2022 CVPR Decoupled Knowledge Distillation

【KD】2022 TPAMI Quantifying the Knowledge in a DNN to Explain Knowledge Distillation for Clf

知识蒸馏（Distillation）相关论文阅读（1）——Distilling the Knowledge in a Neural Network（以及代码复现）

知识蒸馏学习笔记2--Structured Knowledge Distillation for Semantic Segmentation

【知识蒸馏】 DistPro: Searching A Fast Knowledge Distillation Process via Meta Optimization

多老师知识蒸馏模型——Anomaly detection based on multi-teacher knowledge distillation

通俗易懂的知识蒸馏 Knowledge Distillation（下）——代码实践（附详细注释）

蒸馏法文章选读——Correlation Congruence for Knowledge Distillation

《Distilling the Knowledge in a Neural Network》知识蒸馏

《Distilling the Knowledge in a Neural Network》知识蒸馏

今日推荐

基于大语言模型的开源知识库问答系统 MaxKB GitHub Star 数量突破 5,000 个！

美国拟限制 AI 大模型出口中国和俄罗斯

苹果将与 OpenAI 达成协议，将 ChatGPT 应用于 iPhone

openKylin 社区生态委员会第六次会议圆满召开

阿里云正式发布通义千问 2.5

Python 3.13 发布首个 Beta：实验性自由线程模式和 JIT、改进交互式解释器

Stack Overflow 拿我的代码去训练 AI 大模型，还封了我的账号

Pop!_OS 的 COSMIC 桌面完成 App Store 上架工作

报告：Django 仍然是 74% 开发者的首选

《2024 年一季度互联网投融资运行情况》研究报告

15 年前上了“FFmpeg 耻辱柱”，今天他还得谢谢咱——腾讯QQPlayer一雪前耻？

TIOBE 5 月榜单：Fortran “复活”进入 Top 10

周排行

BPM为企业带来的实际利益

好程序员web前端分享css常用属性缩写

Java文件下载（excel）

css样式的动态添加及显示和隐藏等零碎用法

axios全局配置以及拦截器

使用Logstash来实时同步MySQL和log日志数据到ES

C++获取当前时间（年月日、时分秒、毫秒）

Odoo产品分析 (四) -- 工具板块(11) -- 网站即时聊天(1)

Java环境配置正确，但是java、javac、java -version均返回“不是内部或外部命令，也不是可运行的程序或批处理文件”？

01 官网下载各种CentOS教程（超详细版）

每日归档

更多

2024-05-14(0)

2024-05-13(18)

2024-05-12(0)

2024-05-11(38)

2024-05-10(38)

2024-05-09(35)

2024-05-08(42)

2024-05-07(14)

2024-05-06(40)

2024-05-05(0)