深度学习概念（术语）：Fine-tuning、Knowledge Distillation, etc

其他 2023-09-15 17:34:18 阅读次数: 0

文章目录

1.Fine-tuning (微调)
2.Transfer Learning (迁移学习)
3.Knowledge Distillation (知识蒸馏)
4.Meta Learning (元学习)

这里的相关概念都是基于已有预训练模型，就是模型本身已经训练好，有一定泛化能力。需要“再加工”满足别的任务需求。

进入后GPT时代，对模型的Fine-tuning也将成为趋势，借此机会，我来科普下相关概念。

1.Fine-tuning (微调)

有些人认为微调和训练没有区别，都是训练模型，但是微调是在原模型训练好的的基础上，做针对性的再训练。微调一般用额外的数据集，降低学习率让模型适应特定任务。

2.Transfer Learning (迁移学习)

迁移学习大意是让模型适应新的任务，这涉及模型的改进和再训练。可以把微调看作是迁移学习的一种。

相比微调，迁移学习很多时候并不需要训练原有模型，可以只训练一部分，或者给模型加1-2层后，用元模型的输出作为迁移学习的输入，训练额外添加部分即可。

3.Knowledge Distillation (知识蒸馏)

KD目标是用一个小模型去学习大模型的能力，在保证基线性能的前提下，降低模型的参数和复杂度。

4.Meta Learning (元学习)

Learning to Learning，就是学会学习，这个概念并不需要预训练模型。元学习是指模型学习各类任务数据，然后学会各类任务的共性，从而适应新的任务。

猜你喜欢

转载自blog.csdn.net/JishuFengyang/article/details/132782541

深度学习概念（术语）：Fine-tuning、Knowledge Distillation, etc

知识蒸馏（Knowledge Distillation）

Knowledge Distillation examples

BERT and Knowledge Distillation

知识蒸馏Knowledge Distillation

【随记】Knowledge Distillation: A Survey

On the Efficacy of Knowledge Distillation 解析

Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature Distillation

Residual Knowledge Distillation论文精度

Knowledge Distillation 知识蒸馏详解

Knowledge Distillation(KD) 知识蒸馏

论文解读：Decoupled Knowledge Distillation

知识蒸馏简介（Knowledge Distillation）

概念解析 | 知识蒸馏(Knowledge Distillation)

[论文解读]Explaining Knowledge Distillation by Quantifying the Knowledge

【Deep Learning】Sequence-Level Knowledge Distillation

【Distill 系列：三】On the Efficacy of Knowledge Distillation

Knowledge Distillation(KD) 知识蒸馏 Pytorch实现

【随记】Knowledge distillation in deep learning and its applications

【随记】The State Of Knowledge Distillation ForClassification Tasks

知识蒸馏是什么？（Knowledge Distillation）KD

【KD】2022 CVPR Decoupled Knowledge Distillation

【知识蒸馏】Knowledge Distillation with the Reused Teacher Classifier

【知识蒸馏】 Knowledge Distillation from A Stronger Teacher

【综述】2021-Knowledge Distillation: A Survey

知识蒸馏综述 Knowledge Distillation: A Survey

知识蒸馏（Knowledge distillation）必读论文合集

PKDGAN: Private Knowledge Distillation with Generative Adversarial Networks

Knowledge Distillation 知识蒸馏之 Hint layer & self-knowledge distillation

Channel Distillation: Channel-Wise Attention for Knowledge Distillation 原理与代码解析

今日推荐

手把手教你用 LangChain 实现大模型 Agent

外星人入侵（python）

超全的免费chatGPT列表【建议收藏】

52.2k star! 自己部署gpt4free, 免费使用各种GPT

2024年（第十届）全国大学生统计建模大赛优秀论文解析——中国经济发展与碳排放库兹涅茨曲线的验证研究

【自动驾驶技术】自动驾驶汽车AI芯片汇总——NVIDIA篇

7个免费的ChatGPT网站，给大家送上

Angular v18 正式发布！

【VMware】 vCenter Converter standalone 6.6.0正式版下载

开源日报 | Angular v18；大模型价格战下的推理优化；Mistral AI以开源模型瞄准美国市场；硅谷有自己的鲁迅

数学建模Matlab之数据预处理方法

充电桩---ISO15118协议详细介绍

周排行

慧测学习课件

Mscordacwks.dll/SOS.dll 调试归档

关于深度学习人工智能模型的探讨（二）（7）

Stop Using the text-indent:-9999px

Least Common Multiple（HDU - 1019 ）

Comparator接口的使用方法--例子

修改framework Camera的API,旋转摄像头

机器学习时代的“大数据+”：数据平台的设计与搭建

vue 项目部署到nginx

webstorm 常用插件集合

每日归档

更多

2024-05-29(65)

2024-05-28(2)

2024-05-27(56)

2024-05-26(6)

2024-05-25(68)

2024-05-24(65)

2024-05-23(9)

2024-05-22(41)

2024-05-21(8)

2024-05-20(36)