Learning to Learning with Gradients———论文阅读第三部分

其他 2021-11-28 07:07:04 阅读次数: 0

元学习中一个很重要的算法MAML，给出了讲解以及对应的代码，现在继续深入这个算法。

四. 基于模型不可知的元学习算法（MAML）

4.2 几种MAML

4.2.1 监督的回归和分类任务

核心和之前是一样的，对于监督学习的话，单输入单输出，我们的损失函数可以定位为：

回归： $L(\Phi,{D_j}_i)\ =\ \sum_{x^{(i)},y^{(j)}}||f_{\Phi}(x^{(i)})-y_{(j)}||^2_2$

分类： $L(\Phi,{D_j}_i)\ =\ \sum_{x^{(i)},y^{(j)}}y^{(j)}logf_{\Phi}(x^{(i)})\ +\ (1-y^{(j)})log(1-f_{\Phi}(x^{(i)}))$

在这里插入图片描述
对应着就可以了，实现的代码在上期。

4.2.2 强化学习

在这里插入图片描述
强化学习目前还没开始研究所以直接给出代码

4.3 执行和一阶近似

可以发现，之前的MAML算法我们用到了二阶导数，然而二阶导数会增加我们的计算量，因此作者就想用一阶近似去模拟二阶，看是否能代替。优化定义如下：

$\min _\theta\sum_{J_i}L(\theta\ -\ \alpha\ sg(\nabla_\theta L(\theta,D^{tr}_j)),D^{test}_{j_i})$

sg表示来停止梯度的操作，这种近似是将参数更新视为一个常数（ $\theta\ to\ \theta+c$ ）,然后反向去传播这个新的性能任务

4.4 MAML的表达能力

猜你喜欢

转载自blog.csdn.net/qq_45478482/article/details/121302387

Learning to Learning with Gradients———论文阅读第三部分

Learning to Learning with Gradients———论文阅读第一部分

Learning to Learning with Gradients———论文阅读第二部分

Learning

Coursera-Deep Learning Specialization 课程之（五）：Sequence Models: -weak1编程作业（第三部分）

[论文阅读] Learning Loss for Active Learning

【论文精读】Curriculum Learning

Reinforcement learning + OR的论文

maching learning入门（三）

论文阅读：Dual Supervised Learning

[论文阅读] Learning without Memorizing

[论文阅读] Learning Without Forgetting

《Learning to Compare: Relation Network for Few-Shot Learning》论文阅读

迁移学习论文阅读：Transfer Learning via Learning to Transfer

论文阅读之Learning and Generalization of Motor Skills by Learning from Demonstration

Q Learning vs Policy Gradients

experiential learning and passive learning

Deep Learning - Machine Learning

supervised learning|unsupervised learning

Learning Path for Machine Learning

Deep Learning 的阅读笔记（一）

Deep Learning阅读笔记：Chapter 5—Machine Learning Basics(2)

Deep Learning阅读笔记：Chapter 5—Machine Learning Basics(1)

Learning to Generate Questions by Learning What not to Generate阅读笔记

Deep learning 论文笔记

[论文翻译] Learning Without Forgetting

【论文合集】Awesome Transfer Learning

Learning to Design Games Strategic Environments in Reinforcement Learning（部分翻译）

论文翻译--StarCraft Micromanagement with Reinforcement Learning and Curriculum Transfer Learning

读论文：Learning to Compare: Relation Network for Few-Shot Learning

今日推荐

《美国对全球网络空间安全与发展的威胁和破坏》报告发布

火速冲上 GitHub 热榜 —— 开源编程语言、框架哪有这么可爱？

北京人形机器人创新中心发布全球首个纯电驱拟人奔跑的全尺寸人形机器人“天工”

LFOSSA 源来如此公开课 | 掌握云原生未来：CNCF 认证全面攻略与备考秘籍

周排行

让自己的头脑极度开放

CentOS 6.5(x64) 和Redhat6.5操作系误删libc

高可用注册中心

【日记】12.28/【题解】AtCoder AGC041

XML（5）_XML 约束_DTD

Java集合Map（四）

树梅派安装桌面环境教程

pipenv 的使用和安装

小程序白屏问题和内存研究

C语言简单选择排序

每日归档

更多

2024-05-02(0)

2024-05-01(4)

2024-04-30(1)

2024-04-29(40)

2024-04-28(0)

2024-04-27(56)

2024-04-26(39)

2024-04-25(22)

2024-04-24(36)

2024-04-23(26)