6.3 Loss Functions and Their Gradients
MSE
- $\text{loss} = \dfrac{1}{N}\sum_{i}\left[y_i - f_\theta(x_i)\right]^2$

Derivative:

$\dfrac{\partial\,\text{loss}}{\partial\theta} = \dfrac{2}{N}\sum_{i}\left[f_\theta(x_i) - y_i\right]\dfrac{\partial f_\theta(x_i)}{\partial\theta}$
import tensorflow as tf

x = tf.random.normal([1, 3])
w = tf.ones([3, 2])
b = tf.ones([2])
y = tf.constant([0., 1.])  # one-hot target; float so MSE's dtypes match

with tf.GradientTape() as tape:
    tape.watch([w, b])          # w, b are plain tensors, so watch them explicitly
    logits = tf.sigmoid(x @ w + b)   # sigmoid output, i.e. already a probability
    loss = tf.reduce_mean(tf.losses.MSE(y, logits))
grads = tape.gradient(loss, [w, b])
print('w grad:', grads[0])
print('b grad:', grads[1])
w grad: tf.Tensor(
[[-0.0309717 0.04595036]
[-0.1207474 0.17914379]
[ 0.01667333 -0.02473696]], shape=(3, 2), dtype=float32)
b grad: tf.Tensor([ 0.09684253 -0.14367795], shape=(2,), dtype=float32)
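As a sanity check, the same gradients can be derived by hand: with $p = \sigma(xw + b)$ and two outputs, the chain rule above gives $\partial\,\text{loss}/\partial b_j = (p_j - y_j)\,p_j(1 - p_j)$ and $\partial\,\text{loss}/\partial w = x^{\mathsf T}$ times that row. A minimal sketch, assuming TF 2.x (the seed is added here only so the check is reproducible):

import tensorflow as tf

tf.random.set_seed(42)
x = tf.random.normal([1, 3])
w = tf.ones([3, 2])
b = tf.ones([2])
y = tf.constant([0., 1.])

with tf.GradientTape() as tape:
    tape.watch([w, b])
    p = tf.sigmoid(x @ w + b)
    loss = tf.reduce_mean(tf.losses.MSE(y, p))
grads = tape.gradient(loss, [w, b])

# Chain rule: loss = (1/2) * sum_j (y_j - p_j)^2 and sigmoid' = p * (1 - p),
# so dloss/db_j = (p_j - y_j) * p_j * (1 - p_j) and dloss/dw = x^T @ db.
db = (p - y) * p * (1 - p)     # shape (1, 2)
dw = tf.transpose(x) @ db      # shape (3, 2)
print(tf.reduce_max(tf.abs(grads[1] - tf.squeeze(db, 0))))  # ≈ 0
print(tf.reduce_max(tf.abs(grads[0] - dw)))                 # ≈ 0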
Cross Entropy
softmax:
- converts logits into probability values (non-negative, summing to 1)
- enlarges the gaps between values: larger logits receive disproportionately more probability mass (see the sketch below)
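Both properties are easy to verify numerically; a quick sketch (the logit values are just illustrative):

import tensorflow as tf

a = tf.constant([2.0, 1.0, 0.1])
p = tf.nn.softmax(a)
print(p)                 # ~[0.659, 0.242, 0.099] — each in [0, 1]
print(tf.reduce_sum(p))  # 1.0 — a valid probability distribution
# The 2:1 ratio between the top two logits becomes roughly 2.7:1
# in probability space, i.e. softmax stretches the differences.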
Derivative of softmax:

With $p_i = \dfrac{e^{a_i}}{\sum_k e^{a_k}}$, the quotient rule gives:

$\dfrac{\partial p_i}{\partial a_j} = p_i\left(\delta_{ij} - p_j\right)$, i.e. $p_i(1 - p_i)$ when $i = j$ and $-p_i\,p_j$ when $i \neq j$.
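This closed form can be checked against autodiff via GradientTape.jacobian; a small sketch, assuming TF 2.x:

import tensorflow as tf

a = tf.constant([[2.0, 1.0, 0.1]])
with tf.GradientTape() as tape:
    tape.watch(a)
    p = tf.nn.softmax(a)
jac = tf.reshape(tape.jacobian(p, a), [3, 3])  # drop the batch axes

# Analytic form: dp_i/da_j = p_i * (delta_ij - p_j) = diag(p) - p p^T
p = tf.reshape(p, [3])
analytic = tf.linalg.diag(p) - tf.tensordot(p, p, axes=0)
print(tf.reduce_max(tf.abs(jac - analytic)))   # ≈ 0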
x = tf.random.normal([1, 3])
w = tf.random.normal([3, 2])
b = tf.random.normal([2])
y = tf.constant([0., 1.])  # one-hot target; float to match the logits

with tf.GradientTape() as tape:
    tape.watch([w, b])
    logits = x @ w + b      # raw scores; softmax is applied inside the loss
    loss = tf.reduce_mean(
        tf.losses.categorical_crossentropy(y, logits, from_logits=True))
grads = tape.gradient(loss, [w, b])
print('w grad:', grads[0])
print('b grad:', grads[1])
w grad: tf.Tensor(
[[ 0.23242529 -0.23242529]
[ 0.9089024 -0.9089024 ]
[ 0.58031267 -0.58031267]], shape=(3, 2), dtype=float32)
b grad: tf.Tensor([ 0.6567008 -0.6567008], shape=(2,), dtype=float32)
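Note how each column of w grad is the negative of the other. That is expected: combining the softmax Jacobian above with $\partial\,\text{CE}/\partial p_i = -y_i/p_i$ collapses the gradient with respect to the logits to $p - y$, and with two classes $(p_0 - y_0) = -(p_1 - y_1)$. A sketch verifying this under the same setup:

import tensorflow as tf

x = tf.random.normal([1, 3])
w = tf.random.normal([3, 2])
b = tf.random.normal([2])
y = tf.constant([0., 1.])

with tf.GradientTape() as tape:
    tape.watch([w, b])
    logits = x @ w + b
    loss = tf.reduce_mean(
        tf.losses.categorical_crossentropy(y, logits, from_logits=True))
grads = tape.gradient(loss, [w, b])

# dloss/dlogits = softmax(logits) - y, hence dw = x^T (p - y), db = p - y
# (batch size is 1, so the reduce_mean does not rescale anything).
p = tf.nn.softmax(logits)
db = p - y                     # shape (1, 2); its two entries sum to 0
dw = tf.transpose(x) @ db      # shape (3, 2)
print(tf.reduce_max(tf.abs(grads[1] - tf.squeeze(db, 0))))  # ≈ 0
print(tf.reduce_max(tf.abs(grads[0] - dw)))                 # ≈ 0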