CS231n Course Notes: Lecture 6 - Training Neural Networks I

Contents

Part 1

 Activation Functions

 Data Preprocessing

Weight Initialization

Batch Normalization

Babysitting the learning process

 

 

Part 1

Activation Functions

(Slide images not reproduced here: this part of the lecture compares the sigmoid, tanh, ReLU, Leaky ReLU, ELU, and Maxout activations — whether they saturate and kill gradients, whether their outputs are zero-centered, and how cheap they are to compute.)

In practice: use ReLU (and be careful with your learning rates); try out Leaky ReLU / Maxout / ELU; try tanh but don't expect much; don't use sigmoid.
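For reference, a minimal numpy sketch of these activations (the Leaky ReLU slope of 0.01 and the ELU alpha of 1.0 are assumed common defaults, not values fixed by the lecture):

import numpy as np

def sigmoid(x):
    # squashes to (0, 1); saturates at both tails and kills gradients there
    return 1.0 / (1.0 + np.exp(-x))

def tanh_act(x):
    # zero-centered squashing to (-1, 1); still saturates
    return np.tanh(x)

def relu(x):
    # max(0, x): no saturation for x > 0, cheap, but units can "die" for x < 0
    return np.maximum(0, x)

def leaky_relu(x, alpha=0.01):
    # small negative slope keeps a gradient flowing for x < 0
    return np.where(x > 0, x, alpha * x)

def elu(x, alpha=1.0):
    # smooth saturation toward -alpha on the negative side
    return np.where(x > 0, x, alpha * (np.exp(x) - 1))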

 Data Preprocessing

For background on PCA, see:

PCA (Principal Component Analysis) - 简书 (jianshu.com)
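The standard preprocessing from the lecture is to zero-center the data and optionally normalize each dimension. A minimal sketch, assuming X is an N x D matrix of training data (the epsilon is an assumed numerical guard):

import numpy as np

def preprocess(X):
    # X: (N, D) data matrix
    X = X - np.mean(X, axis=0)          # zero-center each dimension
    X = X / (np.std(X, axis=0) + 1e-8)  # normalize; epsilon guards against zero std
    return X

For images it is common to stop after mean subtraction and skip the normalization step.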

Weight Initialization

Four ways to initialize weights:

Weight initialization: Gaussian, Xavier, MSRA, He (caicaiatnbu's blog, CSDN)

  

A good general rule of thumb is to start with Xavier initialization, and then consider the other methods from there.
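A sketch of these initializations for a single fully connected layer (the layer sizes and the 0.01 Gaussian scale are assumed illustrative values; He initialization is the same scheme the MSRA name refers to):

import numpy as np

fan_in, fan_out = 512, 256  # hypothetical layer sizes

# small Gaussian: fine for shallow nets, but activations collapse toward zero in deep ones
W_gauss = 0.01 * np.random.randn(fan_in, fan_out)

# Xavier: scale by 1/sqrt(fan_in) to keep activation variance roughly constant (suits tanh)
W_xavier = np.random.randn(fan_in, fan_out) / np.sqrt(fan_in)

# He (a.k.a. MSRA): the extra factor of 2 compensates for ReLU zeroing half the inputs
W_he = np.random.randn(fan_in, fan_out) * np.sqrt(2.0 / fan_in)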

Batch Normalization

At test time we do not recompute the mean and variance from the batch; we just use estimates (e.g., running averages) accumulated at training time.
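A minimal sketch of the batch-norm forward pass under that scheme (the momentum and eps values are assumed typical defaults, not specified by the lecture):

import numpy as np

def batchnorm_forward(x, gamma, beta, running_mean, running_var,
                      train=True, momentum=0.9, eps=1e-5):
    # x: (N, D) activations; gamma, beta: learnable scale and shift, shape (D,)
    if train:
        mu = x.mean(axis=0)    # batch mean per feature
        var = x.var(axis=0)    # batch variance per feature
        # accumulate running estimates for use at test time
        running_mean = momentum * running_mean + (1 - momentum) * mu
        running_var = momentum * running_var + (1 - momentum) * var
    else:
        mu, var = running_mean, running_var  # fixed estimates from training
    x_hat = (x - mu) / np.sqrt(var + eps)    # normalize to zero mean, unit variance
    out = gamma * x_hat + beta               # learnable scale and shift
    return out, running_mean, running_var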

Babysitting the learning process

How do we monitor training, and how do we adjust hyperparameters as we go to get a good result?

Code to initialize the network:

import numpy as np

def init_two_layer_model(input_size, hidden_size, output_size):
    model = {}
    model['W1'] = 0.0001 * np.random.randn(input_size, hidden_size)  # small Gaussian weights
    model['b1'] = np.zeros(hidden_size)
    model['W2'] = 0.0001 * np.random.randn(hidden_size, output_size)
    model['b2'] = np.zeros(output_size)
    return model

model = init_two_layer_model(32*32*3, 50, 10)  # input size, hidden size, number of classes
loss, grad = two_layer_net(X_train, model, y_train, 0.0)  # reg = 0.0 disables regularization
print(loss)  # sanity check: with 10 classes and no regularization, expect about ln(10) ~ 2.3

Trying to train: as a sanity check, first make sure the network can overfit a tiny subset of the data (here 20 examples) to near-zero loss.

trainer = ClassifierTrainer()
X_tiny = X_train[:20]  # take 20 examples
y_tiny = y_train[:20]
best_model, stats = trainer.train(X_tiny, y_tiny, X_tiny, y_tiny,
                                  model, two_layer_net,
                                  num_epochs=200, reg=0.0,
                                  update='sgd', learning_rate_decay=1,
                                  sample_batches=False,  # full-batch updates on the tiny set
                                  learning_rate=1e-3, verbose=True)
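Once the tiny-set check passes, the lecture's next step is to go back to the full training set, turn on a small regularization, and find a learning rate that makes the loss decrease. A hedged sketch of that sweep (X_val and y_val are an assumed held-out validation split; the learning-rate range is only a rough ballpark):

for lr in [1e-6, 1e-5, 1e-4, 1e-3]:                # coarse learning-rate sweep
    model = init_two_layer_model(32*32*3, 50, 10)  # fresh weights for each run
    best_model, stats = trainer.train(X_train, y_train, X_val, y_val,
                                      model, two_layer_net,
                                      num_epochs=10, reg=0.000001,
                                      update='sgd', learning_rate_decay=1,
                                      sample_batches=True,
                                      learning_rate=lr, verbose=True)
    # loss barely moving => lr too low; loss exploding to NaN => lr too high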


Reposted from blog.csdn.net/m0_53292725/article/details/126957775