TensorFlow笔记（1）非线性回归、MNIST手写数字识别

程序

import tensorflow as tf
import numpy as np
import matplotlib.pyplot as plt

# numpy生成200个随机点，下面这么写可以得到200行1列的矩阵
x_data = np.linspace(-0.5, 0.5, 200)[:, np.newaxis]
noise = np.random.normal(0, 0.02, x_data.shape)
y_data = np.square(x_data) + noise

# 定义两个placeholder，形状由样本定义（所以只有1列）
x = tf.placeholder(tf.float32, [None, 1])
y = tf.placeholder(tf.float32, [None, 1])

# 定义神经网络中间层
Weights_L1 = tf.Variable(tf.random_normal([1, 10]))
biases_L1 = tf.Variable(tf.zeros([1, 10]))
Wx_plus_b_L1 = tf.matmul(x, Weights_L1,) + biases_L1
L1 = tf.nn.tanh(Wx_plus_b_L1)

# 定义输出层
Weights_L2 = tf.Variable(tf.random_normal([10, 1]))
biases_L2 = tf.Variable(tf.zeros([1, 1]))
Wx_plus_b_L2 = tf.matmul(L1, Weights_L2) + biases_L2
prediction = tf.nn.tanh(Wx_plus_b_L2)

# 二次代价函数
loss = tf.reduce_mean(tf.square(y - prediction))
# 使用梯度下降法
train_step = tf.train.GradientDescentOptimizer(0.1).minimize(loss)

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    for _ in range(2000):
        sess.run(train_step, feed_dict={x:x_data, y:y_data})
    # 获得预测值
    prediction_value = sess.run(prediction, feed_dict={x:x_data})
    
    # 画图
    plt.figure()
    plt.scatter(x_data, y_data)
    plt.plot(x_data, prediction_value, 'r-', lw=5)
    plt.show()

思路

总体思路其实很简单，其实就是定义了输入层是1，输出层是1，隐层是10的BP神经网络，喂入数据后计算预测值与实际值的loss（就是程序的loss部分，这里用了二次代价函数），最后通过loss来计算出梯度下降的方向。还是很基础的程序，只是后面多了画图的步骤（plt部分其实可以没有，但是画图出来可以更直观，这部分代码如果没办法理解的话可以先跳过）。

MNIST手写数字识别（BP网络）

程序

import tensorflow as tf
from tensorflow.examples.tutorials.mnist import input_data

# 下面这行代码会将数据集下载到脚本目录下的MNIST_data目录中，如果没有这个文件夹会自动创建，如果网络不好的话可以自己手动下载然后移动数据集到这个文件夹中
mnist = input_data.read_data_sets('MNIST_data', one_hot=True)

# 每个批次的大小
batch_size = 100
# 计算总共多少批次
n_batch = mnist.train.num_examples // batch_size

# 定义两个placeholder
x = tf.placeholder(tf.float32, [None, 784])
y = tf.placeholder(tf.float32, [None, 10])

# 创建一个简单的神经网络
W = tf.Variable(tf.zeros([784, 10]))
b = tf.Variable(tf.zeros([10]))
prediction = tf.nn.softmax(tf.matmul(x, W) + b)

# 二次代价函数
loss = tf.reduce_mean(tf.square(y - prediction))
# 梯度下降
train_step = tf.train.GradientDescentOptimizer(0.2).minimize(loss)

# 初始化变量
init = tf.global_variables_initializer()

# 返回结果是布尔型列表， argmax返回一维张量中最大值所在的位置
correct_prediction = tf.equal(tf.argmax(y, 1), tf.argmax(prediction, 1))
# 求正确率，先用cast转换类型，True变成1，False变成0，然后再求平均值
accuracy = tf.reduce_mean(tf.cast(correct_prediction, tf.float32))

with tf.Session() as sess:
    sess.run(init)
    for epoch in range(21):
        for batch in range(n_batch):
            batch_xs, batch_ys = mnist.train.next_batch(batch_size)
            sess.run(train_step, feed_dict={x:batch_xs, y:batch_ys})
        acc = sess.run(accuracy, feed_dict={x:mnist.test.images, y:mnist.test.labels})
        print('Iter' + str(epoch) + ', Testing Accuracy ' + str(acc))

备注：上面的代码在最新版本（1.20）运行时可能会触发一堆warning，是由mnist.py这个负责下载MNIST数据集的脚本引发的，忽略就行。

参考资料

深度学习框架Tensorflow学习与应用

TensorFlow笔记（1）非线性回归、MNIST手写数字识别

程序

思路

MNIST手写数字识别（BP网络）

程序

参考资料

猜你喜欢