Linear regression is a classic problem in mathematics. Solutions broadly fall into two categories: 1. the least squares method; 2. the gradient descent algorithm. Here, we use gradient descent to solve the simplest linear regression problem.
1. Build a data sample
Here we use a simulated data set of 200 samples. Each sample follows a linear function plus random noise:
y = w * x + b + noise (in my samples, w = 1.8 and b = 5, and noise is drawn uniformly from [-4, 4])
I won't belabor this part; here is the code:
from __future__ import print_function
import time
import random

start = time.perf_counter()

# Create the data file
fp = open("test.txt", "w")
for i in range(10):
    for numx in range(-10, 10):
        numb = random.uniform(-4, 4) + 5  # the constant term b plus random noise
        numy = 1.8 * numx + numb
        fp.write(str(numx) + ' ' + str(numy) + '\n')
fp.close()

end = time.perf_counter()
print("Elapsed time:", end - start)
The test.txt generated here is the original data sample we will use.
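As an aside, the least squares method mentioned at the start has a closed-form solution for this data, which makes a handy reference for the gradient descent result below. A minimal sketch using NumPy's np.polyfit (this check is my addition, not part of the original post):

import numpy as np

points = np.genfromtxt("test.txt", delimiter=" ")
# degree-1 polyfit returns [slope, intercept], i.e. [w, b]
w, b = np.polyfit(points[:, 0], points[:, 1], 1)
print("least squares: w =", w, ", b =", b)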
2. Use gradient descent on the original data to find the fitting function
I will fill in the detailed mathematical derivation later; it is too late tonight...
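In the meantime, here is a quick sketch of the loss and the gradients that the code below implements. For a line y = wx + b over N sample points, the mean squared error and its partial derivatives are:

L(b, w) = \frac{1}{N} \sum_{i=1}^{N} \left( y_i - (w x_i + b) \right)^2

\frac{\partial L}{\partial b} = -\frac{2}{N} \sum_{i=1}^{N} \left( y_i - (w x_i + b) \right), \qquad \frac{\partial L}{\partial w} = -\frac{2}{N} \sum_{i=1}^{N} x_i \left( y_i - (w x_i + b) \right)

Each step moves b and w against these gradients, scaled by the learning rate.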
Now, the code:
from __future__ import print_function
import time
import numpy as np

start = time.perf_counter()

# Mean squared error over all points
def compute_error_for_line_given_points(b, w, points):
    totalError = 0
    for i in range(0, len(points)):
        x = points[i, 0]
        y = points[i, 1]
        totalError += (y - (w * x + b)) ** 2
    return totalError / float(len(points))

# One gradient descent step
def step_gradient(b_current, w_current, points, learningRate):
    b_gradient = 0
    w_gradient = 0
    N = float(len(points))
    for i in range(0, len(points)):
        x = points[i, 0]
        y = points[i, 1]
        b_gradient += -(2/N) * (y - ((w_current * x) + b_current))
        w_gradient += -(2/N) * x * (y - ((w_current * x) + b_current))
    new_b = b_current - (learningRate * b_gradient)
    new_w = w_current - (learningRate * w_gradient)
    return [new_b, new_w]

# Run the iterations
def gradient_descent_runner(points, starting_b, starting_w, learning_rate, num_iterations):
    b = starting_b
    w = starting_w
    points = np.array(points)
    for i in range(num_iterations):
        b, w = step_gradient(b, w, points, learning_rate)
    return [b, w]

def run():
    points = np.genfromtxt("test.txt", delimiter=" ")
    learning_rate = 0.0005
    initial_b = 0
    initial_w = 0
    num_iterations = 10000
    print("Starting gradient descent at b = {0}, w = {1}, error = {2}"
          .format(initial_b, initial_w, compute_error_for_line_given_points(initial_b, initial_w, points)))
    print("Running")
    [b, w] = gradient_descent_runner(points, initial_b, initial_w, learning_rate, num_iterations)
    print("After {0} iterations b = {1} w = {2}, error = {3}"
          .format(num_iterations, b, w, compute_error_for_line_given_points(b, w, points)))

if __name__ == '__main__':
    run()

end = time.perf_counter()
print("Elapsed time:", end - start)
We can read the result directly from the output: after 10,000 iterations we get b = 5.076 and w = 1.795, so the fitted line is y = 1.795 * x + 5.076, while the original function is y = 1.8 * x + 5. We can see the fitting result intuitively from the figure:
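(The original figure is not reproduced here. A minimal matplotlib sketch to recreate it, plugging in the fitted values printed above; the plotting code is my addition:)

import numpy as np
import matplotlib.pyplot as plt

points = np.genfromtxt("test.txt", delimiter=" ")
x = points[:, 0]
xs = np.array([x.min(), x.max()])
plt.scatter(x, points[:, 1], s=10, label="samples")
plt.plot(xs, 1.795 * xs + 5.076, color="red", label="fit: y = 1.795x + 5.076")
plt.plot(xs, 1.8 * xs + 5, color="green", linestyle="--", label="true: y = 1.8x + 5")
plt.legend()
plt.show()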
From the figure we can see intuitively that the line found by gradient descent fits the data very well. That is the gradient descent approach I wanted to share, applied to a simple linear regression problem.