Pytorch 常见报错 RuntimeError: Trying to backward through the graph a second time

其他 2021-02-11 10:23:11 阅读次数: 0

RuntimeError: Trying to backward through the graph a second time, but the buffers have already been freed.

model = RNN()
hn = torch.zeros(1,seq_len,hidden_num)
epochs = 250
clip_value = 100
loss = nn.CrossEntropyLoss()
optimizer = torch.optim.Adam(model.parameters(),lr=0.001)

for epoch in range(epochs):
    accu,num = 0.0,0
    for x,y in data_collect(corpus_indice,batch_size,seq_len):
		
		#-------------------------------------------------------------#
		# 这里添加上一句话即可
		hn.detach_()
		#-------------------------------------------------------------#
        output,hn = model(x,hn)
        y = y.transpose(1,0).contiguous().view(-1)
        ls = loss(output.view(-1,vocab_len),y)
        
        optimizer.zero_grad() 
        ls.backward()
        
        torch.nn.utils.clip_grad_value_(model.parameters(), clip_value)
        
        optimizer.step()
        accu += ls.item() * y.shape[0]
        num += y.shape[0]
    if epoch%50 == 0:
        print("现在是第{}次epoch，loss的值为{}".format(epoch,math.exp(accu/num)))
print("完成")

产生问题的原因(我个人的理解是这样的)：
每一次在读取并计算完一个 $b a t c h$ 的数据之后会有一个 $h_t$ 连接到计算图中，这个 $h_t$ 参与反向传播求梯度，梯度在求完结果之后就释放掉了，当到下一个也就是 $h_{t+1}$ 的时候，在计算梯度时还会经过 $h_t$ （因为这两个在计算图中连着），但是 $h_t$ 的相关信息已经被释放了，所以会产生报错。
解决方法：
- 第一种解决方案：
  hidden.detach_() 在每次读取一个 $b a t c h$ 的数据之后，开始训练之前都要将隐藏层从计算图中 $d e t a c h$ 出来。
- 第二种解决方案：
  将 loss.backward() 替换成 loss.backward(retain_graph=True)，这种方法相当于将计算图的全部内容都存储下来，中间不进行清零操作。很明显，一方面占内存比较多，另一方面计算起来会很慢，所以不推荐。

猜你喜欢

转载自blog.csdn.net/weixin_44618906/article/details/107435076

Pytorch 常见报错 RuntimeError: Trying to backward through the graph a second time

【报错】：RuntimeError: Trying to backward through the graph a second time, but the saved intermediate re

RuntimeError: Trying to backward through the graph a second time, but the buffers have already been

【笔记】RuntimeError: Trying to backward through the graph a second time：将无关变量的梯度回传关系撤销

RuntimeError: Trying to backward through the graph a second time (or directly access saved variable

RuntimeError: Trying to backward through the graph a second time, but the buffers have already free

Pytorch :Trying to backward through the graph a second time, but the buffers have already been freed

问题解决：Pytorch :Trying to backward through the graph a second time, but the buffers。。

“RuntimeError: Trying to backward through the graph a second time, but the buffers have already been freed. Specify retain_graph=True when calling backward the first time”

【Error】: Trying to backward through the graph a second time, but the buffers have already been

深度学习：模型训练过程中Trying to backward through the graph a second time解决方案

pytorch 常见报错

pytorch的backward

【Pytorch】backward与backward_hook

Pytorch之——常见报错

RuntimeError: Trying to resize storage that is not resizable

Pytorch之深入backward

Pytorch中backward函数

Pytorch中的backward()函数

Pytorch问题：autograd与backward()

pytorch backward问题

Pytorch-backward()

Pytorch autograd,backward详解

【Pytorch】backward()简单理解

pytorch 多次backward

解决pytorch dataloader报错：Trying to resize storage that is not resizable

The world is going through the second wave of the frenzy of cryptocurrency

Decoding billions of integers per second through vectorization

Pytorch之浅入backward

PyTorch0.4 中 backward()

今日推荐

美国拟限制 AI 大模型出口中国和俄罗斯

苹果将与 OpenAI 达成协议，将 ChatGPT 应用于 iPhone

openKylin 社区生态委员会第六次会议圆满召开

阿里云正式发布通义千问 2.5

Python 3.13 发布首个 Beta：实验性自由线程模式和 JIT、改进交互式解释器

Stack Overflow 拿我的代码去训练 AI 大模型，还封了我的账号

Pop!_OS 的 COSMIC 桌面完成 App Store 上架工作

报告：Django 仍然是 74% 开发者的首选

《2024 年一季度互联网投融资运行情况》研究报告

15 年前上了“FFmpeg 耻辱柱”，今天他还得谢谢咱——腾讯QQPlayer一雪前耻？

TIOBE 5 月榜单：Fortran “复活”进入 Top 10

GCC 14.1 发布

周排行

curl的POST请求，封装方法

8.1.1. Integer Types

Java基础 Day05(个人复习整理)

Python - Django - 中间件 process_exception

小L的试卷

【Shell编程】（函数）判断用户是否存在

python(css样式)

spring ant path 匹配原则 - 【笔记】

《JavaScript与JScript从入门到精通》(美)James.Jaworski.中译本.扫描版.pdf

Eclipse运行带参数的java程序

每日归档

更多

2024-05-12(0)

2024-05-11(38)

2024-05-10(38)

2024-05-09(35)

2024-05-08(42)

2024-05-07(14)

2024-05-06(40)

2024-05-05(0)

2024-05-04(7)

2024-05-03(19)