RuntimeError: CUDA out of memory在不减小batch_size的前提下的解决方案

企业开发 2023-08-18 18:40:06 阅读次数: 0

解决方案一

参考文章：一文解决 RuntimeError: CUDA out of memory. 全网最全_辞与不羡的博客-CSDN博客
由于需要使用较大的batch size，所以使用第五个解决方法

sudo gedit ~/.bashrc

在文件最末尾添加一行，其中max_split_size_mb可以根据报错提示设置的稍微大一些

export PYTORCH_CUDA_ALLOC_CONF=max_split_size_mb:64

之后激活环境变量

source ~/.bashrc

最后重启程序或脚本

解决方案二

参考文章：Gradient Accumulation in PyTorch | Nikita Kozodoi
同样是为了能够使用较大的batch size：Gradient Accumulation
引用原文的示例代码如下，主要思想就是将一个batch分成好几个小的batch，每个小的batch计算完梯度之后不更新参数，只是叠加在一起，当一个batch中的全部的小的batch计算完毕之后再更新网络参数，这样最终的batch size等于accum_iter * origin_batch_size。也就是说假入设置batch_size为64，会爆显存，但是设置为16就不会，那么就可以设置accum_iter为4，batch_size为16，达到一样的效果。

# batch accumulation parameter
accum_iter = 4  

# loop through enumaretad batches
for batch_idx, (inputs, labels) in enumerate(data_loader):

    # extract inputs and labels
    inputs = inputs.to(device)
    labels = labels.to(device)

    # passes and weights update
    with torch.set_grad_enabled(True):
        
        # forward pass 
        preds = model(inputs)
        loss  = criterion(preds, labels)

        # normalize loss to account for batch accumulation
        loss = loss / accum_iter 

        # backward pass
        loss.backward()

        # weights update
        if ((batch_idx + 1) % accum_iter == 0) or (batch_idx + 1 == len(data_loader)):
            optimizer.step()
            optimizer.zero_grad()

猜你喜欢

转载自blog.csdn.net/m0_46749624/article/details/127705998

RuntimeError: CUDA out of memory在不减小batch_size的前提下的解决方案

RuntimeError: CUDA out of memory

RuntimeError: CUDA error: out of memory

解决报错RuntimeError: CUDA out of memory

RuntimeError: CUDA out of memory 已解决

RuntimeError: CUDA out of memory. 错误日志

显存充足 RuntimeError: CUDA error: out of memory

全网最全RuntimeError: CUDA error: out of memory解决方法

一步真实解决RuntimeError: CUDA out of memory.

如何解决“RuntimeError: CUDA Out of memory”问题

RuntimeError: CUDA error: out of memoryCUDA

Pytorch——代码导致的异常报错：RuntimeError: CUDA out of memory.

【RuntimeError: CUDA error: out of memory 实测有用】

训练yolov5时出现RuntimeError: CUDA out of memory

RuntimeError: CUDA out of memory See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF

pytorch与cuda版本不对应导致RuntimeError: CUDA error: out of memory

【兼容调试】pytorch出现RuntimeError: CUDA out of memory时的一些解决方法

解决RuntimeError: CUDA out of memory. Tried to allocate 236.00 MiB (GPU 0； 11.00

大概率（5重方法）解决RuntimeError: CUDA out of memory. Tried to allocate ... MiB

pytorch解决RuntimeError: CUDA out of memory. Tried to allocate 20.00 MiB (GPU 0； 4.00 G

如何解决常见问题：“RuntimeError: CUDA Out of memory”

RuntimeError: CUDA out of memory. Tried to allocate 352.00 MiB (GPU 0; 7.80 GiB total capacity; 6.45

RuntimeError: CUDA out of memory. Tried to allocate 98.00 MiB (GPU 0; 5.79 GiB total capacity; 4.75

RuntimeError: CUDA out of memory. Tried to allocate 132.00 MiB (GPU 2; 3.95 GiB total capacity; 3.41

RuntimeError: CUDA out of memory. Tried to allocate 320.00 MiB (GPU 0； 10.92 GiB total capacity； 9.9

RuntimeError: CUDA out of memory. Tried to allocate 2.08 GiB (GPU 0； 4.00 GiB total capacity； 274.14

AI绘画——使用stable-diffusion生成图片时提示RuntimeError: CUDA out of memory处理方法

RuntimeError: CUDA runtime implicit initialization on GPU:0 failed. Status: out of memory

报错`RuntimeError: CUDA out of memory. Tried to allocate 256.00 MiB (GPU 0； 9.78 GiB total capaci

RuntimeError: CUDA out of memory. Tried to allocate 22.00 MiB (GPU 0； 10.76 GiB total capacity； 9.61

今日推荐

中国码农的“35岁魔咒”

蘭雅 CorelDRAW 插件 2024.5.1 国际劳动节版，免费下载

Arc Browser for Windows 1.0 正式 GA

90后程序员开发视频搬运软件、不到一年获利超 700 万，结局很刑！

《美国对全球网络空间安全与发展的威胁和破坏》报告发布

周排行

[编程题]学英语

[codeforces 1288A] Deadline 约数+模

Python的web开发

Docker在Centos 7上的部署

python编码

解决Ubuntu16.04 fatal error: json/json.h: No such file or directory

mysql并发插入

rest接口如何适应jsonp的方案

linux 终端上网设置

高数——等号两边同时求导、积分的解释

每日归档

更多

2024-05-04(7)

2024-05-03(19)

2024-05-02(0)

2024-05-01(4)

2024-04-30(1)

2024-04-29(40)

2024-04-28(0)

2024-04-27(56)

2024-04-26(39)

2024-04-25(22)