神经网络加速基础知识

神经网络加速基础知识

执行过程

image-20230113203422048

image-20230113203441770

image-20230113203508271

image-20230113203537998

image-20230113203559003

image-20230113203613299

image-20230113203841681

指令执行时间

image-20230113203940431

处理器

image-20230113204215702

image-20230113204225349

image-20230113204312842

image-20230113204324870

image-20230113204414694

image-20230113204431226

image-20230113204502159

image-20230113204544646

image-20230113204606365

image-20230113204722216

image-20230113204813737

image-20230113204835472

image-20230113204947865

image-20230113205032249

image-20230113205053034

image-20230113205206966

算力瓶颈计算代码

import torch
import torchvision
from tqdm import tqdm

DEVICE = "cuda:0"

model = torchvision.models.mobilenet_v2(pretrained=True)
model = model.to(DEVICE)

with torch.no_grad():
    data = torch.rand(size=[1,3,224,224])
    for i in tqdm(range(1024)):
        o = model.forward(data.to(DEVICE))

    data = torch.rand(size=[128,3,224,224])

    for i in tqdm(range(128)):
        o = model.forward(data.to(DEVICE))

image-20230113210152829

猜你喜欢

转载自blog.csdn.net/charles_zhang_/article/details/128698147