Model inference with onnxruntime-gpu

1. Install onnxruntime-gpu

The onnxruntime-gpu package supports both GPU inference and CPU inference.

Uninstall the old CPU-only version (1.7.1 here) and install the GPU version:

pip uninstall onnxruntime
pip install onnxruntime-gpu
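
If you need a specific release, for example to match your CUDA installation, you can pin the version explicitly; 1.10.0 is the version used in the check below:

pip install onnxruntime-gpu==1.10.0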

Check if the installation was successful:

>>> import onnxruntime
>>> onnxruntime.__version__
'1.10.0'
>>> onnxruntime.get_device()
'GPU'
>>> onnxruntime.get_available_providers()
['TensorrtExecutionProvider', 'CUDAExecutionProvider', 'CPUExecutionProvider']

2. Modify the inference code

Add the providers parameter to the inference code to select the execution provider. List only the providers your environment actually supports.

session = onnxruntime.InferenceSession('yolov5s.onnx', None)
# Change to:
session = onnxruntime.InferenceSession('yolov5s.onnx', 
        providers=['TensorrtExecutionProvider', 'CUDAExecutionProvider', 'CPUExecutionProvider'])
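
If you are not sure which providers your build offers, a simple pattern is to take the intersection of the providers you prefer and the ones that are actually available. The snippet below is a minimal sketch of that idea, reusing the 'yolov5s.onnx' model from the example above:

import onnxruntime

# Keep only the providers that this onnxruntime build actually offers,
# in order of preference (TensorRT, then CUDA, then CPU).
preferred = ['TensorrtExecutionProvider', 'CUDAExecutionProvider', 'CPUExecutionProvider']
available = onnxruntime.get_available_providers()
providers = [p for p in preferred if p in available]

session = onnxruntime.InferenceSession('yolov5s.onnx', providers=providers)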

If the TensorRT and CUDA providers fail to load when you run the inference code, with a warning like the one below, your ONNX Runtime, TensorRT, and CUDA versions do not match each other.

2022-08-09 15:38:31.386436528 [W:onnxruntime:Default, onnxruntime_pybind_state.cc:509 CreateExecutionProviderInstance] Failed to create TensorrtExecutionProvider. Please reference https://onnxruntime.ai/docs/execution-providers/TensorRT-ExecutionProvider.html#requirements to ensure all dependencies are met.

The matching version combinations are listed on the TensorRT execution provider requirements page referenced in the warning above.
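
To confirm that a GPU provider was actually created, rather than silently falling back to the CPU provider after a warning like the one above, you can check which providers the session ended up with. This is a small sketch using the same session object as before:

# get_providers() returns the providers that were successfully created,
# in the order they will be used for this session.
print(session.get_providers())
# e.g. ['CUDAExecutionProvider', 'CPUExecutionProvider'] if TensorRT could not be loaded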


Source: blog.csdn.net/u012505617/article/details/126249243