NCNN的INT8量化使用方式 - Code World

NCNN的INT8量化使用方式

Enterprise 2022-01-15 08:11:34 views: null

编译NCNN

mkdir build && cd build && cmake ../

进入到build/tools/darknet目录，将来源于darknet的模型文件和权重文件拷贝一份到这里：

转换过程如下：

优化：

./ncnnoptimize /home/czl/ncnn/ncnn/build/tools/darknet/ncnn.param /home/czl/ncnn/ncnn/build/tools/darknet/ncnn.bin yolov4-tiny-opt.param yolov4-tiny-opt.bin 0

生成的优化过的模型如下：

检测实战，未优化的模型推理结果：

优化后的模型推理结果：

下载量化校准表图片

下载官方给出的1000张ImageNet图像，很多同学没有梯子，下载慢，可以用下这个链接：

imagenet-sample-images-master.zip_yolov4ncnn-深度学习文档类资源-CSDN下载ncnn量化int所需的校准图像yolov4ncnn更多下载资源、学习资料请访问CSDN下载频道.https://download.csdn.net/download/weixin_45829462/18704213

图片内容包括：

进入到目录build/tools/quantize，创建images文件夹，之后将imagenet图片全部拷贝到此目录

之后，执行命令 find images/ -type f >imagelist.txt，创建图像文件列表。

之后执行命令：

./ncnn2table yolov4-tiny-opt.param yolov4-tiny-opt.bin imagelist.txt yolov4-tiny.table mean=[104,117,123] norm=[0.017,0.017,0.017] shape=[224,224,3] pixel=BGR thread=8 method=kl

量化模型：

./ncnn2int8 yolov4-tiny-opt.param yolov4-tiny-opt.bin yolov4-tiny-int8.param yolov4-tiny-int8.bin yolov4-tiny.table

生成的yolov4-tiny-int8.bin即为量化后的权重文件，可以看到它的大小是量化前的四分之一，这就是量化的一个优势，可以减小内存使用量。

结束！

Guess you like

Origin blog.csdn.net/tugouxp/article/details/122489836

NCNN的INT8量化使用方式

NCNN的INT8量化使用方式

NCNN的INT8量化使用方式

NCNN的INT8量化使用方式

NCNN的INT8量化使用方式

NCNN的INT8量化使用方式

NCNN的INT8量化使用方式

int8 quantify

openvino量化自己训练的yolov3模型至int8(有成功验证截图)

TensorRT + int8 official forum interesting discussion summary

TensorRT INT8 quantization principle and implementation (very detailed)

int8, FLOPS, FLOPs, TOPS and other specific meanings

FP32, FP16 and INT8

Performance optimization of Int8 quantization operator in mobile CPU

PostgreSQL does not use extensions, generates random int8 values, generates uniformly distributed random int8 values

NCNN quantification of ncnn2table and ncnn2int8

The difference between int, int8, int16, int32, int64 and uint in Golang

Neural network model quantification technology for large AI models: INT8 or INT4?

OpenCV-- read image data types must be int8 type?

Sparsity in INT8: Accelerated Training Workflows and NVIDIA TensorRT Best Practices

OpenVINO 2022.3 combat 4: POT API realizes INT8 quantization of YOLOv5 model

OpenVINO 2022.3 combat three: POT API realizes image classification model INT8 quantization

AI model deployment-Python implementation of INT8 quantification of TensorRT model

Using OpenVINO to implement RT-DETR model INT8 quantitative inference acceleration

int8 quantify

int8 Quantify

INT8 cuantificar

int8 quantifier

int8 quantificar

TensorRT + int8 Официальный форум интересное обсуждение резюме

Recommended

Ranking

How to make the url of the picture have the download attribute and force the download. Jump browser to download automatically.

Lr CC 8.3 Installation Guide

MD5Utils (MD5 encryption tools! Unsalted)

Econ 325 (004)

Photo studio photography appointment app based on java SpringBoot and Vue uniapp

What is b

2017 Second Guangdong strong network Cup online game --Nonstandard

myeclipse8.5 add tomact7

mpeg1, mpeg2 and mpeg4 standard comparative analysis and summary

Tencent sub-sub-group 6-color program

Daily

More

2024-04-25(32)

2024-04-24(30)

2024-04-23(30)

2024-04-22(5)

2024-04-21(0)

2024-04-20(6)

2024-04-19(5)

2024-04-18(0)

2024-04-17(31)

2024-04-16(23)