mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-10-04 16:22:57 +08:00

Files

charl-u cbf88a46fa [Doc]Update English version of some documents (#1083 )

* 第一次提交

* 补充一处漏翻译

* deleted:    docs/en/quantize.md

* Update one translation

* Update en version

* Update one translation in code

* Standardize one writing

* Standardize one writing

* Update some en version

* Fix a grammer problem

* Update en version for api/vision result

* Merge branch 'develop' of https://github.com/charl-u/FastDeploy into develop

* Checkout the link in README in vision_results/ to the en documents

* Modify a title

* Add link to serving/docs/

* Finish translation of demo.md

* Update english version of serving/docs/

* Update title of readme

* Update some links

* Modify a title

* Update some links

* Update en version of java android README

* Modify some titles

* Modify some titles

* Modify some titles

* modify article to document

* update some english version of documents in examples

* Add english version of documents in examples/visions

* Sync to current branch

* Add english version of documents in examples

* Add english version of documents in examples

* Add english version of documents in examples

* Update some documents in examples

* Update some documents in examples

* Update some documents in examples

* Update some documents in examples

* Update some documents in examples

* Update some documents in examples

* Update some documents in examples

* Update some documents in examples

* Update some documents in examples

2023-01-09 10:08:19 +08:00

2.4 KiB

Executable File

Raw Blame History

English | 简体中文

PaddleClas Quantitative Model C++ Deployment Example

infer.cc in this directory can help you quickly complete the inference acceleration of PaddleClas quantization model deployment on CPU/GPU.

Deployment Preparations

FastDeploy Environment Preparations

1. For the software and hardware requirements, please refer to FastDeploy Environment Requirements.
1. For the installation of FastDeploy Python whl package, please refer to FastDeploy Python Installation.

Quantized Model Preparations

1. You can directly use the quantized model provided by FastDeploy for deployment.
1. You can use one-click automatical compression tool provided by FastDeploy to quantize model by yourself, and use the generated quantized model for deployment.(Note: The quantized classification model still needs the inference_cls.yaml file in the FP32 model folder. Self-quantized model folder does not contain this yaml file, you can copy it from the FP32 model folder to the quantized model folder.)

Take the Quantized PP-YOLOE-l Model as an example for Deployment, FastDeploy version 0.7.0 or higher is required (x.x.x>=0.7.0)

Run the following commands in this directory to compile and deploy the quantized model.

mkdir build
cd build
# Download pre-compiled FastDeploy libraries. You can choose the appropriate version from `pre-compiled FastDeploy libraries` mentioned above.
wget https://bj.bcebos.com/fastdeploy/release/cpp/fastdeploy-linux-x64-x.x.x.tgz
tar xvf fastdeploy-linux-x64-x.x.x.tgz
cmake .. -DFASTDEPLOY_INSTALL_DIR=${PWD}/fastdeploy-linux-x64-x.x.x
make -j

# Download the ResNet50_Vd quantized model and test images provided by FastDeloy. 
wget https://bj.bcebos.com/paddlehub/fastdeploy/resnet50_vd_ptq.tar
tar -xvf resnet50_vd_ptq.tar
wget https://gitee.com/paddlepaddle/PaddleClas/raw/release/2.4/deploy/images/ImageNet/ILSVRC2012_val_00000010.jpeg


# Use ONNX Runtime inference quantization model on CPU.
./infer_demo resnet50_vd_ptq ILSVRC2012_val_00000010.jpeg 0
# Use TensorRT inference quantization model on GPU.
./infer_demo resnet50_vd_ptq ILSVRC2012_val_00000010.jpeg 1
# Use Paddle-TensorRT inference quantization model on GPU.
./infer_demo resnet50_vd_ptq ILSVRC2012_val_00000010.jpeg 2

2.4 KiB Executable File Raw Blame History