mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-10-05 16:48:03 +08:00

Files

charl-u cbf88a46fa [Doc]Update English version of some documents (#1083 )

* 第一次提交

* 补充一处漏翻译

* deleted:    docs/en/quantize.md

* Update one translation

* Update en version

* Update one translation in code

* Standardize one writing

* Standardize one writing

* Update some en version

* Fix a grammer problem

* Update en version for api/vision result

* Merge branch 'develop' of https://github.com/charl-u/FastDeploy into develop

* Checkout the link in README in vision_results/ to the en documents

* Modify a title

* Add link to serving/docs/

* Finish translation of demo.md

* Update english version of serving/docs/

* Update title of readme

* Update some links

* Modify a title

* Update some links

* Update en version of java android README

* Modify some titles

* Modify some titles

* Modify some titles

* modify article to document

* update some english version of documents in examples

* Add english version of documents in examples/visions

* Sync to current branch

* Add english version of documents in examples

* Add english version of documents in examples

* Add english version of documents in examples

* Update some documents in examples

* Update some documents in examples

* Update some documents in examples

* Update some documents in examples

* Update some documents in examples

* Update some documents in examples

* Update some documents in examples

* Update some documents in examples

* Update some documents in examples

2023-01-09 10:08:19 +08:00

cpp

[Doc]Update English version of some documents (#1083 )

2023-01-09 10:08:19 +08:00

python

[Doc]Update English version of some documents (#1083 )

2023-01-09 10:08:19 +08:00

README_CN.md

[Doc]Add English version of documents in examples (#1070 )

2023-01-06 09:34:28 +08:00

README.md

[Doc]Add English version of documents in examples (#1070 )

2023-01-06 09:34:28 +08:00

README.md

English | 简体中文

YOLOv5 Quantized Model Deployment

FastDeploy supports the deployment of quantized models and provides a one-click model quantization tool. Users can use the one-click model quantization tool to quantize and deploy the models themselves or download the quantized models provided by FastDeploy directly for deployment.

FastDeploy One-Click Model Quantization Tool

FastDeploy provides a one-click quantization tool that allows users to quantize a model simply with a configuration file. For a detailed tutorial, please refer to: One-Click Model Quantization Tool

Download Quantized YOLOv5s Model

Users can also directly download the quantized models in the table below for deployment.

Model	Inference Backend	Hardware	FP32 Inference Time Delay	INT8 Inference Time Delay	Acceleration ratio	FP32 mAP	INT8 mAP	Method
YOLOv5s	TensorRT	GPU	8.79	5.17	1.70	37.6	36.6	Quantized distillation training
YOLOv5s	Paddle Inference	CPU	217.05	133.31	1.63	37.6	36.8	Quantized distillation training

The data in the above table shows the end-to-end inference performance of FastDeploy deployment before and after model quantization.

The test images are from COCO val2017.
The inference time delay is the inference latency on different Runtime in milliseconds.
CPU is Intel(R) Xeon(R) Gold 6271C, GPU is Tesla T4, TensorRT version 8.4.15, and the fixed CPU thread is 1 for all tests.

README.md

YOLOv5 Quantized Model Deployment

FastDeploy One-Click Model Quantization Tool

Download Quantized YOLOv5s Model

More Detailed Tutorials