简体中文 | English

FastDeploy Serving Deployment

Introduction

FastDeploy provides end-to-end serving deployment built on Triton Inference Server. The underlying backend uses the high-performance FastDeploy Runtime and integrates FastDeploy's pre- and post-processing modules, so models can be served end to end with an easy deployment process and excellent performance.
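
Since the serving stack is Triton-based, deployed models are organized in a Triton-style model repository. The layout below is a minimal sketch following standard Triton conventions; the model name ppocr and the file names are hypothetical:

models/
└── ppocr/               # one directory per deployed model (hypothetical name)
    ├── config.pbtxt     # Triton model configuration
    └── 1/               # numeric version directory
        └── model.onnx   # model file loaded by the inference backend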

Prepare the Environment

Environment Requirements

  • Linux
  • If using a GPU image, NVIDIA Driver >= 470 is required (for older Tesla-architecture GPUs such as the T4, NVIDIA Driver 418.40+, 440.33+, 450.51+, or 460.27+ also works); you can verify the installed driver version as shown below
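
A quick way to check the installed driver version (assuming nvidia-smi is available on the host):

nvidia-smi --query-gpu=driver_version --format=csv,noheader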

Obtain Image

CPU Image

The CPU image supports serving Paddle/ONNX models on CPU only; the supported inference backends are OpenVINO, Paddle Inference, and ONNX Runtime.

docker pull paddlepaddle/fastdeploy:0.6.0-cpu-only-21.10
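
After pulling the image, the server can be started against a local model repository. This is a minimal sketch: /path/to/model_repository is a placeholder, and fastdeployserver is assumed to be the serving launcher shipped inside the FastDeploy serving images:

docker run -it --rm --net=host \
  -v /path/to/model_repository:/models \
  paddlepaddle/fastdeploy:0.6.0-cpu-only-21.10 \
  fastdeployserver --model-repository=/models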

GPU Image

The GPU image supports serving Paddle/ONNX models on both GPU and CPU; the supported inference backends include OpenVINO, TensorRT, Paddle Inference, and ONNX Runtime.

docker pull paddlepaddle/fastdeploy:0.6.0-gpu-cuda11.4-trt8.4-21.10
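
To expose GPUs to the container, the run command needs the NVIDIA Container Toolkit and a --gpus flag. A minimal sketch, with the same placeholder model repository and the assumed fastdeployserver launcher as above:

docker run -it --rm --net=host --gpus all \
  -v /path/to/model_repository:/models \
  paddlepaddle/fastdeploy:0.6.0-gpu-cuda11.4-trt8.4-21.10 \
  fastdeployserver --model-repository=/models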

Users can also build the image themselves according to their own needs; refer to the tutorials below.

Other Tutorials

Serving Deployment Demo

| Task           | Model               |
|----------------|---------------------|
| Classification | PaddleClas          |
| Detection      | PaddleDetection     |
| Detection      | ultralytics/YOLOv5  |
| NLP            | PaddleNLP/ERNIE-3.0 |
| NLP            | PaddleNLP/UIE       |
| Speech         | PaddleSpeech/PP-TTS |
| OCR            | PaddleOCR/PP-OCRv3  |
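
Whichever demo you run, a quick way to confirm the server is up is Triton's standard HTTP readiness endpoint (8000 is Triton's default HTTP port; adjust it if you remapped ports):

curl -v http://localhost:8000/v2/health/ready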