mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-12-24 13:28:13 +08:00

Files

chenjian 595ca69251 [Doc] Add doc for vdl serving (#1110 )

* add doc for vdl serving

* add doc for vdl serving

* add doc for vdl serving

* fix link

* fix link

* fix gif size

* fix gif size

* add english version

* fix links

* fix links

* update format

* update docs

* update docs

* update docs

* update docs

* update docs

* update docs

---------

Co-authored-by: heliqi <1101791222@qq.com>

2023-01-30 19:22:59 +08:00

2.4 KiB

Raw Blame History

简体中文 | English

FastDeploy Serving Deployment

Introduction

FastDeploy builds an end-to-end serving deployment based on Triton Inference Server. The underlying backend uses the FastDeploy high-performance Runtime module and integrates the FastDeploy pre- and post-processing modules to achieve end-to-end serving deployment. It can achieve fast deployment with easy-to-use process and excellent performance.

Prepare the environment

Environment requirements

Linux
If using a GPU image, NVIDIA Driver >= 470 is required (for older Tesla architecture GPUs, such as T4, the NVIDIA Driver can be 418.40+, 440.33+, 450.51+, 460.27+)

Obtain Image

CPU Image

CPU images only support Paddle/ONNX models for serving deployment on CPUs, and supported inference backends include OpenVINO, Paddle Inference, and ONNX Runtime

docker pull registry.baidubce.com/paddlepaddle/fastdeploy:1.0.2-cpu-only-21.10

GPU Image

GPU images support Paddle/ONNX models for serving deployment on GPU and CPU, and supported inference backends including OpenVINO, TensorRT, Paddle Inference, and ONNX Runtime

docker pull registry.baidubce.com/paddlepaddle/fastdeploy:1.0.2-gpu-cuda11.4-trt8.4-21.10

Users can also compile the image by themselves according to their own needs, referring to the following documents:

FastDeploy Serving Deployment Image Compilation

Task	Model
Classification	PaddleClas
Detection	PaddleDetection
Detection	ultralytics/YOLOv5
NLP	PaddleNLP/ERNIE-3.0
NLP	PaddleNLP/UIE
Speech	PaddleSpeech/PP-TTS
OCR	PaddleOCR/PP-OCRv3

2.4 KiB

Raw Blame History

FastDeploy Serving Deployment

Introduction

Prepare the environment

Environment requirements

Obtain Image

CPU Image

GPU Image

Other Tutorials

Serving Deployment Demo

2.4 KiB Raw Blame History

FastDeploy Serving Deployment

Introduction

Prepare the environment

Environment requirements

Obtain Image

CPU Image

GPU Image

Other Tutorials

Serving Deployment Demo

2.4 KiB

Raw Blame History