Files
FastDeploy/serving/README_EN.md
heliqi 2f73857b2b [Serving]Update images version to 0.6.0 (#594)
* serving images update to 0.6

* serving images update to 0.6

* serving images update to 0.6
2022-11-16 10:11:43 +08:00

53 lines
2.1 KiB
Markdown

[简体中文](README_CN.md) | English
# FastDeploy Serving Deployment
## Introduction
FastDeploy builds an end-to-end serving deployment based on [Triton Inference Server](https://github.com/triton-inference-server/server). The underlying backend uses the FastDeploy high-performance Runtime module and integrates the FastDeploy pre- and post-processing modules to achieve end-to-end serving deployment. It can achieve fast deployment with easy-to-use process and excellent performance.
## Prepare the environment
### Environment requirements
- Linux
- If using a GPU image, NVIDIA Driver >= 470 is required (for older Tesla architecture GPUs, such as T4, the NVIDIA Driver can be 418.40+, 440.33+, 450.51+, 460.27+)
### Obtain Image
#### CPU Image
CPU images only support Paddle/ONNX models for serving deployment on CPUs, and supported inference backends include OpenVINO, Paddle Inference, and ONNX Runtime
```shell
docker pull paddlepaddle/fastdeploy:0.6.0-cpu-only-21.10
```
#### GPU Image
GPU images support Paddle/ONNX models for serving deployment on GPU and CPU, and supported inference backends including OpenVINO, TensorRT, Paddle Inference, and ONNX Runtime
```
docker pull paddlepaddle/fastdeploy:0.6.0-gpu-cuda11.4-trt8.4-21.10
```
Users can also compile the image by themselves according to their own needs, referring to the following documents:
- [FastDeploy Serving Deployment Image Compilation](docs/zh_CN/compile.md)
## Other Tutorials
- [How to Prepare Serving Model Repository](docs/zh_CN/model_repository.md)
- [Serving Deployment Configuration for Runtime](docs/zh_CN/model_configuration.md)
- [Serving Deployment Demo](docs/zh_CN/demo.md)
- [YOLOV5 - Detection Task](../examples/vision/detection/yolov5/serving/README.md)
### Model List
| Task | Model |
|---|---|
| Classification | [PaddleClas](../examples/vision/classification/paddleclas/serving/README.md) |
| Detection | [ultralytics/YOLOv5](../examples/vision/detection/yolov5/serving/README.md) |
| NLP | [PaddleNLP/ERNIE-3.0](../examples/text/ernie-3.0/serving/README.md)|
| Speech | [PaddleSpeech/PP-TTS](../examples/audio/pp-tts/serving/README.md)|