English | 简体中文
# Python Inference
Before running the demo, confirm the following two prerequisites:

- The hardware and software environment meets the requirements. Please refer to Environment Requirements for FastDeploy.
- The FastDeploy Python wheel package is installed. Please refer to FastDeploy Python Installation.
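A quick way to confirm the wheel is usable is importing it from Python. A minimal check (the `__version__` attribute is an assumption; most wheels expose it, but consult your installed package if it differs):

```python
# Sanity check: the import succeeding means the wheel is installed correctly.
import fastdeploy as fd
print(fd.__version__)  # assumed attribute; prints the installed FastDeploy version
```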
This document demonstrates CPU inference in Python, using the PaddleClas classification model MobileNetV2 as the example.
## 1. Obtaining the Model
```python
import fastdeploy as fd

# Download the MobileNetV2 Paddle inference model and extract it into ./
model_url = "https://bj.bcebos.com/fastdeploy/models/mobilenetv2.tgz"
fd.download_and_decompress(model_url, path=".")
```
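After decompression, the model files are placed under `./mobilenetv2`. A minimal sanity check (the file names below are the same ones passed to `set_model_path()` in the next step):

```python
import os

# Verify that both Paddle inference files were extracted as expected.
for name in ("inference.pdmodel", "inference.pdiparams"):
    path = os.path.join("mobilenetv2", name)
    assert os.path.exists(path), f"missing model file: {path}"
```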
## 2. Backend Configuration
```python
option = fd.RuntimeOption()
option.set_model_path("mobilenetv2/inference.pdmodel",
                      "mobilenetv2/inference.pdiparams")

# **** CPU Configuration ****
option.use_cpu()
option.use_ort_backend()
option.set_cpu_thread_num(12)
```
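The device and backend are chosen independently on the same `RuntimeOption`. As a hedged sketch of the alternatives (each setter is part of the FastDeploy API, but availability depends on how your package was built):

```python
# Alternative configurations -- uncomment one; requires the matching
# backend to be compiled into your FastDeploy package.
# option.use_paddle_backend()    # Paddle Inference backend
# option.use_openvino_backend()  # OpenVINO backend (CPU)
# option.use_gpu(0)              # run on GPU device 0 instead of CPU
# option.use_trt_backend()       # TensorRT backend (GPU only)
```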
```python
import numpy as np  # needed to construct the random input below

# Initialise the runtime from the configured options
runtime = fd.Runtime(option)

# Get the model's input name
input_name = runtime.get_input_info(0).name

# Construct random data and run inference
results = runtime.infer({
    input_name: np.random.rand(1, 3, 224, 224).astype("float32")
})

print(results[0].shape)
```
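For an ImageNet classifier such as MobileNetV2, the output is typically a `(1, 1000)` score tensor. A minimal sketch of reading the top prediction, assuming that output layout:

```python
# Take the arg-max over the class dimension to get the top-1 prediction.
scores = results[0]                       # assumed shape: (1, 1000)
top1 = int(np.argmax(scores, axis=1)[0])
print(f"top-1 class id: {top1}, score: {scores[0, top1]:.4f}")
```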
When loading completes, output like the following is printed, indicating which backend was initialized and on which hardware device:

```
[INFO] fastdeploy/fastdeploy_runtime.cc(283)::Init Runtime initialized with Backend::OrtBackend in device Device::CPU.
```
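The runtime also exposes tensor metadata, which is useful for debugging shape mismatches. A small sketch using the same `get_input_info()` call as above (the `shape` and `dtype` fields are assumptions based on FastDeploy's `TensorInfo` structure):

```python
# Inspect the model's first input tensor: name, static shape, and data type.
info = runtime.get_input_info(0)
print(info.name, info.shape, info.dtype)
```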