Commit Graph

121 Commits

Author SHA1 Message Date
kevin
22cab724e8 [Feature] block scheduler v1 support prefix caching (#3061)
* block scheduler v1 support prefix cache

* update code

* update code

* fix code bug

* add timeout time

---------

Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com>
2025-07-31 19:29:19 +08:00
YUNSHEN XIE
583eae2fd1 fix ci (#3106)
* fix ci

* disable test_non_streaming_chat_with_min_tokens
2025-07-31 17:25:08 +08:00
YuanRisheng
acc5c0aa85 add ci for custom op approve (#3079) 2025-07-30 16:50:20 +08:00
bukejiyu
8e203666d9 w4a8 offline (#3074)
* w4a8 offline

* update

* update

* update
2025-07-30 16:33:30 +08:00
Jiang-Jia-Jun
ffa0f4d99b [Fix] Fix version function (#3076)
* [Fix] Fix version function

* Fix commit

* Fix commit

* fix code sync

* Update coverage_run.sh

---------

Co-authored-by: Jiang-Jia-Jun <jiangjiajun@baidu.com>
2025-07-30 16:05:24 +08:00
YUNSHEN XIE
fb410b5f4c Add unit test run and coverage report generation (#3011)
* Add unit test run and coverage report generation

* fix

* fix: upload coverage report failure

* fix

* update

* fix

* fix

* update
2025-07-27 22:48:34 +08:00
EnflameGCU
7634ffb709 [GCU] Add CI (#3006) 2025-07-25 10:59:29 +08:00
liddk1121
17c5d3a241 [Iluvatar GPU] Add CI scripts (#2876) 2025-07-21 09:44:42 +08:00
Zero Rains
25698d56d1 polish code with new pre-commit rule (#2923) 2025-07-19 23:19:27 +08:00
ZhangYulongg
eb77b1be6d update ci cases 2025-07-18 21:44:07 +08:00
LQX
11cfdf5d89 Add XPU CI, test=model (#2701)
* Add XPU CI, test=model

* Add XPU CI, test=model

* Add XPU CI, test=model

* Add XPU CI, test=model

* Add XPU CI, test=model

* Add XPU CI, test=model

* Add XPU CI, test=model

* Add XPU CI, test=model

* Add XPU CI, test=model
2025-07-04 16:16:06 +08:00
gaoziyuan
a5ae88ded9 [feature]add fd whl version info (#2698) 2025-07-04 14:12:42 +08:00
YuBaoku
bb880c8d7c Update CI test cases (#2671)
* set git identity to avoid merge failure in CI

* add ci cases
2025-07-02 15:08:39 +08:00
RichardWooSJTU
85090ed799 remove unuseful scripts (#2652) 2025-07-01 10:18:25 +08:00
YUNSHEN XIE
d5af78945b Add ci (#2650)
* add ci ut and workflow

* Automatically cancel any previous CI runs for the ci.yml workflow, keeping only the latest one active
2025-06-30 20:20:49 +08:00
Jiang-Jia-Jun
92c2cfa2e7 Sync v2.0 version of code to github repo 2025-06-29 23:29:37 +00:00
XieYunshen
0825146538 add ci ut and workflow 2025-06-16 02:18:00 +08:00
jiangjiajun
149c79699d [LLM] First commit the llm deployment code 2025-06-16 00:04:48 +08:00
DefTruth
6a8cd4d759 [Backend] support ort offline graph optimize option (#2268)
* support ort offline graph optimize option

* support ort offline graph optimize option
2023-11-02 09:19:33 +08:00
DefTruth
77ee48f9b8 [Bug Fix] fixed compile error for cpu building (#2105)
* [cmake] upgrade windows paddle inference -> 2.5.0

* [cmake] upgrade windows paddle inference -> 2.5.0

* fix paddle custom ops bug on windows

* [Backend] refactor paddle custom ops

* [Bug Fix] fixed paddle custom ops windows build error

* [Bug Fix] fixed compile error for cpu building
2023-07-13 17:02:25 +08:00
DefTruth
b2426aefa9 [Backend] add paddle custom ops compatible policy (#2070)
* Add centerpoint

* fix postprocess op file name

* [Backend] add paddle custom ops compatible policy

* [Backend] add paddle custom ops compatible policy

* [Backend] add paddle custom ops compatible policy

* upgrade linux paddle gpu -> 2.5

* add custom op compatible policy for paddle 2.5

* add custom op compatible policy for paddle 2.5

* add custom op compatible policy for paddle 2.5

* add collect_trt_shape_by_device option for paddle backend

* add collect_trt_shape_by_device option for paddle backend

* add custom op option for python build

* fix python build bugs

* update paddle linux x86 cpu only lib

* update paddle linux gpu lib

* update patchelf cmake

* fix paddle backend option pybind

* update paddle_inference.cmake

* add cuda sm_80 support (A100)

---------

Co-authored-by: zengshao0622 <peter_z96@163.com>
Co-authored-by: qiuyanjun <qiuyanjun@baidu.com>
2023-06-29 22:32:14 +08:00
DefTruth
ff631b7838 [Server] Support encrypt & auth for FD Server (#2018)
* [Server] Support GPU encrypt & auth for FD Triton Server

* fix dockerfile proxy env error

* update build scripts

* remove some logs

---------

Co-authored-by: root <root@yq02-sys-rpm1206692e6.yq02.baidu.com>
Co-authored-by: qiuyanjun <qiuyanjun@baidu.com>
2023-06-13 20:44:25 +08:00
DefTruth
284b1b46c8 [Serving] Support XPU encrypt & auth server (#2007)
* [patchelf] fix patchelf error for inference xpu

* [serving] add xpu dockerfile and support fd server

* [serving] add xpu dockerfile and support fd server

* [Serving] support XPU + Tritron

* [Serving] support XPU + Tritron

* [Dockerfile] update xpu tritron docker file -> paddle 0.0.0

* [Dockerfile] update xpu tritron docker file -> paddle 0.0.0

* [Dockerfile] update xpu tritron docker file -> paddle 0.0.0

* [Dockerfile] add comments for xpu tritron dockerfile

* [Doruntime] fix xpu infer error

* [Doruntime] fix xpu infer error

* [XPU] update xpu dockerfile

* add xpu triton server docs

* add xpu triton server docs

* add xpu triton server docs

* add xpu triton server docs

* update xpu triton server docs

* update xpu triton server docs

* update xpu triton server docs

* update xpu triton server docs

* update xpu triton server docs

* update xpu triton server docs

* update xpu triton server docs

* update xpu triton server docs

* [XPU] Update XPU L3 Cache setting docs

* [XPU] Add Encryption and AUTH support for XPU Server

* [XPU] Add Encryption and AUTH support for XPU Server

* [Bug Fix] fix paddle reader error

* [Serving] Support XPU encrypt & auth server

* [Serving] Support XPU encrypt & auth server

* [Serving] Support XPU encrypt & auth server

* [Serving] Support XPU encrypt & auth server

* [Triton] switch TAG 22.12 -> TAG 21.10wq

* update xpu auth server script
2023-06-01 21:36:44 +08:00
DefTruth
434b48dda5 [Serving] Support FastDeploy XPU Triton Server (#1994)
* [patchelf] fix patchelf error for inference xpu

* [serving] add xpu dockerfile and support fd server

* [serving] add xpu dockerfile and support fd server

* [Serving] support XPU + Tritron

* [Serving] support XPU + Tritron

* [Dockerfile] update xpu tritron docker file -> paddle 0.0.0

* [Dockerfile] update xpu tritron docker file -> paddle 0.0.0

* [Dockerfile] update xpu tritron docker file -> paddle 0.0.0

* [Dockerfile] add comments for xpu tritron dockerfile

* [Doruntime] fix xpu infer error

* [Doruntime] fix xpu infer error

* [XPU] update xpu dockerfile

* add xpu triton server docs

* add xpu triton server docs

* add xpu triton server docs

* add xpu triton server docs

* update xpu triton server docs

* update xpu triton server docs

* update xpu triton server docs

* update xpu triton server docs

* update xpu triton server docs

* update xpu triton server docs

* update xpu triton server docs

* update xpu triton server docs
2023-05-29 14:38:25 +08:00
DefTruth
49c033a828 [XPU] Support XPU via Paddle Inference backend (#1987)
* [backend] Support XPU via Paddle Inference backend

* [backend] Support XPU via Paddle Inference backend

* [backend] Support XPU via Paddle Inference backend

* [XPU] support XPU benchmark via paddle inference

* [XPU] support XPU benchmark via paddle inference

* [benchmark] add xpu paddle h2d config files
2023-05-25 14:13:40 +08:00
Zheng-Bicheng
1ac6e8e614 [Shell] support fastdeploy_init.sh for MacOS (#1967)
* Support zsh

* Use the absolute path of the script's directory so the result does not depend on where it is run

* * Fixed the naming error of the LIBS_DIRECOTRIES variable.
* Fixed the error of using '=' for string comparison.

* * Troubleshooting failed environment variable addition on macOS system

---------

Co-authored-by: Edward Zhou <edwardzhou@EdwarddeMacBook-Pro-2.local>
Co-authored-by: Edward Zhou <edward.zhq@gmail.com>
Co-authored-by: DefTruth <31974251+DefTruth@users.noreply.github.com>
2023-05-19 17:30:12 +08:00
Edward Zhou
2032ce8dca [Shell] Support fastdeploy_init.sh for ZSH terminal (#1684)
* Support zsh

* Use the absolute path of the script's directory so the result does not depend on where it is run

---------

Co-authored-by: Edward Zhou <edwardzhou@EdwarddeMacBook-Pro-2.local>
Co-authored-by: Zheng-Bicheng <58363586+Zheng-Bicheng@users.noreply.github.com>
Co-authored-by: DefTruth <31974251+DefTruth@users.noreply.github.com>
2023-05-19 16:51:51 +08:00
DefTruth
1f66d5d9d3 [benchmark] support lite light api & optimize benchmark flags (#1950)
* [benchmark] support lite light api & optimize benchmark flags

* [backend] remove un-used option member

* [backend] remove un-used option member
2023-05-17 18:51:32 +08:00
DefTruth
33e07410da [cmake] Support custom paddle inference url (#1939)
* [cmake] Support custom paddle inference url

* [Python] Add custom Paddle Inference URL support for python

* [Docker] Add fd serving Dockerfile for paddle2.4.2

* [Docker] Add fd serving Dockerfile for paddle2.4.2

* [Docker] Add fd serving Dockerfile for paddle2.4.2

* [Docker] Add fd serving Dockerfile for paddle2.4.2

* [Bug Fix] fixed result format string error

* rerunning the re-touch CIs

* rerunning CIs
2023-05-16 14:30:31 +08:00
DefTruth
15c29f7e49 [Python] Add backward compatible for paddle2.4.2 (#1929)
* [Python] Add backward compatible for paddle2.4.2

* [Python] Add backward compatible for paddle2.4.2

* [scripts] update linux build scripts
2023-05-11 19:34:06 +08:00
DefTruth
652024d2f6 Revert "Remove Paddle Reader" (#1860)
Revert "Remove Paddle Reader (#1813)"

This reverts commit f3d44785c4.
2023-04-23 23:16:31 +08:00
Jason
f3d44785c4 Remove Paddle Reader (#1813)
* Remove Paddle Reader

* support pp-infer c++14

* disable trt cache

---------

Co-authored-by: wang-xinyu <wangxinyu_es@163.com>
2023-04-20 21:12:43 +08:00
DefTruth
9ec419129f [XPU] Update Lite XPU lib to fix l3 cache crash for seg models (#1786)
* Update kunlunxin.cmake

* [XPU] Update Lite XPU lib to fix l3 cache crash
2023-04-11 18:01:52 +08:00
Jason
6343b0db47 [Build] Support build with source code of Paddle2ONNX (#1559)
* Add notes for tensors

* Optimize some apis

* move some warnings

* Support build with Paddle2ONNX

* Add protobuf support

* Fix compile on mac

* add clearn package script

* Add paddle2onnx code

* remove submodule

* Add onnx code

* remove softlink

* add onnx code

* fix error

* Add cmake file

* fix patchelf

* update paddle2onnx

* Delete .gitmodules

---------

Co-authored-by: PaddleCI <paddle_ci@example.com>
Co-authored-by: pangyoki <pangyoki@126.com>
Co-authored-by: jiangjiajun <jiangjiajun@baidu.lcom>
2023-03-17 10:03:22 +08:00
Jack Zhou
f7ef464412 [Other] Use symbolic link pointed to TensorRT library (#1461)
* Add force rpath

* update to dev-6

* remove force-rpath

* remove nvinfer so

* Add symbol link creation when import fastdeploy first

* Add dst

* update symbol link creation

* Add logging

* logging -> logger

* update tips

* Set logging level

* Add --force-rpath
2023-03-10 09:09:04 +08:00
Jason
6be2c0367b [Example] Update runtime examples (#1542)
* Add notes for tensors

* Optimize some apis

* move some warnings
2023-03-08 16:56:04 +08:00
Jason
bf2ba72f1b [Compile] Support build FastDeploy with GLIBCXX_ABI=0 (#1528)
* Support abi=0 build

* trigger ci

* fix cmake
2023-03-07 09:45:30 +08:00
DefTruth
767647d88e [Android] Add WITH_TENSOR_FUNCS option -> Android (#1512)
* [Android] Add WITH_TENSOR_FUNCS option -> Android

* [Android] move android cmake codes -> android.cmake

* [Android] move android cmake codes -> android.cmake
2023-03-06 13:33:43 +08:00
DefTruth
95f809c1ff [Bug Fix] fix build error while enable flycv on linux x64 (#1365)
* [Bug Fix] fix build error while enable flycv on linux x64

* [Bug Fix] fix build error while enable flycv on linux x64

* move WITH_FLYCV_STATIC option -> flycv.cmake
2023-02-20 13:52:31 +08:00
DefTruth
ee85a3cade [Benchmark] Add precision evaluation api from benchmark (#1310)
* [Benchmark] Init benchmark precision api

* [Benchmark] Init benchmark precision api

* [Benchmark] Add benchmark precision api

* [Benchmark] Calculate the statis of diff

* [Benchmark] Calculate the statis of diff

* [Benchmark] Calculate the statis of diff

* [Benchmark] Calculate the statis of diff

* [Benchmark] Calculate the statis of diff

* [Benchmark] Add SplitDataLine utils

* [Benchmark] Add LexSortByXY func

* [Benchmark] Add LexSortByXY func

* [Benchmark] Add LexSortDetectionResultByXY func

* [Benchmark] Add LexSortDetectionResultByXY func

* [Benchmark] Add tensor diff precision test

* [Benchmark] fixed conflicts

* [Benchmark] fixed calc tensor diff

* fixed build bugs

* fixed ci bugs when WITH_TESTING=ON
2023-02-16 17:16:14 +08:00
DefTruth
b732e4c711 [Bug Fix] Add bundle script for text api only pkg (#1295)
[Android] Add bundle script for text api only pkg
2023-02-10 12:00:43 +08:00
DefTruth
ce424c313e [XPU] Add XPU option for benchmark example (#1287)
* [XPU] Add xpu option for benchmark example

* [XPU] Add fp16 flags
2023-02-10 11:24:40 +08:00
DefTruth
ab5377b5fa [Benchmark] Add PaddleYOLOv8 cpp benchmark example & lite flags option (#1270)
* [Android] Add PaddleYOLOv8 cpp benchmark example & lite flags option

* [Benchmark] add linux x86_64 gpu benchmark build script
2023-02-09 11:58:07 +08:00
zhoushunjie
396a291bc2 update cmake 2023-02-08 09:14:45 +00:00
zhoushunjie
f8903222bf Add build_android_cpp_with_text_api_only 2023-02-08 08:31:35 +00:00
DefTruth
f73a538f61 [Backend] support benchmark mode for runtime and backend (#1201)
* [backend] support benchmark mode for runtime and backend

* [backend] support benchmark mode for runtime and backend

* [pybind11] add benchmark methods pybind

* [pybind11] add benchmark methods pybind

* [Other] Update build scripts

* [Other] Update cmake/summary.cmake

* [Other] update build scripts

* [Other] add ENABLE_BENCHMARK option -> setup.py

* optimize backend time recording

* optimize backend time recording

* optimize trt backend time record

* [backend] optimize backend_time recording for trt

* [benchmark] remove redundant logs

* fixed ov_backend conflict

* [benchmark] fixed paddle_backend conflicts

* [benchmark] fixed paddle_backend conflicts

* [benchmark] fixed paddle_backend conflicts

* [benchmark] remove use_gpu option from ort backend option

* [benchmark] update benchmark_ppdet.py

* [benchmark] update benchmark_ppcls.py

* fixed lite backend conflicts

* [Lite] fixed lite xpu

* add benchmark macro

* add RUNTIME_PROFILE_LOOP macros

* add comments for RUNTIME_PROFILE macros

* add comments for new apis

* add comments for new apis

* update benchmark_ppdet.py

* fixed bugs

* remove unused codes

* optimize RUNTIME_PROFILE_LOOP macros

* optimize RUNTIME_PROFILE_LOOP macros

* add comments for benchmark option and result

* add docs for benchmark namespace
2023-02-06 14:29:35 +08:00
DefTruth
2613e9c5d0 [Android] support fastdeploy build with static deps(60Mb->29~31Mb) (#1176)
* [Android] support fastdeploy build with static deps(70Mb->17~19Mb)

* [Android] support fastdeploy build with static deps(60Mb->29~30Mb)

* fixed ci

* fixed ci

* [staticlib] support fd android static lib

* [static] optimize bundle_static_library func

* [staticlib] add api_helpers.h -> staticlib headers

* [staticlib] add api_helpers.h -> staticlib headers

* [staticlib] add api_helpers.h -> staticlib headers

* [staticlib] Fixed Paddle Lite paddle_use_kernels.h to support fd armv7 static lib

* [staticlib] Add strip -> fd static lib target

* [staticlib] optimize bundle_static_library func

* [staticlib] add strip for fd static lib on  mac osx

* [staticlib] move api_helpers -> lite/option

* [staticlib] optimize bundle_static_library

* [staticlib] add Android limit
2023-01-30 19:39:56 +08:00
yeliang2258
c866377382 [Bug Fix]Fix KunlunXin compile bug (#1120)
* fix KunlunXin compile bugs

* update code
2023-01-12 15:52:33 +08:00
Zheng-Bicheng
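3a32024912 [Doc] Update the RKNPU2 docs and modify the script that sets the FastDeploy environment variables (#1102)
* Update cross-compilation

* Update cross-compilation

* Update cross-compilation

* Update cross-compilation

* Update cross-compilation

* Update cross-compilation

* Update cross-compilation

* Update cross-compilation

* Update cross-compilation

* Update issues.md

* Update fastdeploy_init.sh

* Update cross-compilation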

Co-authored-by: Jason <928090362@qq.com>
2023-01-10 15:22:06 +08:00
Jason
df20b2a02b [Other] Remove useless macros (#1095)
* Remove useless macros

* trigger ci

* fix check error

* rename INTEGRATE_PADDLE2ONNX to ENABLE_PADDLE2ONNX
2023-01-09 21:35:23 +08:00