plusNew001
f0140be1e1
Change paddlepaddle-xpu installation command ( #3646 )
...
Updated the installation command for paddlepaddle-xpu to use a specific wheel file.
2025-08-27 16:17:19 +08:00
plusNew001
f8b70bf60c
update xpu ci ( #3632 )
...
* Update Docker image version in CI workflow
* Modify paddlepaddle-xpu installation and add dependencies
Updated installation source for paddlepaddle-xpu and added dependency download step.
* Fix no_proxy environment variable in CI workflow
2025-08-27 14:25:56 +08:00
gaoziyuan
82e64b13e1
[NewFeature]Support dp multi api server && Fix some bug in mixed ep && merge develop ( #3598 )
...
* [Feature] update ep
* fix ci
* fix ci
* fix ci
* fix ci
* fix ci
* fix ci
* fix ci
* fix queue ports idx
* fix ci
* fix ci
* fix ci
* fix ci
* fix ci
* fix ci
* fix ci
* fix ci
* Update engine.py
* fix ci
* fix some bug in mixed ep
* add server fix and op fix
* rm some log
* fix code style
* ltd fix
* fix
* fix
* fix some bug
* fix bug
* fix bug
* fix style
* Update config.py
* Update splitwise_connector.py
* Update cache_messager.py
* Update __init__.py
* merge and fix
* Update engine.py
* Update common_engine.py
* Update run_ci_xpu.sh
* Update ernie_processor.py
* Update ernie_processor.py
---------
Co-authored-by: ltd0924 <ltd0924@sina.com >
Co-authored-by: ltd0924 <32387785+ltd0924@users.noreply.github.com >
2025-08-26 19:59:02 +08:00
Yuanle Liu
cbce94a00e
rename ernie_xxx to ernie4_5_xxx ( #3621 )
...
* rename ernie_xxx to ernie4_5_xxx
* ci fix
2025-08-26 19:29:27 +08:00
YuanRisheng
642480f5f6
[CI] Standard unittest ( #3606 )
...
* standard unittest
* fix bugs
* fix script
2025-08-26 19:03:11 +08:00
YUNSHEN XIE
46664985fc
Modify the existing coverage collection method ( #3573 )
...
fix cov report
2025-08-25 10:35:35 +08:00
bukejiyu
77514e3e1e
[V1 Loader] support weight_only ( #3413 )
...
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
Publish Job / publish_pre_check (push) Has been cancelled
Publish Job / print_publish_pre_check_outputs (push) Has been cancelled
Publish Job / FD-Clone-Linux (push) Has been cancelled
Publish Job / Show Code Archive Output (push) Has been cancelled
Publish Job / BUILD_SM8090 (push) Has been cancelled
Publish Job / BUILD_SM8689 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8090 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8689 (push) Has been cancelled
Publish Job / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
Publish Job / Run FastDeploy LogProb Tests (push) Has been cancelled
Publish Job / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
Publish Job / Run Base Tests (push) Has been cancelled
Publish Job / Run Accuracy Tests (push) Has been cancelled
* support wint4/wint8
* delete smoe case
* update ci
* print log
2025-08-23 13:13:41 +08:00
YUNSHEN XIE
cb166053ba
fix test name ( #3493 )
...
* fix test name
* update
* update
* fix
* fix
* update
* update
* update
* update
* update
* fix
* update
2025-08-22 23:43:47 +08:00
plusNew001
7ae41e9daf
[CI] fix xpu ci bug ( #3535 )
2025-08-22 15:08:39 +08:00
Zhang Yulong
b7eee3aec1
Update CI ( #3474 )
...
* update CI cases
* update CI cases
* update CI cases
* update CI cases
* Merge upstream/develop and resolve directory rename conflict
* Merge upstream/develop and resolve directory rename conflict
* Merge upstream/develop and resolve directory rename conflict
* update deploy
* update deploy
* update deploy
* update deploy
* update deploy
2025-08-21 16:49:20 +08:00
YUNSHEN XIE
e197894977
add e2e cases ( #3476 )
...
* add e2e cases
* fix
2025-08-20 18:50:14 +08:00
YuBaoku
33d369586b
[CI] remove useless case ( #3482 )
2025-08-20 14:20:30 +08:00
YUNSHEN XIE
3a6058e445
Add stable ci ( #3460 )
...
* add stable ci
* fix
* update
* fix
* rename tests dir;fix stable ci bug
* add timeout limit
* update
2025-08-20 08:57:17 +08:00
Zero Rains
8b12c80f90
[FixBug] compute early stopping with real batch size ( #3418 )
...
* [FixBug] compute early stopping with real batch size
* update
* fix test_sampler
2025-08-18 22:09:21 -07:00
Jundong Liu
ea4a3b479c
[Excutor] Increase buffer size to prevent address corruption; add forward metadata debug tool ( #3404 )
...
* 修复buffer申请不够大,增加打印forwardmetadata的工具
* fix mistake
* Make CPU tensor in CPUPlace
* Add test about forward_meta_str and Add unitest_requirement
---------
Co-authored-by: RAM <gstian5555@outlook.com >
2025-08-18 16:14:09 +08:00
YUNSHEN XIE
f72db9386c
Add requirements for running unit tests ( #3350 )
...
* Add requirements for running unit tests
* update
2025-08-14 17:37:18 +08:00
YUNSHEN XIE
1ace375fc3
Optimize CI execution workflow ( #3371 )
...
* Optimize CI execution workflow
* fix
2025-08-13 18:47:31 +08:00
EnflameGCU
d1a92e3e17
[GCU] Enable gcu CI ( #3190 )
...
Deploy GitHub Pages / deploy (push) Has been cancelled
* [GCU] Update to the latest version
* [GCU] Enable CI
2025-08-13 11:48:24 +08:00
yzwu
fbdd6b0663
[Iluvatar GPU] Optimze attention and moe performance ( #3234 )
2025-08-08 10:51:24 +08:00
YuanRisheng
0074b423a9
fix ci bug ( #3239 )
Deploy GitHub Pages / deploy (push) Has been cancelled
2025-08-07 11:32:39 +08:00
Yzc216
d9e3f88f9e
[Feature] multi source download ( #3125 )
...
Deploy GitHub Pages / deploy (push) Has been cancelled
* multi-source download
* multi-source download
* huggingface download revision
* requirement
* style
* add revision arg
* test
* pre-commit
* Change default download
* change requirements.txt
* modify English Documentation
* documentation
* modify model download path
* add requirements
* error optimization
* 连接失败兜底
* 连接失败兜底
* 连接失败兜底
* unit test
* unit test
* unit test
* test
* test
2025-08-07 00:40:27 +08:00
YuanRisheng
a6e8b780f8
fix approve ( #3224 )
2025-08-06 10:36:01 +08:00
YuanRisheng
79bbacc152
Fix approve shell scripts ( #3108 )
...
* fix approve
* fix
2025-08-04 15:51:33 +08:00
YUNSHEN XIE
bdb83e007d
fix ci ( #3141 )
2025-08-01 17:42:26 +08:00
YUNSHEN XIE
e1011e92d9
disable test_cuda_graph.py ( #3124 )
2025-07-31 22:03:48 +08:00
plusNew001
8c63237cfa
[CI] add xpu ci case ( #3111 )
...
* [CI] add xpu ci case
* [CI]Update run_ci_xpu.sh
2025-07-31 22:03:34 +08:00
YUNSHEN XIE
ff6a109b4d
Describe PR diff coverage using JSON file ( #3114 )
...
* Refactored ci pipeline
* update
* Describe PR diff coverage using JSON file
* remove pip cache setting from Approve
* fix
* update
2025-07-31 21:59:20 +08:00
kevin
22cab724e8
[Feature] block scheduler v1 support prefix caching ( #3061 )
...
* block scheduler v1 support prefix cache
* update code
* update code
* fix code bug
* add timeout time
---------
Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com >
2025-07-31 19:29:19 +08:00
YUNSHEN XIE
583eae2fd1
fix ci ( #3106 )
...
* fix ci
* disable test_non_streaming_chat_with_min_tokens
2025-07-31 17:25:08 +08:00
YuanRisheng
acc5c0aa85
add ci for custom op approve ( #3079 )
2025-07-30 16:50:20 +08:00
bukejiyu
8e203666d9
w4a8 offline ( #3074 )
...
* w4a8 offline
* update
* update
* update
2025-07-30 16:33:30 +08:00
Jiang-Jia-Jun
ffa0f4d99b
[Fix] Fix version function ( #3076 )
...
* [Fix] Fix version function
* Fix commit
* Fix commit
* fix code sync
* Update coverage_run.sh
---------
Co-authored-by: Jiang-Jia-Jun <jiangjiajun@baidu.com >
2025-07-30 16:05:24 +08:00
YUNSHEN XIE
fb410b5f4c
Add unit test run and coverage report generation ( #3011 )
...
* Add unit test run and coverage report generation
* fix
* fix: upload coverage report failure
* fix
* update
* fix
* fix
* update
2025-07-27 22:48:34 +08:00
EnflameGCU
7634ffb709
[GCU] Add CI ( #3006 )
2025-07-25 10:59:29 +08:00
liddk1121
17c5d3a241
[Iluvatar GPU] Add CI scripts ( #2876 )
2025-07-21 09:44:42 +08:00
Zero Rains
25698d56d1
polish code with new pre-commit rule ( #2923 )
2025-07-19 23:19:27 +08:00
ZhangYulongg
eb77b1be6d
update ci cases
2025-07-18 21:44:07 +08:00
LQX
11cfdf5d89
添加XPU CI, test=model ( #2701 )
...
* 添加XPU CI, test=model
* 添加XPU CI, test=model
* 添加XPU CI, test=model
* 添加XPU CI, test=model
* 添加XPU CI, test=model
* 添加XPU CI, test=model
* 添加XPU CI, test=model
* 添加XPU CI, test=model
* 添加XPU CI, test=model
2025-07-04 16:16:06 +08:00
gaoziyuan
a5ae88ded9
[feature]add fd whl version info ( #2698 )
2025-07-04 14:12:42 +08:00
YuBaoku
bb880c8d7c
Update CI test cases ( #2671 )
...
* set git identity to avoid merge failure in CI
* add ci cases
2025-07-02 15:08:39 +08:00
RichardWooSJTU
85090ed799
remove unuseful scripts ( #2652 )
2025-07-01 10:18:25 +08:00
YUNSHEN XIE
d5af78945b
Add ci ( #2650 )
...
Deploy GitHub Pages / deploy (push) Has been cancelled
* add ci ut and workflow
* Automatically cancel any previous CI runs for the ci.yml workflow, keeping only the latest one active
2025-06-30 20:20:49 +08:00
Jiang-Jia-Jun
92c2cfa2e7
Sync v2.0 version of code to github repo
2025-06-29 23:29:37 +00:00
XieYunshen
0825146538
add ci ut and workflow
2025-06-16 02:18:00 +08:00
jiangjiajun
149c79699d
[LLM] First commit the llm deployment code
2025-06-16 00:04:48 +08:00
DefTruth
6a8cd4d759
[Backend] support ort offline graph optimize option ( #2268 )
...
* support ort offline graph optimize option
* support ort offline graph optimize option
2023-11-02 09:19:33 +08:00
DefTruth
77ee48f9b8
[Bug Fix] fixed compile error for cpu building ( #2105 )
...
* [cmake] upgrade windows paddle inference -> 2.5.0
* [cmake] upgrade windows paddle inference -> 2.5.0
* fix paddle custom ops bug on windows
* [Backend] refactor paddle custom ops
* [Bug Fix] fixed paddle custom ops windows build error
* [Bug Fix] fixed compile error for cpu building
2023-07-13 17:02:25 +08:00
DefTruth
b2426aefa9
[Backend] add paddle custom ops compatible policy ( #2070 )
...
* Add centerpoint
* fix postprocess op file name
* [Backend] add paddle custom ops compatible policy
* [Backend] add paddle custom ops compatible policy
* [Backend] add paddle custom ops compatible policy
* upgrade linx paddle gpu -> 2.5
* add custom op compatible policy for paddle 2.5
* add custom op compatible policy for paddle 2.5
* add custom op compatible policy for paddle 2.5
* add collect_trt_shape_by_device option for paddle backend
* add collect_trt_shape_by_device option for paddle backend
* add custom op option for python build
* fix python build bugs
* update paddle linux x86 cpu only lib
* update paddle linux gpu lib
* update patchelf cmake
* fix paddle backend option pybind
* update paddle_inference.cmake
* add cuda sm_80 support (A100)
---------
Co-authored-by: zengshao0622 <peter_z96@163.com >
Co-authored-by: qiuyanjun <qiuyanjun@baidu.com >
2023-06-29 22:32:14 +08:00
DefTruth
ff631b7838
[Server] Support encrypt & auth for FD Server ( #2018 )
...
* [Server] Support GPU encrypt & auth for FD Triton Server
* fix dockerfile proxy env error
* update build scrpits
* remove some logs
---------
Co-authored-by: root <root@yq02-sys-rpm1206692e6.yq02.baidu.com >
Co-authored-by: qiuyanjun <qiuyanjun@baidu.com >
2023-06-13 20:44:25 +08:00
DefTruth
284b1b46c8
[Serving] Support XPU encrypt & auth server ( #2007 )
...
* [patchelf] fix patchelf error for inference xpu
* [serving] add xpu dockerfile and support fd server
* [serving] add xpu dockerfile and support fd server
* [Serving] support XPU + Tritron
* [Serving] support XPU + Tritron
* [Dockerfile] update xpu tritron docker file -> paddle 0.0.0
* [Dockerfile] update xpu tritron docker file -> paddle 0.0.0
* [Dockerfile] update xpu tritron docker file -> paddle 0.0.0
* [Dockerfile] add comments for xpu tritron dockerfile
* [Doruntime] fix xpu infer error
* [Doruntime] fix xpu infer error
* [XPU] update xpu dockerfile
* add xpu triton server docs
* add xpu triton server docs
* add xpu triton server docs
* add xpu triton server docs
* update xpu triton server docs
* update xpu triton server docs
* update xpu triton server docs
* update xpu triton server docs
* update xpu triton server docs
* update xpu triton server docs
* update xpu triton server docs
* update xpu triton server docs
* [XPU] Update XPU L3 Cache setting docs
* [XPU] Add Encryption and AUTH support for XPU Server
* [XPU] Add Encryption and AUTH support for XPU Server
* [Bug Fix] fix paddle reader error
* [Serving] Support XPU encrypt & auth server
* [Serving] Support XPU encrypt & auth server
* [Serving] Support XPU encrypt & auth server
* [Serving] Support XPU encrypt & auth server
* [Triton] switch TAG 22.12 -> TAG 21.10wq
* update xpu auth server script
2023-06-01 21:36:44 +08:00