周周周
f6f726c773
clean code in sttantion ( #3917 )
2025-09-05 20:49:01 +08:00
chen
0d989829bb
Compatible with EB 0.3B torch model arch ( #3913 )
...
* fix
* check
2025-09-05 19:04:59 +08:00
ltd0924
bd7d15f7ea
[Feature] support controller port in multi api server ( #3898 )
...
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
Publish Job / publish_pre_check (push) Has been cancelled
Publish Job / print_publish_pre_check_outputs (push) Has been cancelled
Publish Job / FD-Clone-Linux (push) Has been cancelled
Publish Job / Show Code Archive Output (push) Has been cancelled
Publish Job / BUILD_SM8090 (push) Has been cancelled
Publish Job / BUILD_SM8689 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8090 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8689 (push) Has been cancelled
Publish Job / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
Publish Job / Run FastDeploy LogProb Tests (push) Has been cancelled
Publish Job / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
Publish Job / Run Base Tests (push) Has been cancelled
Publish Job / Run Accuracy Tests (push) Has been cancelled
Publish Job / Run Stable Tests (push) Has been cancelled
CI Images Build / FD-Clone-Linux (push) Has been cancelled
CI Images Build / Show Code Archive Output (push) Has been cancelled
CI Images Build / CI Images Build (push) Has been cancelled
CI Images Build / BUILD_SM8090 (push) Has been cancelled
CI Images Build / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
CI Images Build / Run FastDeploy LogProb Tests (push) Has been cancelled
CI Images Build / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
CI Images Build / Run Base Tests (push) Has been cancelled
CI Images Build / Run Accuracy Tests (push) Has been cancelled
CI Images Build / Run Stable Tests (push) Has been cancelled
CI Images Build / Publish Docker Images Pre Check (push) Has been cancelled
* Update serving_chat.py
* Update serving_completion.py
* Update serving_completion.py
* Update multi_api_server.py
2025-09-05 17:16:31 +08:00
Yuan Xiaolan
2cf55168ca
load hadamard_block_size from config ( #3797 )
2025-09-05 17:07:58 +08:00
AIbin
41aee08982
【Inference Optimize】Update MergedReplicatedLinear for DSK qkv_a_proj_with_mqa. ( #3673 )
...
* support MergedReplicatedLinear
* update MergedReplicatedLinear to support DSK_wint4 V1_load
* update model name
* update linear class
* fix
* fix v0 moe_bias load
---------
Co-authored-by: bukejiyu <52310069+bukejiyu@users.noreply.github.com >
2025-09-04 21:16:05 -07:00
ooo oo
b23fc654d9
【Hackathon 9th No.32】add unit tests for group_swiglu_with_masked ( #3748 )
2025-09-05 11:53:47 +08:00
gaoziyuan
ab1929f5ff
fix mem boom in ep ( #3854 )
2025-09-05 11:48:21 +08:00
Echo-Nie
fc3bc56e59
【Hackathon 9th No.35】add test_moe_redundant_topk_select ( #3867 )
2025-09-05 11:29:02 +08:00
ltd0924
7643e6e6b2
[Docs] add data parallel ( #3883 )
...
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
* [Docs] add data parallel
* [Docs] add data parallel
2025-09-04 20:33:50 +08:00
ltd0924
e0e7d68435
Update qwen_vl_processor.py ( #3808 )
2025-09-04 20:31:48 +08:00
Zhang Yulong
4c160aa4dd
Update test_ernie_21b_mtp.py ( #3885 )
2025-09-04 20:20:36 +08:00
YuBaoku
c7b7126b20
[CI] update paddleformers==0.2 in develop ( #3878 )
2025-09-04 20:12:41 +08:00
SunLei
29628de6a7
Support for async processor added. ( #3869 )
...
* Support for async processor added.
* remove yappi code
---------
Co-authored-by: Yuanle Liu <yuanlehome@163.com >
2025-09-04 19:58:53 +08:00
xiaolei373
ed97cf8396
Graceful shut down ( #3785 )
...
* feat(log):add_request_and_response_log
* 优雅退出-接口增加退出时长参数
2025-09-04 19:33:50 +08:00
freeliuzc
88d44a2c93
support mtp in v1_scheduler mode ( #3695 )
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
Publish Job / publish_pre_check (push) Has been cancelled
Publish Job / print_publish_pre_check_outputs (push) Has been cancelled
Publish Job / FD-Clone-Linux (push) Has been cancelled
Publish Job / Show Code Archive Output (push) Has been cancelled
Publish Job / BUILD_SM8090 (push) Has been cancelled
Publish Job / BUILD_SM8689 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8090 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8689 (push) Has been cancelled
Publish Job / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
Publish Job / Run FastDeploy LogProb Tests (push) Has been cancelled
Publish Job / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
Publish Job / Run Base Tests (push) Has been cancelled
Publish Job / Run Accuracy Tests (push) Has been cancelled
Publish Job / Run Stable Tests (push) Has been cancelled
CI Images Build / FD-Clone-Linux (push) Has been cancelled
CI Images Build / Show Code Archive Output (push) Has been cancelled
CI Images Build / CI Images Build (push) Has been cancelled
CI Images Build / BUILD_SM8090 (push) Has been cancelled
CI Images Build / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
CI Images Build / Run FastDeploy LogProb Tests (push) Has been cancelled
CI Images Build / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
CI Images Build / Run Base Tests (push) Has been cancelled
CI Images Build / Run Accuracy Tests (push) Has been cancelled
CI Images Build / Run Stable Tests (push) Has been cancelled
CI Images Build / Publish Docker Images Pre Check (push) Has been cancelled
2025-09-04 17:39:59 +08:00
xiaoxiaohehe001
f265a26f8b
support mtp rope_3d ( #3791 )
...
* support mtp rope_3d
* Update speculate_write_cache_with_rope_kernel.cu
2025-09-04 17:18:05 +08:00
RichardWooSJTU
f36a388ffe
fix response processsors ( #3826 )
...
* fix response processsors
* fix ci
* fix ut
---------
Co-authored-by: YuBaoku <49938469+EmmonsCurse@users.noreply.github.com >
2025-09-04 16:01:25 +08:00
chenjian
22c165d6dd
[Feature] Set v1 scheduler as default in develop ( #3807 )
...
* Set scheduler v1 as default
* Set scheduler v1 as default
* Set scheduler v1 as default
* Set scheduler v1 as default
* Set scheduler v1 as default
* close V1 in guided_decoding
* fix vl ci
* close V1 in guided_decoding
2025-09-04 15:16:56 +08:00
co63oc
e83251699f
【Hackathon 9th No.63】add test_draft_model_postprocess.py ( #3757 )
...
* add test_draft_model_postprocess.py
* fix
* fix
2025-09-04 15:00:48 +08:00
Echo-Nie
ac46ef403a
【Hackathon 9th No.34】add test_get_position_ids_and_mask_encoder_batch ( #3739 )
2025-09-04 14:54:30 +08:00
RichardWooSJTU
0989788b29
support extend block tables ( #3824 )
2025-09-04 14:39:04 +08:00
gaoziyuan
6ef3b611b0
add dp config ( #3822 )
2025-09-04 11:46:48 +08:00
ooo oo
460809070c
【Hackathon 9th No.54、57】 add unit tests for per_token_quant and per_token_quant_padding ( #3746 )
2025-09-04 11:46:38 +08:00
co63oc
7baf1b56e0
【Hackathon 9th No.27】add test_get_padding_offset ( #3708 )
...
* add test_get_padding_offset
* fix
* fix
* fix
2025-09-04 11:42:35 +08:00
co63oc
9ec4fa0f8e
fix typo EngineSevice EngineService ( #3841 )
2025-09-04 11:20:36 +08:00
yangjianfengo1
c870be6d27
fix port ( #3863 )
2025-09-04 10:01:38 +08:00
plusNew001
3790505319
[XPU] Update XPU stable xvllm and xtdk version for 2.2 ( #3853 )
...
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
Publish Job / publish_pre_check (push) Has been cancelled
Publish Job / print_publish_pre_check_outputs (push) Has been cancelled
Publish Job / FD-Clone-Linux (push) Has been cancelled
Publish Job / Show Code Archive Output (push) Has been cancelled
Publish Job / BUILD_SM8090 (push) Has been cancelled
Publish Job / BUILD_SM8689 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8090 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8689 (push) Has been cancelled
Publish Job / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
Publish Job / Run FastDeploy LogProb Tests (push) Has been cancelled
Publish Job / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
Publish Job / Run Base Tests (push) Has been cancelled
Publish Job / Run Accuracy Tests (push) Has been cancelled
Publish Job / Run Stable Tests (push) Has been cancelled
CI Images Build / FD-Clone-Linux (push) Has been cancelled
CI Images Build / Show Code Archive Output (push) Has been cancelled
CI Images Build / CI Images Build (push) Has been cancelled
CI Images Build / BUILD_SM8090 (push) Has been cancelled
CI Images Build / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
CI Images Build / Run FastDeploy LogProb Tests (push) Has been cancelled
CI Images Build / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
CI Images Build / Run Base Tests (push) Has been cancelled
CI Images Build / Run Accuracy Tests (push) Has been cancelled
CI Images Build / Run Stable Tests (push) Has been cancelled
CI Images Build / Publish Docker Images Pre Check (push) Has been cancelled
* Add debug environment variable exports
Added debug environment variable exports for CLANG_PATH and XVLLM_PATH.
* Lock paddlepaddle-xpu version in CI script
Temporarily lock paddlepaddle-xpu version due to framework update issues.
* Update no_proxy environment variable in CI workflow
* Install lsof tool in run_ci_xpu.sh
* Update dependency versions for stable release
* Update paddlepaddle-xpu installation command
2025-09-03 23:21:00 +08:00
co63oc
e24b745d48
[UnitTest][MTP]add test_speculate_get_output_padding_offset ( #3740 )
2025-09-03 22:21:21 +08:00
co63oc
aaa2de1afa
[UnitTest][MTP]add test_speculate_get_padding_offset ( #3730 )
2025-09-03 22:21:02 +08:00
yyssys
abde903813
Automatically configure workers based on max-num-seqs ( #3846 )
...
Automatically configure workers based on max-num-seqs
2025-09-03 21:12:42 +08:00
YUNSHEN XIE
7dbd9412b0
reopen ut ( #3795 )
...
* reopen ut
* update
* update
* update ci dockerfile
2025-09-03 19:05:20 +08:00
luukunn
fc598d4c5a
add reasoning parser plugin ( #3811 )
...
* add reasoning parser plugin
* fix finish reason
2025-09-03 18:31:27 +08:00
Ayakouji
31313e0f3d
[Feature] ernie4_5_vl_moe
support huggingface safetensor loading ( #3750 )
...
* update
* update
* update in tp
* add todo
* update
---------
Co-authored-by: aquagull <hongyuh@qq.com >
2025-09-03 02:58:59 -07:00
lizexu123
4c998c3636
[Code Simplification] delete cum_offsets_out ( #3815 )
...
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
* fix
* fix
2025-09-03 16:15:33 +08:00
YuanRisheng
0a1ce612c2
V1 loader support ep ( #3801 )
2025-09-03 16:05:41 +08:00
Yuan Xiaolan
fa58a9fa8f
qk norm for speculate decode C16 ( #3637 )
2025-09-03 14:53:56 +08:00
plusNew001
d22d3de256
[XPU] Update XPU CI case ( #3837 )
...
* Add debug environment variable exports
Added debug environment variable exports for CLANG_PATH and XVLLM_PATH.
* Lock paddlepaddle-xpu version in CI script
Temporarily lock paddlepaddle-xpu version due to framework update issues.
* Update no_proxy environment variable in CI workflow
* Install lsof tool in run_ci_xpu.sh
2025-09-03 14:32:12 +08:00
lzy
2527eb0e4e
fix test_append_attention_with_output.py ( #3831 )
...
Co-authored-by: plusNew001 <95567040+plusNew001@users.noreply.github.com >
2025-09-03 14:07:50 +08:00
AIbin
54b458fd98
[Doc] update wint2 doc ( #3819 )
...
* update_wint2_doc
2025-09-03 11:27:43 +08:00
plusNew001
d81c57146f
[XPU] FIX XPU CI BUG ( #3829 )
...
* Add debug environment variable exports
Added debug environment variable exports for CLANG_PATH and XVLLM_PATH.
* Lock paddlepaddle-xpu version in CI script
Temporarily lock paddlepaddle-xpu version due to framework update issues.
2025-09-03 11:25:48 +08:00
ooo oo
2396e49f9e
【Hackathon 9th No.73】add unit tests for graph_opt_backend ( #3609 )
...
* test: add unit tests for graph_opt_backend
* refactor(tests): improve graph optimization test structure and readability
* fix(tests): correct CUDA graph related typos in test files
- Fix class name: TestCUDAGrpahSubgraph -> TestCUDAGraphSubgraph
* refactor(test): support attention layer and optimize graph optimization backend test to eliminate redundant baseline calculations
* remove some func call
---------
Co-authored-by: RAM <gstian5555@outlook.com >
Co-authored-by: Tao Luo <luotao02@baidu.com >
2025-09-03 11:18:00 +08:00
co63oc
94a61d505c
fix dcu_worker.py ( #3734 )
2025-09-03 10:57:42 +08:00
co63oc
ce998449e0
fix w8a8.py ( #3733 )
2025-09-03 10:57:26 +08:00
Echo-Nie
f7a4bea785
【Hackathon 9th No.84】Supplementary Unit Test for fastdeploy/reasoning ( #3570 )
...
测试内容:测试基类的注册、获取函数功能是否正常
Co-authored-by: Tao Luo <luotao02@baidu.com >
2025-09-03 10:55:02 +08:00
co63oc
5441538173
rename fused_get_rope.cu ( #3752 )
...
* rename fused_get_rope.cu
* fix
* fix typos
* fix
* fix
2025-09-03 10:54:34 +08:00
ltd0924
2c9b169c0e
[BugFix] fix scheduler invalid ( #3803 )
...
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
Publish Job / publish_pre_check (push) Has been cancelled
Publish Job / print_publish_pre_check_outputs (push) Has been cancelled
Publish Job / FD-Clone-Linux (push) Has been cancelled
Publish Job / Show Code Archive Output (push) Has been cancelled
Publish Job / BUILD_SM8090 (push) Has been cancelled
Publish Job / BUILD_SM8689 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8090 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8689 (push) Has been cancelled
Publish Job / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
Publish Job / Run FastDeploy LogProb Tests (push) Has been cancelled
Publish Job / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
Publish Job / Run Base Tests (push) Has been cancelled
Publish Job / Run Accuracy Tests (push) Has been cancelled
Publish Job / Run Stable Tests (push) Has been cancelled
CI Images Build / FD-Clone-Linux (push) Has been cancelled
CI Images Build / Show Code Archive Output (push) Has been cancelled
CI Images Build / CI Images Build (push) Has been cancelled
CI Images Build / BUILD_SM8090 (push) Has been cancelled
CI Images Build / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
CI Images Build / Run FastDeploy LogProb Tests (push) Has been cancelled
CI Images Build / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
CI Images Build / Run Base Tests (push) Has been cancelled
CI Images Build / Run Accuracy Tests (push) Has been cancelled
CI Images Build / Run Stable Tests (push) Has been cancelled
CI Images Build / Publish Docker Images Pre Check (push) Has been cancelled
* [BugFix] fix max streaming tokens invalid
* fix scheduler bug
* fix scheduler bug
2025-09-02 20:28:51 +08:00
Longzhi Wang
e0c9a6c76c
[Feat] Support streaming transfer data using ZMQ ( #3521 )
...
* Support streaming transfer data of ZMQ
* fix typo
* fix typo
* support tp
* add unittest
* update
* update
* fix typo
* fix typo
* fix tp_num in ci machine
---------
Co-authored-by: Wanglongzhi2001 <>
2025-09-02 19:52:19 +08:00
Echo-Nie
0fe1d62232
[MTP] add test_draft_model_set_value_by_flags.py ( #3741 )
2025-09-02 19:33:33 +08:00
Jiang-Jia-Jun
18e5d355a1
Update version in docs
2025-09-02 19:21:10 +08:00
yangjianfengo1
8e1b35a09b
【Fix bug] w4afp8 的nblock固定为256,并且fa3的append attn 增加mask参数 ( #3771 )
...
* fix w4afp8
* 增加集中式配置
* codestyle
* fix fa3 append attn
2025-09-02 19:17:01 +08:00