kxz2002
2dfbcf3cc9
[BugFix] Fix inference_start_time ( #4922 )
...
* fix inference_start_time
* fix inference_start_time
2025-11-10 19:28:44 +08:00
LiqinruiG
aa79e6185a
[Docs] Improve reasoning_out docs ( #4901 )
...
* [Docs] Improve reasoning_out docs
* [Docs] Improve reasoning_out docs
* [Docs] Improve reasoning_out docs
---------
Co-authored-by: liqinrui <liqinrui@baidu.com>
2025-11-10 19:20:38 +08:00
qw86972190
07b21d241d
[XPU]Update documentation ( #4917 )
...
* [XPU]Update documentation
* [XPU]Update documentation
* [XPU]Update documentation
* [XPU]Update documentation
* [XPU][Docs] Update documentation
* [XPU][Docs] Update documentation
* [XPU][Docs] Update documentation
* [XPU][Docs] Update documentation
* [XPU][Docs] Update documentation
* [XPU][Docs] Update documentation
2025-11-10 19:11:42 +08:00
周周周
54536267db
[DeepEP] support P async_finish ( #4899 )
2025-11-10 18:24:02 +08:00
chenjian
78895e2c7d
[Bug Fix] fix bug for PD EP ( #4823 )
...
* fix bug for PD EP
* fix
* optimize perf for engine worker queue
* fix bug
* fix internode ll two stage
* fix for ci
* fix bug
2025-11-10 15:33:29 +08:00
Echo-Nie
112623e33e
init version, exist some bugs, waiting fix ( #4906 )
2025-11-10 14:16:09 +08:00
LiqinruiG
90b0936ae9
[Docs] add api-key usage instructions ( #4902 )
...
* [Docs] add api-key usage instructions
* [Docs] add api-key usage instructions
---------
Co-authored-by: liqinrui <liqinrui@baidu.com>
2025-11-10 13:39:39 +08:00
luukunn
41c0bef964
[BugFix] When the value of "temperature" is 0, adjust it to 1e-06 ( #4900 )
...
* add default temperature value
* add unit test
* update
* update
* add unit test
* update
* fix unit test
2025-11-10 13:24:33 +08:00
plusNew001
0a3bc84f71
[XPU][CI]Update test assertion and base response value ( #4907 )
2025-11-10 11:44:54 +08:00
zhuzixuan
8a9e7b53af
[Docs]Supplement the English and Chinese user documentation for Tool calling ( #4895 )
...
* write tool calling docs, v1.0
* write tool calling docs, v1.0
* write tool calling docs, v1.0
* tool calling doc,v1.1
* tool calling doc,v1.1
* tool calling doc,v1.1
* tool calling doc,v1.1
2025-11-08 20:05:14 +08:00
kxz2002
87911b7cf1
[Feature] Enable FastDeploy to support adding the “--api-key” authentication parameter. ( #4806 )
...
* add api key initial commit
* add unit test
* modify unit test
* move middleware to a single file and add unit tests
2025-11-08 18:24:02 +08:00
chen
80aedb82ce
[BugFix] max_lgprobes=-1 maps to ori_vocab_size ( #4884 )
...
* -1 ori_vocab_size
* check
* check
* check
* revert config.py
2025-11-07 22:15:40 +08:00
Neil Zhu
6de1ce3b25
[Metax] support ERNIE-4.5-VL-28B ( #4820 )
2025-11-07 04:55:49 -08:00
YuBaoku
69e503499d
[CI] fix docker_build error of ciuse ( #4886 )
2025-11-07 19:44:21 +08:00
chen
6871aad03d
[BugFix] fix token_processor zmq ( #4827 )
...
* fix_token_processor_zmq
* check pooling_model's token_ids is None
* revert
2025-11-07 19:43:25 +08:00
plusNew001
fa098383f6
[XPU][CI] Ci bug fix ( #4889 )
...
* Refactor test_45t by commenting out responses
Comment out base response variables and update assertion.
* Update run_w4a8.py
* Fix assertion syntax in run_45T.py
2025-11-07 17:50:11 +08:00
ddchenhao66
72d5ee9a7c
[XPU] modify 424B model deployment parameter ( #4888 )
...
Co-authored-by: ddchenhao66 <dhaochen163.com>
2025-11-07 17:34:37 +08:00
zhupengyang
6b5ae9ffea
[XPU] fix ep_tp all2all ci ( #4876 )
2025-11-07 16:37:23 +08:00
ming1753
cba185f1fe
[Feature] Optim PaddleOCR-VL ( #4873 )
...
* [Feature] Optim PaddleOCR-VL
* fix bug
2025-11-07 14:56:44 +08:00
Jiang-Jia-Jun
bbe0820555
Add instructions for copilot reviewer
2025-11-07 11:19:27 +08:00
Haonan Luo
79e6bf4bdc
[Others] Delete PaddleOCR Useless Function ( #4815 )
...
* fix paddleocr prefix cache bug
* add test for paddleocr_vl
* disable prefix-caching in ocr
* add test for paddleocr_vl
* Fix top_p for rejection sampling
* delete useless func for paddleocr
---------
Co-authored-by: ming1753 <ideaminghp@163.com>
Co-authored-by: ming1753 <61511741+ming1753@users.noreply.github.com>
2025-11-07 11:14:41 +08:00
Jiang-Jia-Jun
048856a7f6
Add instructions for copilot reviewer
2025-11-07 11:01:05 +08:00
YuBaoku
d0f9535ee7
[CI] Refactor check-bypass logic in run_tests_with_coverage ( #4655 )
...
* [CI] Change check-bypass workflow using commit SHA
* [CI] Change check-bypass in run_tests_with_coverage
2025-11-07 10:47:27 +08:00
YuBaoku
fa28745f19
[CI] Update ERNIE-4.5-VL baseline to adapt to MoE changes ( #4867 )
2025-11-06 22:02:10 +08:00
kevin
cc34487810
[Feature] support mm disable_chunked ( #4803 )
...
* support mm disable_chunked
* update code
* update code
* update code
2025-11-06 21:32:25 +08:00
Jiang-Jia-Jun
6b68c58e8d
Revert "[Bug Fix] fix ernie4_5_vl_moe ( #4843 )" ( #4863 )
...
This reverts commit 6460d4df27.
2025-11-06 19:18:29 +08:00
LokeZhou
6460d4df27
[Bug Fix] fix ernie4_5_vl_moe ( #4843 )
...
* fix ernie4_5_vl_moe
* fix vl moe meta
2025-11-06 19:16:33 +08:00
YuBaoku
a139f8f3cb
[CI] Optimize port cleanup logic ( #4860 )
2025-11-06 19:13:48 +08:00
Zhang Yulong
5aa73d32f4
Update deploy.py ( #4850 )
2025-11-06 19:09:28 +08:00
YuBaoku
819b2dbbae
Revert "【New Feature】W4afp8 supports per group quantization ( #4272 )" ( #4854 )
...
This reverts commit 93fcf7e4ec.
2025-11-06 17:48:28 +08:00
YuBaoku
3478d20262
[CI] Add Check PR Template ( #4481 )
2025-11-06 17:41:14 +08:00
zhupengyang
b54eb7ad81
[XPU] ep+tp all2all ( #4836 )
2025-11-06 17:26:14 +08:00
Jiang-Jia-Jun
901d559aa7
Update README_CN.md
2025-11-06 17:19:22 +08:00
Jiang-Jia-Jun
0010420c56
Update README_EN.md
2025-11-06 17:19:07 +08:00
Zhang Yulong
83532e1d01
[Benchmark] Enhance benchmark output logging ( #4682 )
...
* Enhance benchmark output logging
Add print statements to display the number of discarded outputs before and after filtering.
* Update benchmark_serving.py
2025-11-06 16:53:31 +08:00
Jiang-Jia-Jun
095dada092
Add gemini for code review
2025-11-06 16:42:32 +08:00
Echo-Nie
c18b177f21
fix the get_act_fn,_load_st_projector ( #4824 )
2025-11-06 16:13:35 +08:00
Echo-Nie
e4f1267186
bug: fix list to List ( #4818 )
2025-11-06 16:13:12 +08:00
Ding
6c316286c1
fix: correct typo in nvidia_gpu.md ( #4848 )
2025-11-06 16:03:02 +08:00
Juncai
08ca0f6aea
[Feature] [PD] add simple router and refine splitwise deployment ( #4709 )
...
* add simple router and refine splitwise deployment
* fix
2025-11-06 14:56:02 +08:00
Ayakouji
831266da7a
[Fix] fix ernie4_5_vl model torch format loading ( #4447 )
...
* fix
* add test
* fix test
* fix test
* update
2025-11-06 14:34:21 +08:00
plusNew001
fc8bef2c95
[XPU][CI]Change ci vl model to 28 b ( #4764 )
...
* Update XPU_VISIBLE_DEVICES and model parameters
* Update base response and adjust max tokens
* Implement process cleanup in CI workflow
Add process cleanup commands to prevent port conflicts
* Remove process cleanup commands from CI workflow
Removed old process cleanup commands to prevent port conflicts.
2025-11-06 14:12:23 +08:00
Echo-Nie
354ddc8bc5
[CI] Add unittest for activation, native_paddle_backend, w4a8, w4afp8, platforms/utils ( #4812 )
...
* add unittest for activation, native_paddle_backend, w4a8, w4afp8, platforms/utils
* Remove activation function retrieval tests
Removed tests for valid and unsupported activation function retrieval.
* move w4a8, w4afp8 to quantization
* fix code style
2025-11-06 14:08:00 +08:00
SunLei
782818c031
fix: ci port conflict ( #4840 )
2025-11-06 11:56:17 +08:00
kxz2002
5bdd40da5d
[BugFix] Fix ernie_vl_reasoning_parsers.py 'end_token' to 'think_end_token' ( #4805 )
...
* fix ernie_vl_reasoning_parsers.py 'end_token' to 'think_end_token'
* add unit tests
2025-11-06 11:28:55 +08:00
周周周
69fa741763
remove seq_lens_this_time ( #4821 )
2025-11-06 11:06:28 +08:00
K11OntheBoat
62dfad4a5f
[PD Disaggregation] Support Qwen3-MoE use PD + EP inference. ( #4691 )
...
support Qwen-MoE PD/EP
2025-11-06 10:32:15 +08:00
YuBaoku
e8c3e20ee6
[CI] fix docker_build error and add tag-base ( #4810 )
2025-11-05 21:57:54 +08:00
yangjianfengo1
93fcf7e4ec
【New Feature】W4afp8 supports per group quantization ( #4272 )
...
* w4afp8 supports per group
* code style
* accuracy complete
* revert append attn utils
* ffn1 dynamic quantization
* ffn2 supports dynamic quantization
* code style
* code style
* update unit tests
* update unit tests
* fix bug
* Implement conditional parameter creation for layers
Add parameter creation for up_gate_proj_in_scale when ep_size > 1.
* code style
* fix conflict
* code style
* code style
* fix w4aint8 accuracy
* fix ci
---------
Co-authored-by: yuanxiaolan <yuanxiaolan01@baidu.com>
2025-11-05 21:00:23 +08:00
李泳桦
fcd2f05dff
[BugFix] fix messages being inplace modified in offline chat api ( #4831 )
2025-11-05 20:46:33 +08:00