liyonghua0910
|
e4a6710e4f
|
Merge remote-tracking branch 'upstream/develop' into develop+clear_prefix_cache
|
2025-09-17 21:35:04 +08:00 |
|
YuBaoku
|
2745f37017
|
[CI] enhance clean port and add waiting time (#4152)
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
Publish Job / publish_pre_check (push) Has been cancelled
Publish Job / print_publish_pre_check_outputs (push) Has been cancelled
Publish Job / FD-Clone-Linux (push) Has been cancelled
Publish Job / Show Code Archive Output (push) Has been cancelled
Publish Job / BUILD_SM8090 (push) Has been cancelled
Publish Job / BUILD_SM8689 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8090 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8689 (push) Has been cancelled
Publish Job / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
Publish Job / Run FastDeploy LogProb Tests (push) Has been cancelled
Publish Job / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
Publish Job / Run Base Tests (push) Has been cancelled
Publish Job / Run Accuracy Tests (push) Has been cancelled
Publish Job / Run Stable Tests (push) Has been cancelled
CI Images Build / FD-Clone-Linux (push) Has been cancelled
CI Images Build / Show Code Archive Output (push) Has been cancelled
CI Images Build / CI Images Build (push) Has been cancelled
CI Images Build / BUILD_SM8090 (push) Has been cancelled
CI Images Build / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
CI Images Build / Run FastDeploy LogProb Tests (push) Has been cancelled
CI Images Build / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
CI Images Build / Run Base Tests (push) Has been cancelled
CI Images Build / Run Accuracy Tests (push) Has been cancelled
CI Images Build / Run Stable Tests (push) Has been cancelled
CI Images Build / Publish Docker Images Pre Check (push) Has been cancelled
|
2025-09-17 20:31:49 +08:00 |
|
gaoziyuan
|
896e3bb606
|
[NewFeture]add ep rollout model init and update/clear ep buffer (#4039)
* fix gid
* merge
* fix test
* fix bug
* fix
* fix ci
|
2025-09-17 20:24:53 +08:00 |
|
YuanRisheng
|
0d3a57a2c6
|
fix unittest (#4155)
|
2025-09-17 20:20:26 +08:00 |
|
qw86972190
|
b52971749c
|
Print KV Cache available memory and block memory usage in GB format (#4148)
|
2025-09-17 20:01:55 +08:00 |
|
liyonghua0910
|
c9cfd09939
|
[fix] fix code style
|
2025-09-17 19:23:19 +08:00 |
|
liyonghua0910
|
38d09a82c0
|
Merge remote-tracking branch 'upstream/develop' into develop+clear_prefix_cache
|
2025-09-17 19:10:29 +08:00 |
|
liyonghua0910
|
f62c4feeb8
|
[chore] add preemption triggered info log
|
2025-09-17 18:55:59 +08:00 |
|
liyonghua0910
|
33f209086b
|
[fix] fix clear/update lock not working when workers > 1
|
2025-09-17 18:53:58 +08:00 |
|
liyonghua0910
|
cfa0982aae
|
[fix] fix ep group all-reduce
|
2025-09-17 18:05:34 +08:00 |
|
RichardWooSJTU
|
2adca04f1f
|
Reconstruct streaming data transfer with zmq (#3836)
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
* reconstruct USE_GET_SAVE_OUTPUT_V1
* fix ut
* use dp rank
* fix ci
|
2025-09-17 14:30:39 +08:00 |
|
Jiang-Jia-Jun
|
f9766f917b
|
[BugFix] Forbiden FD_DISABLED_RECOVER while ENABLE_V1_KVCACHE_SCHEDULER (#4142)
Co-authored-by: Jiang-Jia-Jun <jiangjiajun@baidu.com>
|
2025-09-17 14:11:44 +08:00 |
|
liyonghua0910
|
c33e362932
|
[fix] fix key/value_cache_scales indent
|
2025-09-17 11:42:51 +08:00 |
|
YuanRisheng
|
2e9e53ff7e
|
[FDConfig]Remove max_num_batched_tokens/max_num_seqs in parallel config (#4116)
* remove max_num_batched_tokens in parallel config
* remove max_num_seqs
* update test case
* fix test
* fix
---------
Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com>
|
2025-09-17 10:43:35 +08:00 |
|
YUNSHEN XIE
|
c01a756912
|
mv test to tests (#4129)
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
Publish Job / publish_pre_check (push) Has been cancelled
Publish Job / print_publish_pre_check_outputs (push) Has been cancelled
Publish Job / FD-Clone-Linux (push) Has been cancelled
Publish Job / Show Code Archive Output (push) Has been cancelled
Publish Job / BUILD_SM8090 (push) Has been cancelled
Publish Job / BUILD_SM8689 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8090 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8689 (push) Has been cancelled
Publish Job / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
Publish Job / Run FastDeploy LogProb Tests (push) Has been cancelled
Publish Job / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
Publish Job / Run Base Tests (push) Has been cancelled
Publish Job / Run Accuracy Tests (push) Has been cancelled
Publish Job / Run Stable Tests (push) Has been cancelled
CI Images Build / FD-Clone-Linux (push) Has been cancelled
CI Images Build / Show Code Archive Output (push) Has been cancelled
CI Images Build / CI Images Build (push) Has been cancelled
CI Images Build / BUILD_SM8090 (push) Has been cancelled
CI Images Build / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
CI Images Build / Run FastDeploy LogProb Tests (push) Has been cancelled
CI Images Build / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
CI Images Build / Run Base Tests (push) Has been cancelled
CI Images Build / Run Accuracy Tests (push) Has been cancelled
CI Images Build / Run Stable Tests (push) Has been cancelled
CI Images Build / Publish Docker Images Pre Check (push) Has been cancelled
|
2025-09-16 20:45:40 +08:00 |
|
Zhang Yulong
|
cd09913552
|
Update test_w4a8_model.py (#4125)
|
2025-09-16 20:43:10 +08:00 |
|
chenjian
|
67e6d8c691
|
[Feature] Set prefix caching as default (#3814)
* Set prefix caching as default
* Set prefix caching as default
* Set prefix caching as default
* skip dynamic load scene
* fix kill bug
* fix kill bug
* fix kill bug
* fix
* fix
* fix ci
|
2025-09-16 20:34:27 +08:00 |
|
Yuan Xiaolan
|
de8638b1e9
|
fix dynamic Cfp8 computing error (#4119)
Co-authored-by: YuBaoku <49938469+EmmonsCurse@users.noreply.github.com>
|
2025-09-16 20:21:49 +08:00 |
|
YUNSHEN XIE
|
4f8901489c
|
ci: Increase compilation task time limit (#4098)
* ci: Increase compilation task time limit
* update
* update
* rename
* update
* update
|
2025-09-16 20:05:45 +08:00 |
|
tianlef
|
e79a1a7938
|
x1_a3b config (#4135)
|
2025-09-16 19:44:46 +08:00 |
|
xiegegege
|
d682c97dd3
|
[benchmark]add lite-vl and x1 yaml (#4130)
|
2025-09-16 16:38:36 +08:00 |
|
Divano
|
8e49d99009
|
Addcase (#4112)
logprob 没跑,不影响,增加校验openai 异常情况下 错误输出格式字段的case
|
2025-09-16 16:12:14 +08:00 |
|
tianlef
|
83bf1fd5aa
|
[Doc]add plas attention config (#4128)
|
2025-09-16 15:55:12 +08:00 |
|
co63oc
|
b70ca35c0b
|
【Hackathon 9th No.52】add test_dynamic_per_token_scaled_fp8_quant (#4015)
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
* add test_dynamic_per_token_scaled_fp8_quant
* fix
* add bfloat16
* ci
|
2025-09-16 14:11:29 +08:00 |
|
Echo-Nie
|
befe463f01
|
【Hackathon 9th No.37】add test_top_k_renorm_probs (#3755)
* add test_top_k_renorm_probs.py
* add size=2,3
|
2025-09-16 11:12:46 +08:00 |
|
Sunny-bot1
|
442543cd6b
|
fix ep wint8 (#4102)
|
2025-09-16 11:05:33 +08:00 |
|
Yuanle Liu
|
ed2dcec829
|
add ignore=all for deepgemm (#4118)
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
Publish Job / publish_pre_check (push) Has been cancelled
Publish Job / print_publish_pre_check_outputs (push) Has been cancelled
Publish Job / FD-Clone-Linux (push) Has been cancelled
Publish Job / Show Code Archive Output (push) Has been cancelled
Publish Job / BUILD_SM8090 (push) Has been cancelled
Publish Job / BUILD_SM8689 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8090 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8689 (push) Has been cancelled
Publish Job / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
Publish Job / Run FastDeploy LogProb Tests (push) Has been cancelled
Publish Job / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
Publish Job / Run Base Tests (push) Has been cancelled
Publish Job / Run Accuracy Tests (push) Has been cancelled
Publish Job / Run Stable Tests (push) Has been cancelled
CI Images Build / FD-Clone-Linux (push) Has been cancelled
CI Images Build / Show Code Archive Output (push) Has been cancelled
CI Images Build / CI Images Build (push) Has been cancelled
CI Images Build / BUILD_SM8090 (push) Has been cancelled
CI Images Build / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
CI Images Build / Run FastDeploy LogProb Tests (push) Has been cancelled
CI Images Build / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
CI Images Build / Run Base Tests (push) Has been cancelled
CI Images Build / Run Accuracy Tests (push) Has been cancelled
CI Images Build / Run Stable Tests (push) Has been cancelled
CI Images Build / Publish Docker Images Pre Check (push) Has been cancelled
|
2025-09-15 21:52:00 +08:00 |
|
Jiang-Jia-Jun
|
a04365a0c7
|
Update api_server.py
|
2025-09-15 21:31:33 +08:00 |
|
YuanRisheng
|
03b3d6175d
|
fix mtp (#4105)
|
2025-09-15 20:26:07 +08:00 |
|
co63oc
|
17a27170bc
|
fix typos (#4093)
|
2025-09-15 18:33:30 +08:00 |
|
bukejiyu
|
113e330030
|
fix bf16 and add comments (#4106)
|
2025-09-15 17:23:07 +08:00 |
|
freeliuzc
|
69aa2781a1
|
[MTP]Support mtp reshard (#4099)
* support rl reshard
* modify model name
|
2025-09-15 17:13:53 +08:00 |
|
freeliuzc
|
46911f903d
|
[MTP]update hybrid-mtp-with-ngram (#4047)
|
2025-09-15 17:13:31 +08:00 |
|
Yuanle Liu
|
b1b33211e8
|
[CUDAGraph] Support multi output buffers and merge some fixes from feature/exp_0908 (#4062)
* refine cudagraph
* refine cudagraph
* typo
* fix
* fix plugins
* fix
* update
* update
* update
|
2025-09-15 16:21:30 +08:00 |
|
zhupengyang
|
9409665713
|
[xpu] support ep (#4067)
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
|
2025-09-15 13:53:11 +08:00 |
|
bukejiyu
|
29ed617f0f
|
[v1 loader]qwen Offline fp8 (#4036)
* support offline fp8
* update ut
* update ut
* update ut
* fix
* update
* update
|
2025-09-15 13:44:11 +08:00 |
|
Sunny-bot1
|
b1a5b756a3
|
[Optimize] Support WINT8 and group scale for Machete (#3905)
|
2025-09-15 12:01:34 +08:00 |
|
Echo-Nie
|
4408dc7f67
|
【Hackathon 9th No.49】add test_pre_cache_len_concat (#3847)
* add test_pre_cache_len_concat
* fix according review, add ref_pre_cache_len_concat
|
2025-09-15 11:20:14 +08:00 |
|
co63oc
|
ef4a1aa2da
|
【Hackathon 9th No.61、65】add test_draft_model_update (#3940)
* add draft_model_update test
* fix
* fix
* fix
* fix
* fix
|
2025-09-15 11:19:50 +08:00 |
|
Zero Rains
|
f213ae1e86
|
[Bug Fix]fix the bug for cache_messager signal loss (#3879)
* fix the bug for real size 0 in cudagraph
* fix cache_messager
|
2025-09-15 11:16:24 +08:00 |
|
李泳桦
|
1411415816
|
Merge branch 'develop' into develop+clear_prefix_cache
|
2025-09-15 10:54:24 +08:00 |
|
qwes5s5
|
553adb299e
|
【FastDeploy CLI】collect-env subcommand (#4044)
* collect-env subcommand
* trigger ci
---------
Co-authored-by: K11OntheBoat <your_email@example.com>
|
2025-09-15 10:31:23 +08:00 |
|
zhouchong
|
958abebeab
|
Support offline inference with streaming output (#4071)
* Support offline inference with streaming output
* add unit test
---------
Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com>
|
2025-09-15 10:27:03 +08:00 |
|
liyonghua0910
|
94a55fc158
|
[fix] fix prefix caching not enabled
|
2025-09-12 21:43:37 +08:00 |
|
liyonghua0910
|
c7b8f4f8c6
|
[fix] fix ipc suffix, use port instead
|
2025-09-12 21:41:59 +08:00 |
|
liyonghua0910
|
013338f2f5
|
[feat] support clearing prefix cache (cherry-picked from release/2.1)
|
2025-09-12 21:41:55 +08:00 |
|
YUNSHEN XIE
|
4871f18dad
|
fix(CE): update concurrency to stop CE tasks from canceling each other (#4083)
CE Compile Job / ce_job_pre_check (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Publish Job / publish_pre_check (push) Has been cancelled
Publish Job / print_publish_pre_check_outputs (push) Has been cancelled
Publish Job / FD-Clone-Linux (push) Has been cancelled
Publish Job / Show Code Archive Output (push) Has been cancelled
Publish Job / BUILD_SM8090 (push) Has been cancelled
Publish Job / BUILD_SM8689 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8090 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8689 (push) Has been cancelled
Publish Job / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
Publish Job / Run FastDeploy LogProb Tests (push) Has been cancelled
Publish Job / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
Publish Job / Run Base Tests (push) Has been cancelled
Publish Job / Run Accuracy Tests (push) Has been cancelled
Publish Job / Run Stable Tests (push) Has been cancelled
CI Images Build / FD-Clone-Linux (push) Has been cancelled
CI Images Build / Show Code Archive Output (push) Has been cancelled
CI Images Build / CI Images Build (push) Has been cancelled
CI Images Build / BUILD_SM8090 (push) Has been cancelled
CI Images Build / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
CI Images Build / Run FastDeploy LogProb Tests (push) Has been cancelled
CI Images Build / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
CI Images Build / Run Base Tests (push) Has been cancelled
CI Images Build / Run Accuracy Tests (push) Has been cancelled
CI Images Build / Run Stable Tests (push) Has been cancelled
CI Images Build / Publish Docker Images Pre Check (push) Has been cancelled
|
2025-09-12 19:16:26 +08:00 |
|
Ayakouji
|
987609c894
|
[BugFix] Fix image_feature 0-Size causing insert failed (#4042)
* update
* fix image_feature
|
2025-09-12 19:13:08 +08:00 |
|
xiaolei373
|
9ac539471d
|
[format] Valid para format error info (#4035)
* feat(log):add_request_and_response_log
* 报错信息与OpenAI对齐
|
2025-09-12 19:05:17 +08:00 |
|
YuanRisheng
|
88ea565aba
|
[BugFix]Fix load kv cache quant scale (#4077)
* fix kv cache
* fix kv_cache
* fix kv cache
|
2025-09-12 17:44:03 +08:00 |
|