Commit Graph

4209 Commits

Author SHA1 Message Date
lizan1999
ec6811f648 support token num = 0 (#5635)
Co-authored-by: lizan1999 <lizan03@baidu.com>
Co-authored-by: cmcamdy <1027740945@qq.com>
Co-authored-by: Jiaxin Sui <95567040+plusNew001@users.noreply.github.com>
2025-12-19 10:20:38 +08:00
xunyoyo
d657455616 [CI] 【Hackathon 9th Sprint No.19】NO.19 功能模块单测补充 (#5063)
Some checks failed
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
Publish Job / publish_pre_check (push) Has been cancelled
Publish Job / print_publish_pre_check_outputs (push) Has been cancelled
Publish Job / FD-Clone-Linux (push) Has been cancelled
Publish Job / Show Code Archive Output (push) Has been cancelled
Publish Job / BUILD_SM8090 (push) Has been cancelled
Publish Job / BUILD_SM8689 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8090 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8689 (push) Has been cancelled
Publish Job / Run FD Image Build (push) Has been cancelled
Publish Job / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
Publish Job / Run FastDeploy LogProb Tests (push) Has been cancelled
Publish Job / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
Publish Job / Run Base Tests (push) Has been cancelled
Publish Job / Run Accuracy Tests (push) Has been cancelled
Publish Job / Run Stable Tests (push) Has been cancelled
CI Images Build / FD-Clone-Linux (push) Has been cancelled
CI Images Build / Show Code Archive Output (push) Has been cancelled
CI Images Build / CI Images Build (push) Has been cancelled
CI Images Build / BUILD_SM8090 (push) Has been cancelled
CI Images Build / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
CI Images Build / Run FastDeploy LogProb Tests (push) Has been cancelled
CI Images Build / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
CI Images Build / Run Base Tests (push) Has been cancelled
CI Images Build / Publish Docker Images Pre Check (push) Has been cancelled
* Add pooler unit tests

* Refine pooler tests import handling

* Refactor pooler tests to use real modules

* Clean up test_pooler.py by removing docstring

Removed unnecessary docstring and cleaned up code.

* Clean up imports in test_pooler.py

Removed unnecessary import of sys and related path adjustments.

* Update model config and clean up test code

* Update test_pooler.py

---------

Co-authored-by: CSWYF3634076 <wangyafeng@baidu.com>
2025-12-18 21:32:44 +08:00
Yuanle Liu
b47674c796 [BugFix] fix rl model_weights_signal to support tp>1 (#5639) 2025-12-18 04:43:58 -08:00
Jiaxin Sui
d739af5e6e Revert "[XPU][CI] xpu add ci test for pd (#5610)" (#5645)
This reverts commit 80fb530ce2.
2025-12-18 19:59:09 +08:00
bukejiyu
4aa2c6871b [RL]Support loading weights via the load_weights function for RL (#5549)
* RL support load_weights

* fix
2025-12-18 02:27:05 -08:00
yzwu
ac013803f3 [Iluvatar] Support V1_KVCACHE_SCHEDULER and paddleocr-vl rope mode (#5555) 2025-12-18 02:14:25 -08:00
Zhang Yulong
48f3e9797e Update backend_request_func.py (#5633)
Some checks failed
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
2025-12-18 16:21:34 +08:00
xunyoyo
2d2619d300 [CI] 【Hackathon 9th Sprint No.36】NO.36 功能模块单测补充 (修复) (#5609)
* Implement unit tests for PrefixCacheManager

* Update prefix cache manager tests

* Handle get_all_visible_devices in prefix cache manager tests

* Add repo root to prefix cache manager tests sys.path

* Use pathlib for repo root in prefix cache manager tests

* Refine repo root Path import in tests

* Handle list-based visible device configuration

* Refine PrefixCacheManager test stubs

* Run pre-commit on prefix cache manager tests

* Remove duplicate pytest import in cache manager tests

* Add tests for visible device formatting

* Revert

* Simplify test stubs in prefix cache manager tests

* Refine PrefixCacheManager tests

* Adjust prefix cache manager tests per review

* Remove ignored tests from coverage configuration

* Make prefix cache manager tests runnable without paddle

* Use real paddle import in prefix cache manager tests

* Clean up imports in test_prefix_cache_manager.py

Removed unnecessary import of 'os' and related path manipulation.

* Update test_prefix_cache_manager.py

* Replace pid_suffix with ipc_suffix in tests

* Add local cache queue and RDMA ports to cache config
2025-12-18 16:08:42 +08:00
lizan1999
e1a9b282eb fix bug for EP+MTP (#5605)
Co-authored-by: lizan1999 <lizan03@baidu.com>
2025-12-18 14:34:54 +08:00
Longzhi Wang
d8587e987e [Model] tp+ep support v1_loader (#5465)
* [Model] tp+ep support v1_loader

* fix

* fix mtp_linear

* fix mtp_linear

* fix

* fix

* fix v0 loader

* fix

* Add get_tensor for ep

* fix linear weight_loader

* fix typo

* fix
2025-12-18 14:31:54 +08:00
Zhang Yulong
c89a62e550 Update backend_request_func.py (#5631) 2025-12-18 14:20:17 +08:00
zhupengyang
8735cb5045 [XPU] refactor moe ffn (#5501)
- remove BKCL_DISPATCH_ALL_GATHER
- support sparse mode
- support moe quant_method
2025-12-18 14:14:05 +08:00
MingkunZhang
d0a7834a17 [Metax] fix metax runner issue (#5629) 2025-12-17 21:32:54 -08:00
qw86972190
c606df59f5 [XPU]logprob bug (#5626) 2025-12-18 12:07:20 +08:00
kesmeey
d81341b9b3 [CI]【Hackathon 9th Sprint No.14】功能模块 fastdeploy/rl/rollout_model.py 单测补充 (#5552)
* Add rollout model unit tests

* test: update rl rollout_model tests

* test: fix cache_type_branches unsupported platform case

* test: fix rl rollout_model test indent

* Delete tests/spec_decode/test_mtp_proposer.py

* chore: format test_rollout_model

* chore: translate rollout test comments to English

* test: guard rollout_model import by disabling auto registry

* chore: reorder imports in rl rollout test

* test: isolate env for RL rollout tests

* style: format rollout RL tests with black

* update

* test: remove RL rollout unit tests causing collection issues

* test: add lightweight rollout_model RL unit tests

* fix(coverage): filter test file paths and handle collection failures

- Only extract real test file paths (tests/.../test_*.py) from pytest collect output

- Filter out ERROR/collecting prefixes to prevent garbage in failed_tests.log

- Add proper error handling for pytest collection failures

- Exit early if no test files can be extracted

- Preserve collection error output for debugging

* update

* style: fix code style issues in test_rollout_model.py

- Remove unused 'os' import

- Remove trailing blank lines

---------

Co-authored-by: YuBaoku <49938469+EmmonsCurse@users.noreply.github.com>
2025-12-18 10:57:53 +08:00
Zhang Yulong
f45c131ddf update (#5625)
Some checks failed
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
Publish Job / publish_pre_check (push) Has been cancelled
Publish Job / print_publish_pre_check_outputs (push) Has been cancelled
Publish Job / FD-Clone-Linux (push) Has been cancelled
Publish Job / Show Code Archive Output (push) Has been cancelled
Publish Job / BUILD_SM8090 (push) Has been cancelled
Publish Job / BUILD_SM8689 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8090 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8689 (push) Has been cancelled
Publish Job / Run FD Image Build (push) Has been cancelled
Publish Job / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
Publish Job / Run FastDeploy LogProb Tests (push) Has been cancelled
Publish Job / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
Publish Job / Run Base Tests (push) Has been cancelled
Publish Job / Run Accuracy Tests (push) Has been cancelled
Publish Job / Run Stable Tests (push) Has been cancelled
CI Images Build / FD-Clone-Linux (push) Has been cancelled
CI Images Build / Show Code Archive Output (push) Has been cancelled
CI Images Build / CI Images Build (push) Has been cancelled
CI Images Build / BUILD_SM8090 (push) Has been cancelled
CI Images Build / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
CI Images Build / Run FastDeploy LogProb Tests (push) Has been cancelled
CI Images Build / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
CI Images Build / Run Base Tests (push) Has been cancelled
CI Images Build / Publish Docker Images Pre Check (push) Has been cancelled
2025-12-17 21:38:14 +08:00
Zhang Yulong
94be5ebdd1 [CI] Add CI case for MTP accept ratio (#5570)
* Implement test for MTP accept ratio

Add test for MTP accept ratio with assertions on results and metrics.

* Update test_ernie_21b_mtp.py

* Refactor test_mtp_accept_ratio for baseline comparison

Refactor test_mtp_accept_ratio to compare results against baseline file and metrics.

* Fix formatting issues in test_ernie_21b_mtp.py

---------

Co-authored-by: YuBaoku <49938469+EmmonsCurse@users.noreply.github.com>
2025-12-17 21:35:02 +08:00
kesmeey
ac731653b3 [CI]【Hackathon 9th Sprint No.12】功能模块 fastdeploy/spec_decode/mtp.py 单测补充 (#5533)
* Add unit tests for MTPProposer class in spec_decode/mtp.py

* fix: remove non-existent QuantizationConfig import in test_mtp_proposer

* fix: add logprobs_mode attribute to FakeModelConfig

* fix: fix test failures in test_mtp_proposer - fix Mock setup, remove arrival_time, add missing keys

* fix: add seq_lens_this_time initialization and kv_cache init before insert_tasks_v1

* fix: check pos_emb_type attribute existence before assertion

* test: add minimal coverage for mtp cache type, mm init, preempted

* test: fix cache_type_branches unsupported platform on 12

* test: refine MTPProposer tests for cache type, requests and chunked prefill

* chore: remove stray spec_decode copy
2025-12-17 20:09:45 +08:00
megemini
111955ec0c [BugFix] 移除重复的 PaddleOCRVLProcessor 初始化代码 2025-12-17 18:58:02 +08:00
fmiao2372
404cf0ece4 [Intel HPU] enable tensor_wise_fp8 (#5324)
* [Intel HPU] enable tensor_wise_fp8

* update code based on comments

* fix code style issue

* fix bug about RP 5138

* mv kv_cache modifications to HPU backend

* fix FP8 Precision Issues

* fix FP8 Precision Issues

* Add quantization UT

---------

Co-authored-by: yanfeich <yanfei.cheng@intel.com>
Co-authored-by: YuBaoku <49938469+EmmonsCurse@users.noreply.github.com>
2025-12-17 16:45:03 +08:00
freeliuzc
15f5112ecb [Speculative Decoding]Support different inferseed in speculate decoding (#5568)
* fix mtp entropy drop in RL

* optimize usage and fix unit test

* optimize padding_sampling_params speed(vectorized)

---------

Co-authored-by: YuBaoku <49938469+EmmonsCurse@users.noreply.github.com>
2025-12-17 16:14:29 +08:00
ddchenhao66
80fb530ce2 [XPU][CI] xpu add ci test for pd (#5610)
Some checks failed
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
Co-authored-by: ddchenhao66 <dhaochen163.com>
2025-12-17 16:07:44 +08:00
Yonghua Li
0c8c6369ed [Feature] [PD Disaggregation] simplify configuration for pd-disaggregated deployment, and refactor post-init and usage for all ports (#5415)
* [feat] simplify configuration for pd-disaggregated deployment, and refactor post-init and usage for all ports

* [fix] fix some bugs

* [fix] fix rdma port for cache manager/messager

* [fix] temporarily cancel port availability check to see if it can pass ci test

* [feat] simplify args for multi api server

* [fix] fix dp

* [fix] fix port for xpu

* [fix] add tests for ports post processing & fix ci

* [test] fix test_multi_api_server

* [fix] fix rdma_comm_ports args for multi_api_server

* [fix] fix test_common_engine

* [fix] fix test_cache_transfer_manager

* [chore] automatically setting FD_ENABLE_MULTI_API_SERVER

* [fix] avoid api server from creating engine_args twice

* [fix] fix test_run_batch

* [fix] fix test_metrics

* [fix] fix splitwise connector init

* [test] add test_rdma_transfer and test_expert_service

* [fix] fix code syntax

* [fix] fix test_rdma_transfer and build wheel with rdma script
2025-12-17 15:50:42 +08:00
Yuanle Liu
cdc0004894 Revert "[Feature] add ue8m0 for per_token_quant_fp8 (#5563)" (#5611)
This reverts commit 73e1d6aa90.
2025-12-17 13:59:06 +08:00
YuBaoku
21fa2baa51 [CI] disable test_prefix_cache_manager.py in unit_test 2025-12-17 10:48:02 +08:00
周周周
e29b005520 [Others] Clean code && remove GPU sync code (#5548)
Some checks failed
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
Publish Job / publish_pre_check (push) Has been cancelled
Publish Job / print_publish_pre_check_outputs (push) Has been cancelled
Publish Job / FD-Clone-Linux (push) Has been cancelled
Publish Job / Show Code Archive Output (push) Has been cancelled
Publish Job / BUILD_SM8090 (push) Has been cancelled
Publish Job / BUILD_SM8689 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8090 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8689 (push) Has been cancelled
Publish Job / Run FD Image Build (push) Has been cancelled
Publish Job / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
Publish Job / Run FastDeploy LogProb Tests (push) Has been cancelled
Publish Job / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
Publish Job / Run Base Tests (push) Has been cancelled
Publish Job / Run Accuracy Tests (push) Has been cancelled
Publish Job / Run Stable Tests (push) Has been cancelled
CI Images Build / FD-Clone-Linux (push) Has been cancelled
CI Images Build / Show Code Archive Output (push) Has been cancelled
CI Images Build / CI Images Build (push) Has been cancelled
CI Images Build / BUILD_SM8090 (push) Has been cancelled
CI Images Build / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
CI Images Build / Run FastDeploy LogProb Tests (push) Has been cancelled
CI Images Build / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
CI Images Build / Run Base Tests (push) Has been cancelled
CI Images Build / Publish Docker Images Pre Check (push) Has been cancelled
2025-12-16 21:09:37 +08:00
Yuanle Liu
867803ae10 [BugFix] fix speculate_limit_thinking_content_length (#5590)
* fix speculate_limit_thinking_content_length

* update
2025-12-16 04:31:45 -08:00
kevin
7140939c51 [BugFix] fix video bug (#5557)
* fix video bug

* add eb5 moe model
2025-12-16 20:06:50 +08:00
chen
27ef3610b5 support glm fa3 (#5586) 2025-12-16 19:33:27 +08:00
Jiang-Jia-Jun
2ad3bff4ff [Optim] Optimize costtime in checking tasks in engine-worker-queue (#5580)
* [Optim] Optimize costtime in checking tasks in engine-worker-queue

* Update fastdeploy/engine/common_engine.py

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

---------

Co-authored-by: Jiang-Jia-Jun <jiangjiajun@baidu.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
2025-12-16 19:27:31 +08:00
xunyoyo
55609a51fc [CI] 【Hackathon 9th Sprint No.36】NO.36 功能模块单测补充 (#5058)
* Implement unit tests for PrefixCacheManager

* Update prefix cache manager tests

* Handle get_all_visible_devices in prefix cache manager tests

* Add repo root to prefix cache manager tests sys.path

* Use pathlib for repo root in prefix cache manager tests

* Refine repo root Path import in tests

* Handle list-based visible device configuration

* Refine PrefixCacheManager test stubs

* Run pre-commit on prefix cache manager tests

* Remove duplicate pytest import in cache manager tests

* Add tests for visible device formatting

* Revert

* Simplify test stubs in prefix cache manager tests

* Refine PrefixCacheManager tests

* Adjust prefix cache manager tests per review
2025-12-16 19:19:03 +08:00
fxyfxy777
73e1d6aa90 [Feature] add ue8m0 for per_token_quant_fp8 (#5563)
* ue8m0

* add default arg

---------

Co-authored-by: YuBaoku <49938469+EmmonsCurse@users.noreply.github.com>
2025-12-16 18:40:12 +08:00
Yonghua Li
eeb99d2af5 [BugFix] skip model executing after clearing/updating is done (#5527)
* [fix] fix ep loop

* [fix] another try

* [fix] again
2025-12-16 17:39:03 +08:00
RAM
6fc5eccf83 [RL] R3 Support RDMA Store (#5467)
* [RL] R3 support rdma store

* refine notes

* refine code

* disable prefix cache

* support preempted task and put cpu tensor
2025-12-16 16:50:13 +08:00
xiaolei373
a30b4da260 [Feature] Tracing: Fine-Grained Tracing for Request Latency Part1 (#5458) 2025-12-16 16:36:09 +08:00
kevin
c9b47f90ce [BugFix] fix cpu prefix cache bug (#5544)
Some checks failed
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
* fix_dy_c8_bug

* add block_num check

* fix test case

* update ci case
2025-12-16 14:21:42 +08:00
YuBaoku
5d2b16e6f3 [CI] Remove test_metrics.py due to incompatible forced merge (#5578)
* [CI] Remove test_metrics.py due to incompatible forced merge
2025-12-16 14:04:46 +08:00
Jiang-Jia-Jun
021399f7c9 Revert "[Feature] Use paddle.compat.enable_torch_proxy in `fastdeploy/__ini…" (#5579)
This reverts commit ff45ac078e.
2025-12-16 13:55:27 +08:00
Echo-Nie
50100f98d7 [Feature] Support fusedmoe on Blackwell (#5325)
* update sm100

* fix

* fix style
2025-12-16 11:58:50 +08:00
YuBaoku
63fff8df70 [CI] Adapt vl_model baseline changes due to Paddle update (#5576) 2025-12-16 11:42:31 +08:00
gaoziyuan
5db08cc1d5 【NewFeature】support load fp8 weight (#5565) 2025-12-16 11:23:57 +08:00
Jiang-Jia-Jun
8b6395478a Revert "[BugFix] reschedule_preempt_task append waiting & PREEMPTED blocksize…" (#5575)
This reverts commit dbedb0797b.
2025-12-16 11:12:57 +08:00
Jiang-Jia-Jun
9058cc712d Update gpu_model_runner.py 2025-12-16 11:12:07 +08:00
Jiang-Jia-Jun
075bd71272 Remove GPUMemoryChecker initialization
Removed memory checker initialization from GPU model runner.
2025-12-16 11:09:27 +08:00
Jundong Liu
ff45ac078e [Feature] Use paddle.compat.enable_torch_proxy in fastdeploy/__init__.py (#5211)
* test feature

* fix xgrammar

* fix paddleformer

* try whitelist

* manual patch PaddlePaddle/Paddle#76706 for test

* remove triton version

* add comment

* Update scripts/run_ci_xpu.sh

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

* remove use-triton-in-paddle in requirement.txt

---------

Co-authored-by: Nyakku Shigure <sigure.qaq@gmail.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
2025-12-16 11:05:30 +08:00
xunyoyo
9e8c46c526 [CI] 【Hackathon 9th Sprint No.34】NO.34 功能模块单测补充 (#5057)
Some checks failed
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
Publish Job / publish_pre_check (push) Has been cancelled
Publish Job / print_publish_pre_check_outputs (push) Has been cancelled
Publish Job / FD-Clone-Linux (push) Has been cancelled
Publish Job / Show Code Archive Output (push) Has been cancelled
Publish Job / BUILD_SM8090 (push) Has been cancelled
Publish Job / BUILD_SM8689 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8090 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8689 (push) Has been cancelled
Publish Job / Run FD Image Build (push) Has been cancelled
Publish Job / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
Publish Job / Run FastDeploy LogProb Tests (push) Has been cancelled
Publish Job / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
Publish Job / Run Base Tests (push) Has been cancelled
Publish Job / Run Accuracy Tests (push) Has been cancelled
Publish Job / Run Stable Tests (push) Has been cancelled
CI Images Build / FD-Clone-Linux (push) Has been cancelled
CI Images Build / Show Code Archive Output (push) Has been cancelled
CI Images Build / CI Images Build (push) Has been cancelled
CI Images Build / BUILD_SM8090 (push) Has been cancelled
CI Images Build / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
CI Images Build / Run FastDeploy LogProb Tests (push) Has been cancelled
CI Images Build / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
CI Images Build / Run Base Tests (push) Has been cancelled
CI Images Build / Publish Docker Images Pre Check (push) Has been cancelled
* Add unit tests for SplitWiseScheduler module

* Add info and ping to fake redis client for tests

* Document fake redis metadata methods in tests

* Enhance splitwise scheduler tests

* Clean up test_splitwise_scheduler.py

Removed copyright notice and documentation comments.

* Simplify splitwise scheduler test stubs

* Refine splitwise scheduler tests

* Handle empty result keys with restored sleep

---------

Co-authored-by: CSWYF3634076 <wangyafeng@baidu.com>
2025-12-15 20:29:25 +08:00
Yuanle Liu
b8e4828373 [BugFix] fix dynamic c8 in v1 loader (#5562) 2025-12-15 04:07:54 -08:00
freeliuzc
532f9ba227 [BugFix][Speculative Decoding](Spend many dyas to solve)Fix write qknorm cache bug in speculative decoding (#5491)
* [liuzichang spend 10 dyas]fix write qknorm cache bug

* fix 'fix cachekv bug''
2025-12-15 18:27:11 +08:00
MingkunZhang
5265d844e9 [Metax] fix GetStopFlagsMulti kernel crash issue (#5556) 2025-12-15 01:56:20 -08:00
chenjian
0100ee885f Fix bug for caching output when preempted (#5502) 2025-12-15 17:25:35 +08:00