YuBaoku
fe55baae47
[CI] Fix unit_test error of unstable execution ( #5660 )
...
* [CI] Fix unit_test error of unstable execution
2025-12-19 22:59:53 +08:00
chen
a32cb54d0b
[BugFix] Fix custom_all_reduce overflow ( #5662 )
...
* check
* check
* code style
2025-12-19 18:24:21 +08:00
MingkunZhang
46d83be065
[Metax] update ci test ( #5652 )
2025-12-19 17:25:47 +08:00
xunyoyo
669dfe8dca
[CI] 【Hackathon 9th Sprint No.38】NO.38 功能模块单测补充 ( #5060 )
...
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
* Add tests for global scheduler functionality
This file contains tests for the global scheduler, including request handling, load accounting, and response management.
* Enhance global scheduler unit tests
* Clean up comments in test_global_scheduler.py
Removed unnecessary comments and coverage instructions.
* Update tests/scheduler/test_global_scheduler.py
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
* Update tests/scheduler/test_global_scheduler.py
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
* Update tests/scheduler/test_global_scheduler.py
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
* Update tests/scheduler/test_global_scheduler.py
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
* Translate comments and docstrings to Chinese
Updated comments and docstrings in Chinese to enhance clarity for native speakers.
* Use monkeypatch for FD_ENABLE_MAX_PREFILL in tests
Replaced direct assignment with monkeypatching for FD_ENABLE_MAX_PREFILL in test cases.
* Correct test function definitions in test_global_scheduler
Fix function definitions and update test cases for scheduler.
* Fix typos in global scheduler tests
* Translate comments to English in test_global_scheduler.py
Updated comments and docstrings in test_global_scheduler.py from Chinese to English for better clarity and consistency.
---------
Co-authored-by: CSWYF3634076 <wangyafeng@baidu.com >
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
Co-authored-by: Jiaxin Sui <95567040+plusNew001@users.noreply.github.com >
2025-12-19 16:28:16 +08:00
kevin
807e404369
[BugFix] fix eb5 mm prefix cache bug ( #5638 )
...
* fix eb5 mm prefix cache bug
* update code
---------
Co-authored-by: Jiaxin Sui <95567040+plusNew001@users.noreply.github.com >
2025-12-19 14:57:37 +08:00
RichardWooSJTU
6bd772b93f
fix eplb weight updating ( #5529 )
...
Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com >
2025-12-19 14:30:32 +08:00
Yuanle Liu
689f54f671
[RL] Update worker_process.py ( #5651 )
2025-12-18 20:07:58 -08:00
fmiao2372
a8fce47195
[Intel HPU] enable kv cache scheduler v1 for hpu ( #5648 )
...
* [Intel HPU] enable kv cache scheduler v1 for hpu
* fix copilt comments
2025-12-19 12:03:39 +08:00
bukejiyu
fc452c8e29
[RL]Fix RL load_weights ( #5642 )
...
Co-authored-by: Jiaxin Sui <95567040+plusNew001@users.noreply.github.com >
2025-12-18 19:16:18 -08:00
lizan1999
ec6811f648
support token num = 0 ( #5635 )
...
Co-authored-by: lizan1999 <lizan03@baidu.com >
Co-authored-by: cmcamdy <1027740945@qq.com >
Co-authored-by: Jiaxin Sui <95567040+plusNew001@users.noreply.github.com >
2025-12-19 10:20:38 +08:00
xunyoyo
d657455616
[CI] 【Hackathon 9th Sprint No.19】NO.19 功能模块单测补充 ( #5063 )
...
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
Publish Job / publish_pre_check (push) Has been cancelled
Publish Job / print_publish_pre_check_outputs (push) Has been cancelled
Publish Job / FD-Clone-Linux (push) Has been cancelled
Publish Job / Show Code Archive Output (push) Has been cancelled
Publish Job / BUILD_SM8090 (push) Has been cancelled
Publish Job / BUILD_SM8689 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8090 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8689 (push) Has been cancelled
Publish Job / Run FD Image Build (push) Has been cancelled
Publish Job / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
Publish Job / Run FastDeploy LogProb Tests (push) Has been cancelled
Publish Job / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
Publish Job / Run Base Tests (push) Has been cancelled
Publish Job / Run Accuracy Tests (push) Has been cancelled
Publish Job / Run Stable Tests (push) Has been cancelled
CI Images Build / FD-Clone-Linux (push) Has been cancelled
CI Images Build / Show Code Archive Output (push) Has been cancelled
CI Images Build / CI Images Build (push) Has been cancelled
CI Images Build / BUILD_SM8090 (push) Has been cancelled
CI Images Build / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
CI Images Build / Run FastDeploy LogProb Tests (push) Has been cancelled
CI Images Build / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
CI Images Build / Run Base Tests (push) Has been cancelled
CI Images Build / Publish Docker Images Pre Check (push) Has been cancelled
* Add pooler unit tests
* Refine pooler tests import handling
* Refactor pooler tests to use real modules
* Clean up test_pooler.py by removing docstring
Removed unnecessary docstring and cleaned up code.
* Clean up imports in test_pooler.py
Removed unnecessary import of sys and related path adjustments.
* Update model config and clean up test code
* Update test_pooler.py
---------
Co-authored-by: CSWYF3634076 <wangyafeng@baidu.com >
2025-12-18 21:32:44 +08:00
Yuanle Liu
b47674c796
[BugFix] fix rl model_weights_signal to support tp>1 ( #5639 )
2025-12-18 04:43:58 -08:00
Jiaxin Sui
d739af5e6e
Revert "[XPU][CI] xpu add ci test for pd ( #5610 )" ( #5645 )
...
This reverts commit 80fb530ce2 .
2025-12-18 19:59:09 +08:00
bukejiyu
4aa2c6871b
[RL]Support loading weights via the load_weights function for RL ( #5549 )
...
* RL support load_weights
* fix
2025-12-18 02:27:05 -08:00
yzwu
ac013803f3
[Iluvatar] Support V1_KVCACHE_SCHEDULER and paddleocr-vl rope mode ( #5555 )
2025-12-18 02:14:25 -08:00
Zhang Yulong
48f3e9797e
Update backend_request_func.py ( #5633 )
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
2025-12-18 16:21:34 +08:00
xunyoyo
2d2619d300
[CI] 【Hackathon 9th Sprint No.36】NO.36 功能模块单测补充 (修复) ( #5609 )
...
* Implement unit tests for PrefixCacheManager
* Update prefix cache manager tests
* Handle get_all_visible_devices in prefix cache manager tests
* Add repo root to prefix cache manager tests sys.path
* Use pathlib for repo root in prefix cache manager tests
* Refine repo root Path import in tests
* Handle list-based visible device configuration
* Refine PrefixCacheManager test stubs
* Run pre-commit on prefix cache manager tests
* Remove duplicate pytest import in cache manager tests
* Add tests for visible device formatting
* Revert
* Simplify test stubs in prefix cache manager tests
* Refine PrefixCacheManager tests
* Adjust prefix cache manager tests per review
* Remove ignored tests from coverage configuration
* Make prefix cache manager tests runnable without paddle
* Use real paddle import in prefix cache manager tests
* Clean up imports in test_prefix_cache_manager.py
Removed unnecessary import of 'os' and related path manipulation.
* Update test_prefix_cache_manager.py
* Replace pid_suffix with ipc_suffix in tests
* Add local cache queue and RDMA ports to cache config
2025-12-18 16:08:42 +08:00
lizan1999
e1a9b282eb
fix bug for EP+MTP ( #5605 )
...
Co-authored-by: lizan1999 <lizan03@baidu.com >
2025-12-18 14:34:54 +08:00
Longzhi Wang
d8587e987e
[Model] tp+ep support v1_loader ( #5465 )
...
* [Model] tp+ep support v1_loader
* fix
* fix mtp_linear
* fix mtp_linear
* fix
* fix
* fix v0 loader
* fix
* Add get_tensor for ep
* fix linear weight_loader
* fix typo
* fix
2025-12-18 14:31:54 +08:00
Zhang Yulong
c89a62e550
Update backend_request_func.py ( #5631 )
2025-12-18 14:20:17 +08:00
zhupengyang
8735cb5045
[XPU] refactor moe ffn ( #5501 )
...
- remove BKCL_DISPATCH_ALL_GATHER
- support sparse mode
- support moe quant_method
2025-12-18 14:14:05 +08:00
MingkunZhang
d0a7834a17
[Metax] fix metax runner issue ( #5629 )
2025-12-17 21:32:54 -08:00
qw86972190
c606df59f5
[XPU]logprob bug ( #5626 )
2025-12-18 12:07:20 +08:00
kesmeey
d81341b9b3
[CI]【Hackathon 9th Sprint No.14】功能模块 fastdeploy/rl/rollout_model.py 单测补充 ( #5552 )
...
* Add rollout model unit tests
* test: update rl rollout_model tests
* test: fix cache_type_branches unsupported platform case
* test: fix rl rollout_model test indent
* Delete tests/spec_decode/test_mtp_proposer.py
* chore: format test_rollout_model
* chore: translate rollout test comments to English
* test: guard rollout_model import by disabling auto registry
* chore: reorder imports in rl rollout test
* test: isolate env for RL rollout tests
* style: format rollout RL tests with black
* update
* test: remove RL rollout unit tests causing collection issues
* test: add lightweight rollout_model RL unit tests
* fix(coverage): filter test file paths and handle collection failures
- Only extract real test file paths (tests/.../test_*.py) from pytest collect output
- Filter out ERROR/collecting prefixes to prevent garbage in failed_tests.log
- Add proper error handling for pytest collection failures
- Exit early if no test files can be extracted
- Preserve collection error output for debugging
* update
* style: fix code style issues in test_rollout_model.py
- Remove unused 'os' import
- Remove trailing blank lines
---------
Co-authored-by: YuBaoku <49938469+EmmonsCurse@users.noreply.github.com >
2025-12-18 10:57:53 +08:00
Zhang Yulong
f45c131ddf
update ( #5625 )
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
Publish Job / publish_pre_check (push) Has been cancelled
Publish Job / print_publish_pre_check_outputs (push) Has been cancelled
Publish Job / FD-Clone-Linux (push) Has been cancelled
Publish Job / Show Code Archive Output (push) Has been cancelled
Publish Job / BUILD_SM8090 (push) Has been cancelled
Publish Job / BUILD_SM8689 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8090 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8689 (push) Has been cancelled
Publish Job / Run FD Image Build (push) Has been cancelled
Publish Job / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
Publish Job / Run FastDeploy LogProb Tests (push) Has been cancelled
Publish Job / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
Publish Job / Run Base Tests (push) Has been cancelled
Publish Job / Run Accuracy Tests (push) Has been cancelled
Publish Job / Run Stable Tests (push) Has been cancelled
CI Images Build / FD-Clone-Linux (push) Has been cancelled
CI Images Build / Show Code Archive Output (push) Has been cancelled
CI Images Build / CI Images Build (push) Has been cancelled
CI Images Build / BUILD_SM8090 (push) Has been cancelled
CI Images Build / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
CI Images Build / Run FastDeploy LogProb Tests (push) Has been cancelled
CI Images Build / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
CI Images Build / Run Base Tests (push) Has been cancelled
CI Images Build / Publish Docker Images Pre Check (push) Has been cancelled
2025-12-17 21:38:14 +08:00
Zhang Yulong
94be5ebdd1
[CI] Add CI case for MTP accept ratio ( #5570 )
...
* Implement test for MTP accept ratio
Add test for MTP accept ratio with assertions on results and metrics.
* Update test_ernie_21b_mtp.py
* Refactor test_mtp_accept_ratio for baseline comparison
Refactor test_mtp_accept_ratio to compare results against baseline file and metrics.
* Fix formatting issues in test_ernie_21b_mtp.py
---------
Co-authored-by: YuBaoku <49938469+EmmonsCurse@users.noreply.github.com >
2025-12-17 21:35:02 +08:00
kesmeey
ac731653b3
[CI]【Hackathon 9th Sprint No.12】功能模块 fastdeploy/spec_decode/mtp.py 单测补充 ( #5533 )
...
* Add unit tests for MTPProposer class in spec_decode/mtp.py
* fix: remove non-existent QuantizationConfig import in test_mtp_proposer
* fix: add logprobs_mode attribute to FakeModelConfig
* fix: fix test failures in test_mtp_proposer - fix Mock setup, remove arrival_time, add missing keys
* fix: add seq_lens_this_time initialization and kv_cache init before insert_tasks_v1
* fix: check pos_emb_type attribute existence before assertion
* test: add minimal coverage for mtp cache type, mm init, preempted
* test: fix cache_type_branches unsupported platform on 12
* test: refine MTPProposer tests for cache type, requests and chunked prefill
* chore: remove stray spec_decode copy
2025-12-17 20:09:45 +08:00
megemini
111955ec0c
[BugFix] 移除重复的 PaddleOCRVLProcessor 初始化代码
2025-12-17 18:58:02 +08:00
fmiao2372
404cf0ece4
[Intel HPU] enable tensor_wise_fp8 ( #5324 )
...
* [Intel HPU] enable tensor_wise_fp8
* update code based on comments
* fix code style issue
* fix bug about RP 5138
* mv kv_cache modifications to HPU backend
* fix FP8 Precision Issues
* fix FP8 Precision Issues
* Add quantization UT
---------
Co-authored-by: yanfeich <yanfei.cheng@intel.com >
Co-authored-by: YuBaoku <49938469+EmmonsCurse@users.noreply.github.com >
2025-12-17 16:45:03 +08:00
freeliuzc
15f5112ecb
[Speculative Decoding]Support different inferseed in speculate decoding ( #5568 )
...
* fix mtp entropy drop in RL
* optimize usage and fix unit test
* optimize padding_sampling_params speed(vectorized)
---------
Co-authored-by: YuBaoku <49938469+EmmonsCurse@users.noreply.github.com >
2025-12-17 16:14:29 +08:00
ddchenhao66
80fb530ce2
[XPU][CI] xpu add ci test for pd ( #5610 )
...
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
Co-authored-by: ddchenhao66 <dhaochen163.com>
2025-12-17 16:07:44 +08:00
Yonghua Li
0c8c6369ed
[Feature] [PD Disaggregation] simplify configuration for pd-disaggregated deployment, and refactor post-init and usage for all ports ( #5415 )
...
* [feat] simplify configuration for pd-disaggregated deployment, and refactor post-init and usage for all ports
* [fix] fix some bugs
* [fix] fix rdma port for cache manager/messager
* [fix] temporarily cancel port availability check to see if it can pass ci test
* [feat] simplify args for multi api server
* [fix] fix dp
* [fix] fix port for xpu
* [fix] add tests for ports post processing & fix ci
* [test] fix test_multi_api_server
* [fix] fix rdma_comm_ports args for multi_api_server
* [fix] fix test_common_engine
* [fix] fix test_cache_transfer_manager
* [chore] automatically setting FD_ENABLE_MULTI_API_SERVER
* [fix] avoid api server from creating engine_args twice
* [fix] fix test_run_batch
* [fix] fix test_metrics
* [fix] fix splitwise connector init
* [test] add test_rdma_transfer and test_expert_service
* [fix] fix code syntax
* [fix] fix test_rdma_transfer and build wheel with rdma script
2025-12-17 15:50:42 +08:00
Yuanle Liu
cdc0004894
Revert "[Feature] add ue8m0 for per_token_quant_fp8 ( #5563 )" ( #5611 )
...
This reverts commit 73e1d6aa90 .
2025-12-17 13:59:06 +08:00
YuBaoku
21fa2baa51
[CI] disable test_prefix_cache_manager.py in unit_test
2025-12-17 10:48:02 +08:00
周周周
e29b005520
[Others] Clean code && remove GPU sync code ( #5548 )
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
Publish Job / publish_pre_check (push) Has been cancelled
Publish Job / print_publish_pre_check_outputs (push) Has been cancelled
Publish Job / FD-Clone-Linux (push) Has been cancelled
Publish Job / Show Code Archive Output (push) Has been cancelled
Publish Job / BUILD_SM8090 (push) Has been cancelled
Publish Job / BUILD_SM8689 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8090 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8689 (push) Has been cancelled
Publish Job / Run FD Image Build (push) Has been cancelled
Publish Job / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
Publish Job / Run FastDeploy LogProb Tests (push) Has been cancelled
Publish Job / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
Publish Job / Run Base Tests (push) Has been cancelled
Publish Job / Run Accuracy Tests (push) Has been cancelled
Publish Job / Run Stable Tests (push) Has been cancelled
CI Images Build / FD-Clone-Linux (push) Has been cancelled
CI Images Build / Show Code Archive Output (push) Has been cancelled
CI Images Build / CI Images Build (push) Has been cancelled
CI Images Build / BUILD_SM8090 (push) Has been cancelled
CI Images Build / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
CI Images Build / Run FastDeploy LogProb Tests (push) Has been cancelled
CI Images Build / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
CI Images Build / Run Base Tests (push) Has been cancelled
CI Images Build / Publish Docker Images Pre Check (push) Has been cancelled
2025-12-16 21:09:37 +08:00
Yuanle Liu
867803ae10
[BugFix] fix speculate_limit_thinking_content_length ( #5590 )
...
* fix speculate_limit_thinking_content_length
* update
2025-12-16 04:31:45 -08:00
kevin
7140939c51
[BugFix] fix video bug ( #5557 )
...
* fix video bug
* add eb5 moe model
2025-12-16 20:06:50 +08:00
chen
27ef3610b5
support glm fa3 ( #5586 )
2025-12-16 19:33:27 +08:00
Jiang-Jia-Jun
2ad3bff4ff
[Optim] Optimize costtime in checking tasks in engine-worker-queue ( #5580 )
...
* [Optim] Optimize costtime in checking tasks in engine-worker-queue
* Update fastdeploy/engine/common_engine.py
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
---------
Co-authored-by: Jiang-Jia-Jun <jiangjiajun@baidu.com >
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
2025-12-16 19:27:31 +08:00
xunyoyo
55609a51fc
[CI] 【Hackathon 9th Sprint No.36】NO.36 功能模块单测补充 ( #5058 )
...
* Implement unit tests for PrefixCacheManager
* Update prefix cache manager tests
* Handle get_all_visible_devices in prefix cache manager tests
* Add repo root to prefix cache manager tests sys.path
* Use pathlib for repo root in prefix cache manager tests
* Refine repo root Path import in tests
* Handle list-based visible device configuration
* Refine PrefixCacheManager test stubs
* Run pre-commit on prefix cache manager tests
* Remove duplicate pytest import in cache manager tests
* Add tests for visible device formatting
* Revert
* Simplify test stubs in prefix cache manager tests
* Refine PrefixCacheManager tests
* Adjust prefix cache manager tests per review
2025-12-16 19:19:03 +08:00
fxyfxy777
73e1d6aa90
[Feature] add ue8m0 for per_token_quant_fp8 ( #5563 )
...
* ue8m0
* add default arg
---------
Co-authored-by: YuBaoku <49938469+EmmonsCurse@users.noreply.github.com >
2025-12-16 18:40:12 +08:00
Yonghua Li
eeb99d2af5
[BugFix] skip model executing after clearing/updating is done ( #5527 )
...
* [fix] fix ep loop
* [fix] another try
* [fix] again
2025-12-16 17:39:03 +08:00
RAM
6fc5eccf83
[RL] R3 Support RDMA Store ( #5467 )
...
* [RL] R3 support rdma store
* refine notes
* refine code
* disable prefix cache
* support preempted task and put cpu tensor
2025-12-16 16:50:13 +08:00
xiaolei373
a30b4da260
[Feature] Tracing: Fine-Grained Tracing for Request Latency Part1 ( #5458 )
2025-12-16 16:36:09 +08:00
kevin
c9b47f90ce
[BugFix] fix cpu prefix cache bug ( #5544 )
...
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
* fix_dy_c8_bug
* add block_num check
* fix test case
* update ci case
2025-12-16 14:21:42 +08:00
YuBaoku
5d2b16e6f3
[CI] Remove test_metrics.py due to incompatible forced merge ( #5578 )
...
* [CI] Remove test_metrics.py due to incompatible forced merge
2025-12-16 14:04:46 +08:00
Jiang-Jia-Jun
021399f7c9
Revert "[Feature] Use paddle.compat.enable_torch_proxy in `fastdeploy/__ini…" ( #5579 )
...
This reverts commit ff45ac078e .
2025-12-16 13:55:27 +08:00
Echo-Nie
50100f98d7
[Feature] Support fusedmoe on Blackwell ( #5325 )
...
* update sm100
* fix
* fix style
2025-12-16 11:58:50 +08:00
YuBaoku
63fff8df70
[CI] Adapt vl_model baseline changes due to Paddle update ( #5576 )
2025-12-16 11:42:31 +08:00
gaoziyuan
5db08cc1d5
【NewFeature】support load fp8 weight ( #5565 )
2025-12-16 11:23:57 +08:00