YuBaoku
fe55baae47
[CI] Fix unit_test error of unstable execution ( #5660 )
...
* [CI] Fix unit_test error of unstable execution
2025-12-19 22:59:53 +08:00
MingkunZhang
46d83be065
[Metax] update ci test ( #5652 )
2025-12-19 17:25:47 +08:00
xunyoyo
669dfe8dca
[CI] 【Hackathon 9th Sprint No.38】NO.38 功能模块单测补充 ( #5060 )
...
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
* Add tests for global scheduler functionality
This file contains tests for the global scheduler, including request handling, load accounting, and response management.
* Enhance global scheduler unit tests
* Clean up comments in test_global_scheduler.py
Removed unnecessary comments and coverage instructions.
* Update tests/scheduler/test_global_scheduler.py
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
* Update tests/scheduler/test_global_scheduler.py
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
* Update tests/scheduler/test_global_scheduler.py
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
* Update tests/scheduler/test_global_scheduler.py
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
* Translate comments and docstrings to Chinese
Updated comments and docstrings in Chinese to enhance clarity for native speakers.
* Use monkeypatch for FD_ENABLE_MAX_PREFILL in tests
Replaced direct assignment with monkeypatching for FD_ENABLE_MAX_PREFILL in test cases.
* Correct test function definitions in test_global_scheduler
Fix function definitions and update test cases for scheduler.
* Fix typos in global scheduler tests
* Translate comments to English in test_global_scheduler.py
Updated comments and docstrings in test_global_scheduler.py from Chinese to English for better clarity and consistency.
---------
Co-authored-by: CSWYF3634076 <wangyafeng@baidu.com >
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
Co-authored-by: Jiaxin Sui <95567040+plusNew001@users.noreply.github.com >
2025-12-19 16:28:16 +08:00
xunyoyo
d657455616
[CI] 【Hackathon 9th Sprint No.19】NO.19 功能模块单测补充 ( #5063 )
...
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
Publish Job / publish_pre_check (push) Has been cancelled
Publish Job / print_publish_pre_check_outputs (push) Has been cancelled
Publish Job / FD-Clone-Linux (push) Has been cancelled
Publish Job / Show Code Archive Output (push) Has been cancelled
Publish Job / BUILD_SM8090 (push) Has been cancelled
Publish Job / BUILD_SM8689 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8090 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8689 (push) Has been cancelled
Publish Job / Run FD Image Build (push) Has been cancelled
Publish Job / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
Publish Job / Run FastDeploy LogProb Tests (push) Has been cancelled
Publish Job / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
Publish Job / Run Base Tests (push) Has been cancelled
Publish Job / Run Accuracy Tests (push) Has been cancelled
Publish Job / Run Stable Tests (push) Has been cancelled
CI Images Build / FD-Clone-Linux (push) Has been cancelled
CI Images Build / Show Code Archive Output (push) Has been cancelled
CI Images Build / CI Images Build (push) Has been cancelled
CI Images Build / BUILD_SM8090 (push) Has been cancelled
CI Images Build / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
CI Images Build / Run FastDeploy LogProb Tests (push) Has been cancelled
CI Images Build / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
CI Images Build / Run Base Tests (push) Has been cancelled
CI Images Build / Publish Docker Images Pre Check (push) Has been cancelled
* Add pooler unit tests
* Refine pooler tests import handling
* Refactor pooler tests to use real modules
* Clean up test_pooler.py by removing docstring
Removed unnecessary docstring and cleaned up code.
* Clean up imports in test_pooler.py
Removed unnecessary import of sys and related path adjustments.
* Update model config and clean up test code
* Update test_pooler.py
---------
Co-authored-by: CSWYF3634076 <wangyafeng@baidu.com >
2025-12-18 21:32:44 +08:00
Jiaxin Sui
d739af5e6e
Revert "[XPU][CI] xpu add ci test for pd ( #5610 )" ( #5645 )
...
This reverts commit 80fb530ce2 .
2025-12-18 19:59:09 +08:00
yzwu
ac013803f3
[Iluvatar] Support V1_KVCACHE_SCHEDULER and paddleocr-vl rope mode ( #5555 )
2025-12-18 02:14:25 -08:00
xunyoyo
2d2619d300
[CI] 【Hackathon 9th Sprint No.36】NO.36 功能模块单测补充 (修复) ( #5609 )
...
* Implement unit tests for PrefixCacheManager
* Update prefix cache manager tests
* Handle get_all_visible_devices in prefix cache manager tests
* Add repo root to prefix cache manager tests sys.path
* Use pathlib for repo root in prefix cache manager tests
* Refine repo root Path import in tests
* Handle list-based visible device configuration
* Refine PrefixCacheManager test stubs
* Run pre-commit on prefix cache manager tests
* Remove duplicate pytest import in cache manager tests
* Add tests for visible device formatting
* Revert
* Simplify test stubs in prefix cache manager tests
* Refine PrefixCacheManager tests
* Adjust prefix cache manager tests per review
* Remove ignored tests from coverage configuration
* Make prefix cache manager tests runnable without paddle
* Use real paddle import in prefix cache manager tests
* Clean up imports in test_prefix_cache_manager.py
Removed unnecessary import of 'os' and related path manipulation.
* Update test_prefix_cache_manager.py
* Replace pid_suffix with ipc_suffix in tests
* Add local cache queue and RDMA ports to cache config
2025-12-18 16:08:42 +08:00
kesmeey
d81341b9b3
[CI]【Hackathon 9th Sprint No.14】功能模块 fastdeploy/rl/rollout_model.py 单测补充 ( #5552 )
...
* Add rollout model unit tests
* test: update rl rollout_model tests
* test: fix cache_type_branches unsupported platform case
* test: fix rl rollout_model test indent
* Delete tests/spec_decode/test_mtp_proposer.py
* chore: format test_rollout_model
* chore: translate rollout test comments to English
* test: guard rollout_model import by disabling auto registry
* chore: reorder imports in rl rollout test
* test: isolate env for RL rollout tests
* style: format rollout RL tests with black
* update
* test: remove RL rollout unit tests causing collection issues
* test: add lightweight rollout_model RL unit tests
* fix(coverage): filter test file paths and handle collection failures
- Only extract real test file paths (tests/.../test_*.py) from pytest collect output
- Filter out ERROR/collecting prefixes to prevent garbage in failed_tests.log
- Add proper error handling for pytest collection failures
- Exit early if no test files can be extracted
- Preserve collection error output for debugging
* update
* style: fix code style issues in test_rollout_model.py
- Remove unused 'os' import
- Remove trailing blank lines
---------
Co-authored-by: YuBaoku <49938469+EmmonsCurse@users.noreply.github.com >
2025-12-18 10:57:53 +08:00
Zhang Yulong
94be5ebdd1
[CI] Add CI case for MTP accept ratio ( #5570 )
...
* Implement test for MTP accept ratio
Add test for MTP accept ratio with assertions on results and metrics.
* Update test_ernie_21b_mtp.py
* Refactor test_mtp_accept_ratio for baseline comparison
Refactor test_mtp_accept_ratio to compare results against baseline file and metrics.
* Fix formatting issues in test_ernie_21b_mtp.py
---------
Co-authored-by: YuBaoku <49938469+EmmonsCurse@users.noreply.github.com >
2025-12-17 21:35:02 +08:00
kesmeey
ac731653b3
[CI]【Hackathon 9th Sprint No.12】功能模块 fastdeploy/spec_decode/mtp.py 单测补充 ( #5533 )
...
* Add unit tests for MTPProposer class in spec_decode/mtp.py
* fix: remove non-existent QuantizationConfig import in test_mtp_proposer
* fix: add logprobs_mode attribute to FakeModelConfig
* fix: fix test failures in test_mtp_proposer - fix Mock setup, remove arrival_time, add missing keys
* fix: add seq_lens_this_time initialization and kv_cache init before insert_tasks_v1
* fix: check pos_emb_type attribute existence before assertion
* test: add minimal coverage for mtp cache type, mm init, preempted
* test: fix cache_type_branches unsupported platform on 12
* test: refine MTPProposer tests for cache type, requests and chunked prefill
* chore: remove stray spec_decode copy
2025-12-17 20:09:45 +08:00
fmiao2372
404cf0ece4
[Intel HPU] enable tensor_wise_fp8 ( #5324 )
...
* [Intel HPU] enable tensor_wise_fp8
* update code based on comments
* fix code style issue
* fix bug about RP 5138
* mv kv_cache modifications to HPU backend
* fix FP8 Precision Issues
* fix FP8 Precision Issues
* Add quantization UT
---------
Co-authored-by: yanfeich <yanfei.cheng@intel.com >
Co-authored-by: YuBaoku <49938469+EmmonsCurse@users.noreply.github.com >
2025-12-17 16:45:03 +08:00
freeliuzc
15f5112ecb
[Speculative Decoding]Support different inferseed in speculate decoding ( #5568 )
...
* fix mtp entropy drop in RL
* optimize usage and fix unit test
* optimize padding_sampling_params speed(vectorized)
---------
Co-authored-by: YuBaoku <49938469+EmmonsCurse@users.noreply.github.com >
2025-12-17 16:14:29 +08:00
ddchenhao66
80fb530ce2
[XPU][CI] xpu add ci test for pd ( #5610 )
...
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
Co-authored-by: ddchenhao66 <dhaochen163.com>
2025-12-17 16:07:44 +08:00
Yonghua Li
0c8c6369ed
[Feature] [PD Disaggregation] simplify configuration for pd-disaggregated deployment, and refactor post-init and usage for all ports ( #5415 )
...
* [feat] simplify configuration for pd-disaggregated deployment, and refactor post-init and usage for all ports
* [fix] fix some bugs
* [fix] fix rdma port for cache manager/messager
* [fix] temporarily cancel port availability check to see if it can pass ci test
* [feat] simplify args for multi api server
* [fix] fix dp
* [fix] fix port for xpu
* [fix] add tests for ports post processing & fix ci
* [test] fix test_multi_api_server
* [fix] fix rdma_comm_ports args for multi_api_server
* [fix] fix test_common_engine
* [fix] fix test_cache_transfer_manager
* [chore] automatically setting FD_ENABLE_MULTI_API_SERVER
* [fix] avoid api server from creating engine_args twice
* [fix] fix test_run_batch
* [fix] fix test_metrics
* [fix] fix splitwise connector init
* [test] add test_rdma_transfer and test_expert_service
* [fix] fix code syntax
* [fix] fix test_rdma_transfer and build wheel with rdma script
2025-12-17 15:50:42 +08:00
Yuanle Liu
cdc0004894
Revert "[Feature] add ue8m0 for per_token_quant_fp8 ( #5563 )" ( #5611 )
...
This reverts commit 73e1d6aa90 .
2025-12-17 13:59:06 +08:00
YuBaoku
21fa2baa51
[CI] disable test_prefix_cache_manager.py in unit_test
2025-12-17 10:48:02 +08:00
周周周
e29b005520
[Others] Clean code && remove GPU sync code ( #5548 )
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
Publish Job / publish_pre_check (push) Has been cancelled
Publish Job / print_publish_pre_check_outputs (push) Has been cancelled
Publish Job / FD-Clone-Linux (push) Has been cancelled
Publish Job / Show Code Archive Output (push) Has been cancelled
Publish Job / BUILD_SM8090 (push) Has been cancelled
Publish Job / BUILD_SM8689 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8090 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8689 (push) Has been cancelled
Publish Job / Run FD Image Build (push) Has been cancelled
Publish Job / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
Publish Job / Run FastDeploy LogProb Tests (push) Has been cancelled
Publish Job / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
Publish Job / Run Base Tests (push) Has been cancelled
Publish Job / Run Accuracy Tests (push) Has been cancelled
Publish Job / Run Stable Tests (push) Has been cancelled
CI Images Build / FD-Clone-Linux (push) Has been cancelled
CI Images Build / Show Code Archive Output (push) Has been cancelled
CI Images Build / CI Images Build (push) Has been cancelled
CI Images Build / BUILD_SM8090 (push) Has been cancelled
CI Images Build / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
CI Images Build / Run FastDeploy LogProb Tests (push) Has been cancelled
CI Images Build / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
CI Images Build / Run Base Tests (push) Has been cancelled
CI Images Build / Publish Docker Images Pre Check (push) Has been cancelled
2025-12-16 21:09:37 +08:00
Yuanle Liu
867803ae10
[BugFix] fix speculate_limit_thinking_content_length ( #5590 )
...
* fix speculate_limit_thinking_content_length
* update
2025-12-16 04:31:45 -08:00
xunyoyo
55609a51fc
[CI] 【Hackathon 9th Sprint No.36】NO.36 功能模块单测补充 ( #5058 )
...
* Implement unit tests for PrefixCacheManager
* Update prefix cache manager tests
* Handle get_all_visible_devices in prefix cache manager tests
* Add repo root to prefix cache manager tests sys.path
* Use pathlib for repo root in prefix cache manager tests
* Refine repo root Path import in tests
* Handle list-based visible device configuration
* Refine PrefixCacheManager test stubs
* Run pre-commit on prefix cache manager tests
* Remove duplicate pytest import in cache manager tests
* Add tests for visible device formatting
* Revert
* Simplify test stubs in prefix cache manager tests
* Refine PrefixCacheManager tests
* Adjust prefix cache manager tests per review
2025-12-16 19:19:03 +08:00
fxyfxy777
73e1d6aa90
[Feature] add ue8m0 for per_token_quant_fp8 ( #5563 )
...
* ue8m0
* add default arg
---------
Co-authored-by: YuBaoku <49938469+EmmonsCurse@users.noreply.github.com >
2025-12-16 18:40:12 +08:00
xiaolei373
a30b4da260
[Feature] Tracing: Fine-Grained Tracing for Request Latency Part1 ( #5458 )
2025-12-16 16:36:09 +08:00
kevin
c9b47f90ce
[BugFix] fix cpu prefix cache bug ( #5544 )
...
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
* fix_dy_c8_bug
* add block_num check
* fix test case
* update ci case
2025-12-16 14:21:42 +08:00
YuBaoku
5d2b16e6f3
[CI] Remove test_metrics.py due to incompatible forced merge ( #5578 )
...
* [CI] Remove test_metrics.py due to incompatible forced merge
2025-12-16 14:04:46 +08:00
YuBaoku
63fff8df70
[CI] Adapt vl_model baseline changes due to Paddle update ( #5576 )
2025-12-16 11:42:31 +08:00
xunyoyo
9e8c46c526
[CI] 【Hackathon 9th Sprint No.34】NO.34 功能模块单测补充 ( #5057 )
...
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
Publish Job / publish_pre_check (push) Has been cancelled
Publish Job / print_publish_pre_check_outputs (push) Has been cancelled
Publish Job / FD-Clone-Linux (push) Has been cancelled
Publish Job / Show Code Archive Output (push) Has been cancelled
Publish Job / BUILD_SM8090 (push) Has been cancelled
Publish Job / BUILD_SM8689 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8090 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8689 (push) Has been cancelled
Publish Job / Run FD Image Build (push) Has been cancelled
Publish Job / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
Publish Job / Run FastDeploy LogProb Tests (push) Has been cancelled
Publish Job / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
Publish Job / Run Base Tests (push) Has been cancelled
Publish Job / Run Accuracy Tests (push) Has been cancelled
Publish Job / Run Stable Tests (push) Has been cancelled
CI Images Build / FD-Clone-Linux (push) Has been cancelled
CI Images Build / Show Code Archive Output (push) Has been cancelled
CI Images Build / CI Images Build (push) Has been cancelled
CI Images Build / BUILD_SM8090 (push) Has been cancelled
CI Images Build / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
CI Images Build / Run FastDeploy LogProb Tests (push) Has been cancelled
CI Images Build / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
CI Images Build / Run Base Tests (push) Has been cancelled
CI Images Build / Publish Docker Images Pre Check (push) Has been cancelled
* Add unit tests for SplitWiseScheduler module
* Add info and ping to fake redis client for tests
* Document fake redis metadata methods in tests
* Enhance splitwise scheduler tests
* Clean up test_splitwise_scheduler.py
Removed copyright notice and documentation comments.
* Simplify splitwise scheduler test stubs
* Refine splitwise scheduler tests
* Handle empty result keys with restored sleep
---------
Co-authored-by: CSWYF3634076 <wangyafeng@baidu.com >
2025-12-15 20:29:25 +08:00
kesmeey
4bd991aa17
[CI]【Hackathon 9th Sprint No.22】功能模块 fastdeploy/input/ernie4_5_vl_processor/ernie4_5_vl_processor.py 单测补充 ( #5263 )
...
* test: improve ernie4_5_vl_processor.py test coverage
* update
* improve coverage
* update
* fix: correct test expectation for thinking_mode false in test_ernie_vl_processor
* remove test_process_request_dict_comprehensive test case
---------
Co-authored-by: CSWYF3634076 <wangyafeng@baidu.com >
2025-12-15 14:00:53 +08:00
Ryan
d01cb274d6
[Graph Optimization][CI] Add ERNIE45T 21B sot test ( #5538 )
CE Compile Job / ce_job_pre_check (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Publish Job / publish_pre_check (push) Has been cancelled
Publish Job / print_publish_pre_check_outputs (push) Has been cancelled
Publish Job / FD-Clone-Linux (push) Has been cancelled
Publish Job / Show Code Archive Output (push) Has been cancelled
Publish Job / BUILD_SM8090 (push) Has been cancelled
Publish Job / BUILD_SM8689 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8090 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8689 (push) Has been cancelled
Publish Job / Run FD Image Build (push) Has been cancelled
Publish Job / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
Publish Job / Run FastDeploy LogProb Tests (push) Has been cancelled
Publish Job / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
Publish Job / Run Base Tests (push) Has been cancelled
Publish Job / Run Accuracy Tests (push) Has been cancelled
Publish Job / Run Stable Tests (push) Has been cancelled
CI Images Build / FD-Clone-Linux (push) Has been cancelled
CI Images Build / Show Code Archive Output (push) Has been cancelled
CI Images Build / CI Images Build (push) Has been cancelled
CI Images Build / BUILD_SM8090 (push) Has been cancelled
CI Images Build / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
CI Images Build / Run FastDeploy LogProb Tests (push) Has been cancelled
CI Images Build / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
CI Images Build / Run Base Tests (push) Has been cancelled
CI Images Build / Publish Docker Images Pre Check (push) Has been cancelled
2025-12-13 00:43:15 +08:00
Jiaxin Sui
92119773c7
[CI][XPU] add mtp case ( #5537 )
...
* add mtp case
* Refactor test_mtp.py for clarity and efficiency
Removed duplicate import of json and simplified spec_config formatting.
---------
Co-authored-by: root <root@gajl-bbc-onlinec-com-1511972.gajl.baidu.com >
2025-12-12 19:14:40 +08:00
MingkunZhang
f32e331ef5
[Metax] add ci yaml ( #5520 )
...
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
Co-authored-by: Jiaxin Sui <95567040+plusNew001@users.noreply.github.com >
2025-12-12 13:35:38 +08:00
kesmeey
8d477e3d01
[CI]【Hackathon 9th Sprint No.25】功能模块 fastdeploy/input/ernie4_5_vl_processor/image_preprocessor/image_preprocessor_adaptive.py 单测补充 ( #5265 )
...
* test: add unit tests for image_preprocessor_adaptive.py (NO.25)
* refactor: merge redundant test functions in test_image_preprocessor_adaptive.py
* fix: fix codestyle issues - remove extra blank lines
* update
* update
---------
Co-authored-by: YuBaoku <49938469+EmmonsCurse@users.noreply.github.com >
Co-authored-by: CSWYF3634076 <wangyafeng@baidu.com >
2025-12-12 12:45:06 +08:00
GoldPancake
909059c60a
[Feature] Support for request-level speculative decoding metrics monitoring. ( #5518 )
...
* support spec metrics monitor per request
* fix bug
* remove debug log
* fix ut bugs
2025-12-12 12:22:18 +08:00
kevin
954a145d57
[Optimization] support mm prefill batch ( #5313 )
...
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
* support mm prefill batch
* update code
* update code
* update code
* update code
* fix encoder cache bug
* update code
* update code
* fix bug
* fix paddle ocr bug
* fix xpu bug
* update code
2025-12-11 22:21:14 +08:00
YuBaoku
7116982995
[CI] Reduce timeout of send_request in test_mtp ( #5512 )
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
Publish Job / publish_pre_check (push) Has been cancelled
Publish Job / print_publish_pre_check_outputs (push) Has been cancelled
Publish Job / FD-Clone-Linux (push) Has been cancelled
Publish Job / Show Code Archive Output (push) Has been cancelled
Publish Job / BUILD_SM8090 (push) Has been cancelled
Publish Job / BUILD_SM8689 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8090 (push) Has been cancelled
Publish Job / PADDLE_PYPI_UPLOAD_8689 (push) Has been cancelled
Publish Job / Run FD Image Build (push) Has been cancelled
Publish Job / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
Publish Job / Run FastDeploy LogProb Tests (push) Has been cancelled
Publish Job / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
Publish Job / Run Base Tests (push) Has been cancelled
Publish Job / Run Accuracy Tests (push) Has been cancelled
Publish Job / Run Stable Tests (push) Has been cancelled
CI Images Build / FD-Clone-Linux (push) Has been cancelled
CI Images Build / Show Code Archive Output (push) Has been cancelled
CI Images Build / CI Images Build (push) Has been cancelled
CI Images Build / BUILD_SM8090 (push) Has been cancelled
CI Images Build / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled
CI Images Build / Run FastDeploy LogProb Tests (push) Has been cancelled
CI Images Build / Extracted partial CE model tasks to run in CI. (push) Has been cancelled
CI Images Build / Run Base Tests (push) Has been cancelled
CI Images Build / Publish Docker Images Pre Check (push) Has been cancelled
2025-12-11 20:40:00 +08:00
Ryan
e58fed3665
[Graph Optimization][BugFix][CI] Fix 0size bug && add unitest ( #5495 )
2025-12-11 16:25:26 +08:00
YuBaoku
9f4512c932
[CI] disable test_cuda_graph_dynamic_subgraph.py in unit_test
2025-12-11 14:12:49 +08:00
qwes5s5
d79438bb86
add detoken switch ( #5463 )
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
2025-12-10 21:44:02 +08:00
zccjjj
03819f30c3
[CI][XPU] ep+prefix cache+chunk prefill ( #5489 )
2025-12-10 19:39:49 +08:00
luukunn
fbc9bce1e9
[Feature]Optimization of Thinking Pattern Framework ( #4302 )
...
* add model status in vl
* add x1 parser
* add model_status
* fix parser
* fix parser
* fix parser
* fix parser
* Revert "fix parser"
This reverts commit 300f446d8a .
* fix parser
* fix
* fix
* fix
* fix
* fix parser
* fix unit test
* fix unit test
* add unit test
* fix
* fix
* add unit test
* fix unit test
* add unit test
* add unit test
* fix unit test
* fix unit test
* fix bug
* fix unit test
* x1 tool parser
* fix unit test
* fix unit test
* fix unit test
* fix n
* fix unit test
* add unit test
* add unit test
* remove pring
2025-12-10 16:17:06 +08:00
ming1753
9e15191cce
[BugFix] fix audio end bug ( #5464 )
2025-12-10 13:37:26 +08:00
Echo-Nie
1b1bfab341
[CI] Add unittest ( #5328 )
...
* add test_worker_eplb
* remove tesnsor_wise_fp8
* add copyright
2025-12-09 19:19:42 +08:00
lizexu123
95eab9f9ee
[Feature] support stop_token_ids ( #5399 )
...
* support stop_token_ids
* fix
* delete chinese
* support both
* delete print
2025-12-09 17:49:12 +08:00
Haonan Luo
e397c4fba6
[Others] remove add_bias option ( #5425 )
2025-12-09 17:39:35 +08:00
lizexu123
b0cf2c4b7a
[Feature] Support prefill batch inference for pooling models. ( #5436 )
...
* fix multi-inputs
* fix threshold
* fix threshold
* fix
* support multi-batch
* add tests
* fix test
* test
* fix
2025-12-09 16:21:00 +08:00
Juncai
83ea9646f9
[PD Disaggregation] Unify the disaggregation info and the pd communication ( #5438 )
...
* Unify the disaggregation info and the pd communication
* up
* up
* fix
* fix conflict
* fix unittest
2025-12-09 14:44:59 +08:00
Nyakku Shigure
e1c4a12e34
[Graph Optimization][CINN] Use CINN in PaddleOCR-VL ViT part ( #5223 )
...
---------
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
2025-12-09 14:37:00 +08:00
K11OntheBoat
8d99bac532
Remove CUDA ERROR 9 of inputs of get_padding_offset kernel ( #5440 )
...
Co-authored-by: K11OntheBoat <“ruianmaidanglao@163.com ”>
2025-12-09 14:17:30 +08:00
kevin
f7e832efaf
[BugFix] fix mm cudagraph ( #5266 )
...
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
* fix mm cudagraph
* fix test_prompt_ids bug
* update code
* update ci code
* update ci code
* update ci code
2025-12-09 11:51:00 +08:00
zhouchong
5d9b5e4a5b
[Engine] [Feature] Refactor async_llm:cross-process with EngineService,based on zmq communication ( #4868 )
...
* Refactor async_llm:cross-process with EngineService
* fix: async_llm output process
* fix: return prompt_token_ids and prompt_tokens in first res
* optimize common_engine start func
2025-12-09 10:53:40 +08:00
SunLei
5fb93d84f5
[Feature] [Benchmark]: add ZMQ-based FMQ implementation and benchmark tools ( #5418 )
...
* feat(fmq): add ZMQ-based FMQ implementation and benchmark tools
* move FMQ_CONFIG_JSON to envs
* fix top_p_candidates (#5400 )
Co-authored-by: freeliuzc <lzc842650834@gmail.com >
* [RL] Support Rollout Routing Replay (#5321 )
* [RL] Support Rollout Routing Replay
* add routing indices cache
* fix config bug and moe forward bug
* R3 Support GLM
* support eb4.5
* fix merge bug
* Apply suggestion from @Copilot
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
* Apply suggestion from @Copilot
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
* Apply suggestion from @Copilot
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
* Apply suggestion from @Copilot
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
* add routing replay ci
* support glm topk
* support orther top_k
* fix ci bug
* pre-commit
* only support chatcmpl
---------
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
Co-authored-by: Yuanle Liu <yuanlehome@163.com >
* [Bug fix] Fix the multi-input accuracy issue in the pooling model. (#5374 )
* fix multi-inputs
* fix threshold
* fix threshold
* fix
* [BugFix]remove _execute_empty_input (#5396 )
* Revert "[RL] Support Rollout Routing Replay (#5321 )" (#5402 )
This reverts commit 96d2d4877b .
* [New][RL] Support Rollout Routing Replay (#5405 )
* [RL] Support Rollout Routing Replay
* add routing indices cache
* fix config bug and moe forward bug
* R3 Support GLM
* support eb4.5
* fix merge bug
* Apply suggestion from @Copilot
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
* Apply suggestion from @Copilot
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
* Apply suggestion from @Copilot
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
* Apply suggestion from @Copilot
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
* add routing replay ci
* support glm topk
* support orther top_k
* fix ci bug
* pre-commit
* only support chatcmpl
* Revert "Revert "[RL] Support Rollout Routing Replay (#5321 )" (#5402 )"
This reverts commit c45e064f3d .
* Fix XPU and NPU bug
---------
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
Co-authored-by: Yuanle Liu <yuanlehome@163.com >
* bf16 deepseek (#5379 )
* fix deepseek (#5410 )
* Update tests/inter_communicator/test_fmq_factory.py
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
* Update benchmarks/benchmark_fmq.py
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
* Update fastdeploy/inter_communicator/fmq.py
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
---------
Co-authored-by: GoldPancake <56388518+Deleter-D@users.noreply.github.com >
Co-authored-by: freeliuzc <lzc842650834@gmail.com >
Co-authored-by: RAM <gstian5555@outlook.com >
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
Co-authored-by: Yuanle Liu <yuanlehome@163.com >
Co-authored-by: lizexu123 <39205361+lizexu123@users.noreply.github.com >
Co-authored-by: 周周周 <39978853+zhoutianzi666@users.noreply.github.com >
Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com >
Co-authored-by: bukejiyu <52310069+bukejiyu@users.noreply.github.com >
2025-12-08 22:04:49 +08:00
kesmeey
d1bd40d44c
[CI]【Hackathon 9th Sprint Example NO 16】功能模块 fastdeploy/input/ernie4_5_vl_processor/process.py 单测补充 ( #5264 )
...
* test: add unit tests for process.py (NO.16)
* update
* update filename
* update filename
* update
* update
* fix failed testcases
* simplify the code
---------
Co-authored-by: YuBaoku <49938469+EmmonsCurse@users.noreply.github.com >
2025-12-08 14:30:15 +08:00