周周周
876e4a8935
remove input_ids from ForwardMeta (#4793)
2025-11-05 11:55:51 +08:00
Ryan
f42ed6d5f2
[Graph Optimization] Add dy_runnable and introduce cudagraph_switch_threshold for cudagraph mode switching (#4578)
...
* add new branch for sot
* reorder
* fix batch bug
2025-10-24 18:36:52 +08:00
Ryan
36af88ff3f
[BugFix][CI] Clean up SOT code cache using tearDown in CINN unitest (#4491)
...
* fix CINN BUG
* 1e-3 -> 1e-2
2025-10-20 20:45:00 +08:00
Ryan
6160145f82
[SOT] Change warnings to errors and remove fallback operations (#4378)
...
* Change warnings to errors and remove fallback operations
* fix unitest
* fix codestyle
2025-10-17 11:27:04 +08:00
YuanRisheng
a2ec2c4152
[FDConfig]Remove max_model_len in FDConfig (#4350)
...
* modify max_model_len
* fix unittest
* fix unittest
---------
Co-authored-by: root <root@yqlcc01-sys-rpm12rzmwjd.yqlcc01.baidu.com>
2025-10-11 14:04:17 +08:00
YuanRisheng
24180fba0a
[FDConfig]Remove splitwise_role and engine_worker_queue_port in FDConfig (#4147)
...
* remove splitwise_role and engine_worker_queue_port
* fix xpu
* fix xpu
* fix xpu
* fix unittest
* resolve conflict
2025-09-19 17:01:52 +08:00
YuanRisheng
2e9e53ff7e
[FDConfig]Remove max_num_batched_tokens/max_num_seqs in parallel config (#4116)
...
* remove max_num_batched_tokens in parallel config
* remove max_num_seqs
* update test case
* fix test
* fix
---------
Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com>
2025-09-17 10:43:35 +08:00
ooo oo
2396e49f9e
【Hackathon 9th No.73】add unit tests for graph_opt_backend (#3609)
...
* test: add unit tests for graph_opt_backend
* refactor(tests): improve graph optimization test structure and readability
* fix(tests): correct CUDA graph related typos in test files
- Fix class name: TestCUDAGrpahSubgraph -> TestCUDAGraphSubgraph
* refactor(test): support attention layer and optimize graph optimization backend test to eliminate redundant baseline calculations
* remove some func call
---------
Co-authored-by: RAM <gstian5555@outlook.com>
Co-authored-by: Tao Luo <luotao02@baidu.com>
2025-09-03 11:18:00 +08:00