FastDeploy/custom_ops/gpu_ops
RAM 920df5be5a
[Graph Optimization][Speculative Decoding] Fix the bug of CUDAGraph + MTP + EP (#4430)
* Fix MTP dummy run bug

* Use the same flag for the target model and the draft model

* Avoid MoE bug in CUDAGraph padding

* In MTP, replace use_cudagraph with step_use_cudagraph
2025-10-17 14:22:05 +08:00