Files
FastDeploy/custom_ops/gpu_ops/append_attn
Jundong Liu 1e4968e810
Some checks failed
Deploy GitHub Pages / deploy (push) Has been cancelled
[Excutor] Fixed the issue of CUDA graph execution failure caused by different branches during decoding (#3223)
* 彻底解决解码切块问题

* update C8 and C4 kernel

* fix problem

* fix with pre-commit

* retain branch for mtp
2025-08-09 07:37:19 +08:00
..
2025-07-28 14:31:37 +08:00