Files
FastDeploy/custom_ops/gpu_ops/speculate_decoding
freeliuzc f1e36ff2f7 [Speculative Decoding][MTP]Support stop_seqs and pd-split mode (#5029)
* support multi_stop_seqs in speculative decoding

* support mtp tp with ep split

* fix custom op register

* fix spec stop_seqs params
2025-11-20 15:26:01 +08:00
..
2025-09-01 17:50:17 +08:00