FastDeploy/custom_ops/gpu_ops
freeliuzc f1e36ff2f7 [Speculative Decoding][MTP] Support stop_seqs and pd-split mode (#5029)
* support multi_stop_seqs in speculative decoding

* support mtp tp with ep split

* fix custom op register

* fix spec stop_seqs params
2025-11-20 15:26:01 +08:00