Files
FastDeploy/fastdeploy/spec_decode
freeliuzc c753f1fc9e [Feature][MTP]Support new mtp (#3656)
* update multi-draft-token strategy

* fix format

* support hybrid mtp with ngram speculative decoding method
2025-08-27 19:38:26 +08:00
..
2025-08-27 19:38:26 +08:00
2025-08-27 19:38:26 +08:00