mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced 2025-10-05 00:33:03 +08:00

* update multi-draft-token strategy
* fix format
* support hybrid mtp with ngram speculative decoding method