mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced 2025-10-04 16:22:57 +08:00

* update multi-draft-token strategy
* fix format
* support hybrid mtp with ngram speculative decoding method