mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced 2025-10-05 16:48:03 +08:00

* update multi-draft-token strategy * fix format * support hybrid mtp with ngram speculative decoding method
* update multi-draft-token strategy * fix format * support hybrid mtp with ngram speculative decoding method