mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced 2025-12-24 13:28:13 +08:00
* update multi-draft-token strategy * fix format * support hybrid mtp with ngram speculative decoding method
* update multi-draft-token strategy * fix format * support hybrid mtp with ngram speculative decoding method