mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced 2025-10-30 03:22:05 +08:00
* update multi-draft-token strategy
* fix format
* support hybrid mtp with ngram speculative decoding method