Commit Graph

3 Commits

Author SHA1 Message Date
freeliuzc
c753f1fc9e [Feature][MTP]Support new mtp (#3656)
* update multi-draft-token strategy

* fix format

* support hybrid mtp with ngram speculative decoding method
2025-08-27 19:38:26 +08:00
Jiang-Jia-Jun
92c2cfa2e7 Sync v2.0 version of code to github repo 2025-06-29 23:29:37 +00:00
jiangjiajun
684703fd72 [LLM] First commit the llm deployment code 2025-06-09 19:20:15 +08:00