chen
|
3161014e49
|
[BugFix]fix v1 loader moe bf16, and supoort dynamic_load_weight create quant param (#4229)
* fix v1 loader moe bf16, and supoort dynamic_load_weight create quant param
* include_stop_str_in_output=False not return eos text
|
2025-09-24 14:12:05 +08:00 |
|
bukejiyu
|
113e330030
|
fix bf16 and add comments (#4106)
|
2025-09-15 17:23:07 +08:00 |
|
bukejiyu
|
29ed617f0f
|
[v1 loader]qwen Offline fp8 (#4036)
* support offline fp8
* update ut
* update ut
* update ut
* fix
* update
* update
|
2025-09-15 13:44:11 +08:00 |
|
Jiang-Jia-Jun
|
92c2cfa2e7
|
Sync v2.0 version of code to github repo
|
2025-06-29 23:29:37 +00:00 |
|
jiangjiajun
|
684703fd72
|
[LLM] First commit the llm deployment code
|
2025-06-09 19:20:15 +08:00 |
|