Yuan Xiaolan
|
7ce00e597c
|
support qk norm (#3145)
|
2025-08-05 16:46:14 +08:00 |
|
chen
|
a2f5cc54f8
|
moe preprocess op support 160 experts and fused_moe triton kernel name add K (#3121)
|
2025-08-01 10:46:20 +08:00 |
|
Yuanle Liu
|
61b3997b85
|
refactor rl get_name_mappings_to_training (#2847)
* refactor rl get_name_mappings_to_training
* fix tp>1
* change variable name(ffn1->up_gate_proj/ffn2->down_proj)
* change variable name(linear_weight->weight/linear_bias->bias)
* add rl names mapping for vl
* fix ernie 0.3B error
* fix develop code
* fix
|
2025-07-15 07:31:42 -07:00 |
|
RichardWooSJTU
|
fee544e808
|
fix ep prefill (#2762)
|
2025-07-09 14:03:05 +08:00 |
|
Jiang-Jia-Jun
|
92c2cfa2e7
|
Sync v2.0 version of code to github repo
|
2025-06-29 23:29:37 +00:00 |
|
jiangjiajun
|
684703fd72
|
[LLM] First commit the llm deployment code
|
2025-06-09 19:20:15 +08:00 |
|