zhupengyang
|
3a6883ac1a
|
c++ code format (#4527)
|
2025-10-22 17:59:50 +08:00 |
|
yzwu
|
504461b6b5
|
[Iluvatar GPU] Optimize attention performance and fix moe load ckpt error (#3651)
|
2025-09-22 21:13:59 +08:00 |
|
co63oc
|
d6369b4d51
|
fix typos (#3684)
|
2025-09-01 17:50:17 +08:00 |
|
yzwu
|
fbdd6b0663
|
[Iluvatar GPU] Optimize attention and moe performance (#3234)
|
2025-08-08 10:51:24 +08:00 |
|
Yuanle Liu
|
61b3997b85
|
refactor rl get_name_mappings_to_training (#2847)
* refactor rl get_name_mappings_to_training
* fix tp>1
* change variable name(ffn1->up_gate_proj/ffn2->down_proj)
* change variable name(linear_weight->weight/linear_bias->bias)
* add rl names mapping for vl
* fix ernie 0.3B error
* fix develop code
* fix
|
2025-07-15 07:31:42 -07:00 |
|
liddk1121
|
1b54a2831e
|
Adapt for iluvatar gpu (#2684)
|
2025-07-07 16:53:14 +08:00 |
|