yangjianfengo1
|
e81046fdad
|
[New Feature] Support w4afp8 in centralized mode (#3644)
* Support tp w4afp8
* code style
|
2025-08-28 10:53:24 +08:00 |
|
Yuan Xiaolan
|
9205c88da1
|
support w4afp8 EP inference (#3044)
|
2025-08-25 11:27:45 +08:00 |
|
Sunny-bot1
|
6c1f3ff897
|
topk_gating_softmax support bias (#3405)
|
2025-08-15 11:57:45 +08:00 |
|
Sunny-bot1
|
2e7831185f
|
[Optimize]Add norm_weights feature for topk_gating_softmax (#3372)
|
2025-08-14 15:05:23 +08:00 |
|
Sunny-bot1
|
8224b21525
|
Refactor moe_topk_select op to use apply_norm_weight as a template parameter (#3345)
* Refactor moe_topk_select op to use apply_norm_weight as a template parameter
* update test
|
2025-08-13 08:44:16 +08:00 |
|
Zero Rains
|
25698d56d1
|
polish code with new pre-commit rule (#2923)
|
2025-07-19 23:19:27 +08:00 |
|
Jiang-Jia-Jun
|
92c2cfa2e7
|
Sync v2.0 version of code to github repo
|
2025-06-29 23:29:37 +00:00 |
|
jiangjiajun
|
684703fd72
|
[LLM] First commit the llm deployment code
|
2025-06-09 19:20:15 +08:00 |
|