Yuan Xiaolan
|
5f56d289a7
|
fix is_permuted (#3098)
Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com>
|
2025-07-31 19:58:05 +08:00 |
|
Yuan Xiaolan
|
35935da9e5
|
support W4A8 EPLB (#3075)
|
2025-07-30 14:34:12 +08:00 |
|
Yuan Xiaolan
|
3214fb5393
|
support model loading for w4a8 offline quant (#3064)
Support loading offline-quantized weights for W4A8 EP
|
2025-07-29 21:54:37 +08:00 |
|
YuanRisheng
|
502ee92a0a
|
Unify server-side and model-side Config (Part3) (#3047)
* merge model config
* fix arch
* fix rl
|
2025-07-29 17:07:44 +08:00 |
|
Yuan Xiaolan
|
b1d787a272
|
[fix] w4a8 model loading and hadamard config (#3013)
|
2025-07-28 18:17:59 +08:00 |
|
xiaoxiaohehe001
|
2970b00dfa
|
[Feature] Support_eplb (#2997)
* [Feature] support_eplb
* [Feature] support_eplb
* [Fix] fix mm ep
|
2025-07-24 20:22:45 +08:00 |
|
Zero Rains
|
0fb37ab7e4
|
update flake8 version to support pre-commit in python3.12 (#3000)
* update flake8 version to support pre-commit in python3.12
* polish code
|
2025-07-24 01:43:31 -07:00 |
|
bukejiyu
|
bfeb664ab8
|
update (#2978)
|
2025-07-24 00:16:42 +08:00 |
|
Zero Rains
|
25698d56d1
|
polish code with new pre-commit rule (#2923)
|
2025-07-19 23:19:27 +08:00 |
|
Yuanle Liu
|
dda4a9f848
|
rl update (#2861)
|
2025-07-16 00:33:10 -07:00 |
|
Yuanle Liu
|
61b3997b85
|
refactor rl get_name_mappings_to_training (#2847)
* refactor rl get_name_mappings_to_training
* fix tp>1
* change variable name(ffn1->up_gate_proj/ffn2->down_proj)
* change variable name(linear_weight->weight/linear_bias->bias)
* add rl names mapping for vl
* fix ernie 0.3B error
* fix develop code
* fix
|
2025-07-15 07:31:42 -07:00 |
|
YuanRisheng
|
4c7b8bc458
|
Simplify the Config code (#2770)
* simplify the code
* fix vl
* delete config
* fix
* perfect code
* fix ci
* fix xpu
* fix xpu
* fix server
* resolve conflict
* fix mtp
* resolve conflict
* fix xpu
* fix xpu
* fix vl
* fix log
* fix qwen moe
* fix qwen moe
* fix qwen moe
|
2025-07-14 19:50:05 +08:00 |
|
yulangz
|
be21ef5047
|
[XPU] Supports BF16 for ERNIE-4.5-21B-A3B and ERNIE-4.5-0.3B (#2765)
* fix no quant xpu moe
* change dir of xpu moe weight only
|
2025-07-09 15:57:51 +08:00 |
|
EnflameGCU
|
d0f4d6ba3a
|
[GCU] Support gcu platform (#2702)
baseline: e7fa57ebae
Co-authored-by: yongqiangma <xing.wo@163.com>
|
2025-07-08 13:00:52 +08:00 |
|
gaoziyuan
|
26d5d737dd
|
[Feature] support some qwen2 functions (#2740)
* add rl qwen model support
* fix
* fix
|
2025-07-08 12:03:04 +08:00 |
|
Jiang-Jia-Jun
|
05c670e593
|
[Sync] Update to latest code (#2679)
* [Sync] Update to latest code
* Add new code files
* Add new code files
* update code
* Try to fix build.sh
* Try to fix build.sh
* Update code
* Update requirements.txt
* Update code
---------
Co-authored-by: Jiang-Jia-Jun <jiangjiajun@baidu.com>
|
2025-07-03 15:43:53 +08:00 |
|
Jiang-Jia-Jun
|
92c2cfa2e7
|
Sync v2.0 version of code to github repo
|
2025-06-29 23:29:37 +00:00 |
|
jiangjiajun
|
684703fd72
|
[LLM] First commit the llm deployment code
|
2025-06-09 19:20:15 +08:00 |
|