yangjianfengo1
93fcf7e4ec
【New Feature】W4afp8 supports per group quantization (#4272)
* w4afp8 支持per group
* code style
* 精度完成
* revert append attn utils
* ffn1 动态量化
* ffn2 支持动态量化
* code style
* code style
* 修改单测
* 修改单测
* fix bug
* Implement conditional parameter creation for layers
Add parameter creation for up_gate_proj_in_scale when ep_size > 1.
* code style
* fix conflict
* code style
* code style
* 修复w4aint8 精度
* fix ci
---------
Co-authored-by: yuanxiaolan <yuanxiaolan01@baidu.com>
2025-11-05 21:00:23 +08:00
..
2025-11-05 21:00:23 +08:00
2025-08-01 22:43:18 +08:00
2025-07-19 23:19:27 +08:00
2025-09-17 20:24:53 +08:00
2025-11-05 21:00:23 +08:00
2025-07-19 23:19:27 +08:00
2025-11-05 21:00:23 +08:00
2025-07-15 07:31:42 -07:00
2025-06-29 23:29:37 +00:00
2025-06-09 19:20:15 +08:00
2025-06-09 19:20:15 +08:00
2025-06-09 19:20:15 +08:00
2025-07-19 23:19:27 +08:00
2025-11-05 21:00:23 +08:00
2025-11-05 21:00:23 +08:00
2025-10-30 10:28:36 +08:00
2025-10-30 10:28:36 +08:00
2025-10-30 10:28:36 +08:00
2025-10-30 10:28:36 +08:00
2025-11-05 21:00:23 +08:00
2025-09-10 13:11:57 +08:00
2025-08-15 11:57:45 +08:00
2025-08-15 11:57:45 +08:00
2025-06-29 23:29:37 +00:00
2025-06-29 23:29:37 +00:00
2025-10-20 14:44:58 +08:00
2025-10-20 14:44:58 +08:00
2025-10-30 10:28:36 +08:00
2025-10-20 14:44:58 +08:00
2025-09-03 10:54:34 +08:00