yangjianfengo1
93fcf7e4ec
【New Feature】W4afp8 supports per group quantization (#4272)
* w4afp8 支持per group
* code style
* 精度完成
* revert append attn utils
* ffn1 动态量化
* ffn2 支持动态量化
* code style
* code style
* 修改单测
* 修改单测
* fix bug
* Implement conditional parameter creation for layers
Add parameter creation for up_gate_proj_in_scale when ep_size > 1.
* code style
* fix conflict
* code style
* code style
* 修复w4aint8 精度
* fix ci
---------
Co-authored-by: yuanxiaolan <yuanxiaolan01@baidu.com>
2025-11-05 21:00:23 +08:00
..
2025-10-24 16:46:45 +08:00
2025-11-03 15:38:31 +08:00
2025-08-30 17:06:26 +08:00
2025-10-28 22:11:03 +08:00
2025-11-05 12:04:59 +08:00
2025-11-05 20:46:33 +08:00
2025-11-03 14:08:15 +08:00
2025-11-04 22:40:15 +08:00
2025-11-03 15:38:31 +08:00
2025-10-28 09:47:47 +08:00
2025-10-15 11:49:24 +08:00
2025-11-05 21:00:23 +08:00
2025-11-05 17:15:24 +08:00
2025-11-05 12:04:59 +08:00
2025-10-24 10:14:53 +08:00
2025-09-24 14:50:45 +08:00
2025-11-05 11:27:30 +08:00
2025-10-28 16:02:47 +08:00
2025-11-03 20:12:14 +08:00
2025-11-05 11:55:51 +08:00
2025-11-03 15:38:31 +08:00
2025-09-22 14:09:09 +08:00
2025-11-05 11:55:51 +08:00
2025-10-28 09:47:47 +08:00
2025-10-28 20:23:46 +08:00
2025-11-05 10:43:25 +08:00
2025-11-03 15:38:31 +08:00
2025-07-22 14:06:01 +08:00
2025-07-19 23:19:27 +08:00
2025-07-03 15:43:53 +08:00
2025-10-31 15:26:35 +08:00