yangjianfengo1
93fcf7e4ec
【New Feature】W4afp8 supports per group quantization (#4272)
* w4afp8 支持per group
* code style
* 精度完成
* revert append attn utils
* ffn1 动态量化
* ffn2 支持动态量化
* code style
* code style
* 修改单测
* 修改单测
* fix bug
* Implement conditional parameter creation for layers
Add parameter creation for up_gate_proj_in_scale when ep_size > 1.
* code style
* fix conflict
* code style
* code style
* 修复w4aint8 精度
* fix ci
---------
Co-authored-by: yuanxiaolan <yuanxiaolan01@baidu.com>
2025-11-05 21:00:23 +08:00
..
2025-10-24 16:46:45 +08:00
2025-10-16 20:45:24 +08:00
2025-11-02 21:28:04 +08:00
2025-11-05 11:15:57 +08:00
2025-10-17 11:47:16 +08:00
2025-11-04 22:40:15 +08:00
2025-10-30 19:53:09 +08:00
2025-11-05 11:27:30 +08:00
2025-11-05 11:55:51 +08:00
2025-11-04 22:40:15 +08:00
2025-10-25 22:45:38 +08:00
2025-11-05 11:55:51 +08:00
2025-10-28 09:47:47 +08:00
2025-10-15 11:47:47 +08:00
2025-11-05 11:55:51 +08:00
2025-11-04 13:57:55 +08:00
2025-10-27 17:39:51 +08:00
2025-11-05 21:00:23 +08:00
2025-11-05 12:04:59 +08:00
2025-08-29 10:34:05 +08:00
2025-09-22 14:09:09 +08:00
2025-10-31 22:32:05 +08:00
2025-09-12 17:44:03 +08:00
2025-10-31 10:45:27 +08:00
2025-10-17 20:51:59 +08:00
2025-10-27 17:39:51 +08:00
2025-10-21 14:25:45 +08:00
2025-10-20 10:13:21 +08:00
2025-10-11 14:04:17 +08:00