yangjianfengo1
3754a9906d
[Feature] block sparse attention (#3668)
* support sparse attn
* fix bug
* code style
* fix moba attn get kv shape
* fix A100 compilation
* codestyle
* code style
* code style
* code style
* fix conflict
* add unit tests
* code style
* increase eblite load time
* fix bug
* for ci
* for ci
* for ci
* for ci
* support mlp block size 128
* add unit tests for small operators
* fix mlp unit test
* add environment variables to config
* fix rollout config
* fix GPU memory usage
* add test server
* add test server
* fix mlp: use full attn for the last layer
2025-08-29 19:46:30 +08:00