RAM
d850660872
[Executor] Refactor GetBlockShapeAndSplitKVBlock Kernel (#2989)
* reset decoder_block_shape_q buffer
* refactor GetBlockShapeAndSplitKVBlock Kernel and cudagraph padding batch
* update decode_max_tile_size
* fix pre-commit
* update block_multihead_attn_backend
* update flas attn backend
* update MLA Attention
* update XPU Attention
* update gcu,iluvatar model runner
* Update MTP
* fix MTP bug
2025-07-31 00:09:31 +08:00
..
2025-07-31 00:09:31 +08:00
2025-07-19 23:19:27 +08:00
2025-07-09 16:00:27 +08:00
2025-07-28 10:32:43 +08:00
2025-07-28 22:20:13 +08:00
2025-07-19 23:19:27 +08:00
2025-06-09 19:20:15 +08:00
2025-06-09 19:20:15 +08:00
2025-07-22 14:01:30 +08:00
2025-07-28 22:20:13 +08:00
2025-06-29 23:29:37 +00:00
2025-07-22 05:53:37 -07:00
2025-07-30 09:31:29 +08:00
2025-07-17 18:41:31 +08:00
2025-06-09 19:20:15 +08:00
2025-07-31 00:09:31 +08:00
2025-06-09 19:20:15 +08:00
2025-06-09 19:20:15 +08:00
2025-06-09 19:20:15 +08:00
2025-07-03 15:43:53 +08:00
2025-07-19 23:19:27 +08:00
2025-06-09 19:20:15 +08:00
2025-07-07 20:06:28 +08:00
2025-07-22 15:03:41 +08:00
2025-06-09 19:20:15 +08:00
2025-06-29 23:29:37 +00:00
2025-07-30 16:05:55 +08:00
2025-07-19 23:19:27 +08:00
2025-07-17 17:58:08 +08:00
2025-07-10 16:33:40 +08:00
2025-06-09 19:20:15 +08:00
2025-06-09 19:20:15 +08:00
2025-07-18 14:16:44 +08:00
2025-07-22 15:03:41 +08:00
2025-06-09 19:20:15 +08:00
2025-07-09 18:56:27 +08:00
2025-06-09 19:20:15 +08:00
2025-07-19 23:19:27 +08:00
2025-07-19 23:19:27 +08:00
2025-07-22 15:03:41 +08:00
2025-06-09 19:20:15 +08:00
2025-07-22 15:03:41 +08:00
2025-07-07 16:53:14 +08:00
2025-06-29 23:29:37 +00:00
2025-07-03 15:43:53 +08:00
2025-06-09 19:20:15 +08:00
2025-07-19 23:19:27 +08:00
2025-07-19 23:19:27 +08:00
2025-07-07 16:53:14 +08:00
2025-07-23 20:31:31 +08:00
2025-07-19 23:19:27 +08:00
2025-07-19 23:19:27 +08:00
2025-07-10 16:33:40 +08:00
2025-06-29 23:29:37 +00:00
2025-06-09 19:20:15 +08:00
2025-06-09 19:20:15 +08:00
2025-06-09 19:20:15 +08:00
2025-07-19 23:19:27 +08:00
2025-06-09 19:20:15 +08:00
2025-07-09 18:56:27 +08:00
2025-06-09 19:20:15 +08:00
2025-06-09 19:20:15 +08:00
2025-07-07 16:53:14 +08:00
2025-07-19 23:19:27 +08:00
2025-06-29 23:29:37 +00:00
2025-07-19 23:19:27 +08:00
2025-07-07 16:53:14 +08:00
2025-07-29 14:17:37 +08:00
2025-06-09 19:20:15 +08:00
2025-06-09 19:20:15 +08:00
2025-07-19 23:19:27 +08:00
2025-06-09 19:20:15 +08:00
2025-07-19 23:19:27 +08:00
2025-07-19 23:19:27 +08:00
2025-07-30 09:31:29 +08:00
2025-06-09 19:20:15 +08:00
2025-06-09 19:20:15 +08:00
2025-06-09 19:20:15 +08:00
2025-07-19 23:19:27 +08:00
2025-07-19 23:19:27 +08:00
2025-07-23 20:31:31 +08:00
2025-07-07 16:53:14 +08:00
2025-06-09 19:20:15 +08:00