Ayakouji
453487d5b0
[Feat] ernie4_5_vl_moe
support CudaGraph (#3226)
* delete dynamic control flow for decode
* coda-style
* fix scatter/gather typos and use input stream instead default stream
* support 0-Size Tensor
* update runner and model
* using static mem address as input
* fix mem leak
* refine code
* update mm_buffer
* fix typo
* fix buffersize
* fix unk token
* refine code
* refine
* support other arch
* open cudagraph in vlci
* fix
* update
* update
* update
* fix cmd
* update
---------
Co-authored-by: aquagull <hongyuh@qq.com>
Co-authored-by: Yuanle Liu <yuanlehome@163.com>
2025-09-10 13:11:57 +08:00
..
2025-09-08 21:58:34 +08:00
2025-07-19 23:19:27 +08:00
2025-08-20 20:29:58 +08:00
2025-09-01 17:50:17 +08:00
2025-09-01 17:50:17 +08:00
2025-08-05 12:20:48 +08:00
2025-07-19 23:19:27 +08:00
2025-06-09 19:20:15 +08:00
2025-09-01 17:50:17 +08:00
2025-08-28 09:49:58 +08:00
2025-07-22 14:01:30 +08:00
2025-09-08 15:22:41 +08:00
2025-09-10 13:11:57 +08:00
2025-06-29 23:29:37 +00:00
2025-08-14 22:40:44 +08:00
2025-09-07 10:24:58 +08:00
2025-09-02 19:17:01 +08:00
2025-08-19 02:54:47 -07:00
2025-09-07 20:41:29 -07:00
2025-06-09 19:20:15 +08:00
2025-09-10 13:11:57 +08:00
2025-06-09 19:20:15 +08:00
2025-06-09 19:20:15 +08:00
2025-06-09 19:20:15 +08:00
2025-07-03 15:43:53 +08:00
2025-08-01 17:28:07 +08:00
2025-09-03 10:54:34 +08:00
2025-07-07 20:06:28 +08:00
2025-07-22 15:03:41 +08:00
2025-06-09 19:20:15 +08:00
2025-06-29 23:29:37 +00:00
2025-07-30 16:05:55 +08:00
2025-07-19 23:19:27 +08:00
2025-08-14 22:40:44 +08:00
2025-07-10 16:33:40 +08:00
2025-06-09 19:20:15 +08:00
2025-06-09 19:20:15 +08:00
2025-09-10 13:11:57 +08:00
2025-07-22 15:03:41 +08:00
2025-06-09 19:20:15 +08:00
2025-08-25 11:27:45 +08:00
2025-06-09 19:20:15 +08:00
2025-07-19 23:19:27 +08:00
2025-08-01 18:03:36 +08:00
2025-07-19 23:19:27 +08:00
2025-07-22 15:03:41 +08:00
2025-06-09 19:20:15 +08:00
2025-08-08 17:30:37 +08:00
2025-08-05 16:43:07 +08:00
2025-06-29 23:29:37 +00:00
2025-08-28 10:52:53 +08:00
2025-06-09 19:20:15 +08:00
2025-07-19 23:19:27 +08:00
2025-07-19 23:19:27 +08:00
2025-08-18 20:21:25 +08:00
2025-09-04 17:39:59 +08:00
2025-07-19 23:19:27 +08:00
2025-07-19 23:19:27 +08:00
2025-09-01 17:50:17 +08:00
2025-09-01 17:50:17 +08:00
2025-06-09 19:20:15 +08:00
2025-06-09 19:20:15 +08:00
2025-06-09 19:20:15 +08:00
2025-08-29 11:04:04 +08:00
2025-06-09 19:20:15 +08:00
2025-07-09 18:56:27 +08:00
2025-06-09 19:20:15 +08:00
2025-06-09 19:20:15 +08:00
2025-09-03 10:54:34 +08:00
2025-07-19 23:19:27 +08:00
2025-09-01 17:50:17 +08:00
2025-07-19 23:19:27 +08:00
2025-07-07 16:53:14 +08:00
2025-07-29 14:17:37 +08:00
2025-06-09 19:20:15 +08:00
2025-06-09 19:20:15 +08:00
2025-07-19 23:19:27 +08:00
2025-06-09 19:20:15 +08:00
2025-09-10 13:11:57 +08:00
2025-09-10 13:11:57 +08:00
2025-07-30 09:31:29 +08:00
2025-06-09 19:20:15 +08:00
2025-09-01 17:50:17 +08:00
2025-06-09 19:20:15 +08:00
2025-07-19 23:19:27 +08:00
2025-07-19 23:19:27 +08:00
2025-07-23 20:31:31 +08:00
2025-07-07 16:53:14 +08:00
2025-09-01 17:50:17 +08:00