Ayakouji
453487d5b0
[Feat] ernie4_5_vl_moe
support CudaGraph (#3226)
* delete dynamic control flow for decode
* coda-style
* fix scatter/gather typos and use input stream instead default stream
* support 0-Size Tensor
* update runner and model
* using static mem address as input
* fix mem leak
* refine code
* update mm_buffer
* fix typo
* fix buffersize
* fix unk token
* refine code
* refine
* support other arch
* open cudagraph in vlci
* fix
* update
* update
* update
* fix cmd
* update
---------
Co-authored-by: aquagull <hongyuh@qq.com>
Co-authored-by: Yuanle Liu <yuanlehome@163.com>
2025-09-10 13:11:57 +08:00
..
2025-09-08 14:13:13 +08:00
2025-08-30 17:06:26 +08:00
2025-08-26 00:04:01 -07:00
2025-09-09 11:08:23 +08:00
2025-09-08 15:52:26 +08:00
2025-09-04 20:31:48 +08:00
2025-08-28 22:53:57 +08:00
2025-08-20 09:52:34 +08:00
2025-09-10 10:47:20 +08:00
2025-09-10 13:11:57 +08:00
2025-08-29 18:28:39 +08:00
2025-09-08 14:13:13 +08:00
2025-09-09 11:08:23 +08:00
2025-09-03 18:31:27 +08:00
2025-08-21 17:25:44 +08:00
2025-09-09 11:08:23 +08:00
2025-08-19 19:32:04 +08:00
2025-09-09 20:05:54 -07:00
2025-08-30 23:20:58 +08:00
2025-09-09 20:05:54 -07:00
2025-08-14 11:36:24 +08:00
2025-09-09 15:08:03 +08:00
2025-09-07 18:52:46 +08:00
2025-07-22 14:06:01 +08:00
2025-07-19 23:19:27 +08:00
2025-07-03 15:43:53 +08:00
2025-09-09 20:05:54 -07:00