Files
FastDeploy/fastdeploy/model_executor/models
Ayakouji 453487d5b0 [Feat] ernie4_5_vl_moe support CudaGraph (#3226)
* delete dynamic control flow for decode

* coda-style

* fix scatter/gather typos and use input stream instead default stream

* support 0-Size Tensor

* update runner and model

* using static mem address as input

* fix mem leak

* refine code

* update mm_buffer

* fix typo

* fix buffersize

* fix unk token

* refine code

* refine

* support other arch

* open cudagraph in vlci

* fix

* update

* update

* update

* fix cmd

* update

---------

Co-authored-by: aquagull <hongyuh@qq.com>
Co-authored-by: Yuanle Liu <yuanlehome@163.com>
2025-09-10 13:11:57 +08:00
..
2025-08-28 22:53:57 +08:00
2025-08-28 19:42:32 +08:00
2025-09-03 10:54:34 +08:00
2025-09-03 10:54:34 +08:00
2025-09-03 10:54:34 +08:00