FastDeploy

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-10-04 16:22:57 +08:00

Files

Yuanle Liu c3b2a60fb8 [BugFix] Fix the abnormal memory usage caused by shape errors in the triton moe backend (#4026 )

* fix device_id to in

* fix triton_moe bug

2025-09-09 20:05:54 -07:00

2025-09-02 17:32:13 +08:00

2025-09-02 16:21:09 +08:00

2025-09-09 20:05:54 -07:00

cache feature (#3857 )

2025-09-07 18:52:46 +08:00

2025-09-09 05:25:08 -07:00

fix cpu __ini__.py (#3448 )

2025-08-17 12:38:54 +08:00

__init__.py

2025-07-19 23:19:27 +08:00

forward_meta.py

2025-09-08 13:12:24 +08:00

load_weight_utils.py

cache feature (#3857 )

2025-09-07 18:52:46 +08:00

pre_and_post_process.py

2025-09-04 17:39:59 +08:00

utils.py

cache feature (#3857 )

2025-09-07 18:52:46 +08:00