FastDeploy/model_executor at fac2f64837572613ebdf1b46603d219c39d68a48 - FastDeploy - 子说镜像小站

apps/FastDeploy

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-10-05 16:48:03 +08:00

yzwu fbdd6b0663 [Iluvatar GPU] Optimze attention and moe performance (#3234)

2025-08-08 10:51:24 +08:00

graph_optimization

update flake8 version to support pre-commit in python3.12 (#3000)

2025-07-24 01:43:31 -07:00

guided_decoding

Unify server-side and model-side Config (Part3) (#3047)

2025-07-29 17:07:44 +08:00

[Iluvatar GPU] Optimze attention and moe performance (#3234)

2025-08-08 10:51:24 +08:00

[feat]add fast_weights_iterator (#3258)

2025-08-07 22:36:46 +08:00

qwen3_moe (#3084)

2025-08-06 14:45:27 +08:00

[Iluvatar GPU] Optimze attention and moe performance (#3234)

2025-08-08 10:51:24 +08:00

__init__.py

polish code with new pre-commit rule (#2923)

2025-07-19 23:19:27 +08:00

forward_meta.py

[Executor] Refactor GetBlockShapeAndSplitKVBlock Kernel (#2989)

2025-07-31 00:09:31 +08:00

load_weight_utils.py

[feat]add fast_weights_iterator (#3258)

2025-08-07 22:36:46 +08:00

pre_and_post_process.py

[Iluvatar GPU] Optimze attention and moe performance (#3234)

2025-08-08 10:51:24 +08:00