FastDeploy/fastdeploy/model_executor/ops at 2c0d85306715829a5741ace3be6ed6d8d3aaa123 - FastDeploy - 子说镜像小站

apps/FastDeploy

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-12-24 13:28:13 +08:00

Files

History

yzwu fbdd6b0663 [Iluvatar GPU] Optimze attention and moe performance (#3234 )

2025-08-08 10:51:24 +08:00

..

polish code with new pre-commit rule (#2923 )

2025-07-19 23:19:27 +08:00

polish code with new pre-commit rule (#2923 )

2025-07-19 23:19:27 +08:00

【Fearture】support qwen2 some func (#2740 )

2025-07-08 12:03:04 +08:00

[Iluvatar GPU] Optimze attention and moe performance (#3234 )

2025-08-08 10:51:24 +08:00

[LLM] First commit the llm deployment code

2025-06-09 19:20:15 +08:00

moe preprocess op support 160 experts and fused_moe triton kernel name add K (#3121 )

2025-08-01 10:46:20 +08:00

[LLM] First commit the llm deployment code

2025-06-09 19:20:15 +08:00

__init__.py

polish code with new pre-commit rule (#2923 )

2025-07-19 23:19:27 +08:00