FastDeploy/custom_ops at fac2f64837572613ebdf1b46603d219c39d68a48 - FastDeploy - 子说镜像小站

apps/FastDeploy

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-10-05 08:37:06 +08:00

Files

History

yzwu fbdd6b0663 [Iluvatar GPU] Optimze attention and moe performance (#3234 )

2025-08-08 10:51:24 +08:00

..

Sync v2.0 version of code to github repo

2025-06-29 23:29:37 +00:00

[Iluvatar GPU] Optimze attention and moe performance (#3234 )

2025-08-08 10:51:24 +08:00

[Iluvatar GPU] Optimze attention and moe performance (#3234 )

2025-08-08 10:51:24 +08:00

[New Feature] Support W4Afp8 MoE GroupGemm (#3171 )

2025-08-06 10:34:05 +08:00

[XPU] Support kvblock centralized management (#3017 )

2025-07-29 10:40:55 +08:00

0001-DeepGEMM-95e81b3.patch

[feat] support fa3 backend for pd disaggregated (#2695 )

2025-07-03 22:33:27 +08:00

MANIFEST.in

[LLM] First commit the llm deployment code

2025-06-09 19:20:15 +08:00

setup_ops_base.py

polish code with new pre-commit rule (#2923 )

2025-07-19 23:19:27 +08:00

setup_ops_cpu.py

polish code with new pre-commit rule (#2923 )

2025-07-19 23:19:27 +08:00

setup_ops.py

[Iluvatar GPU] Optimze attention and moe performance (#3234 )

2025-08-08 10:51:24 +08:00