FastDeploy/custom_ops at fd5fd0bdd78cdf7b752d714d45f1ede53c6abb13 - FastDeploy - 子说镜像小站

apps/FastDeploy

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-12-24 13:28:13 +08:00

Files

History

Nyakku Shigure fd5fd0bdd7 Remove redundant inplace outputs for append_attention (#4341 )

2025-10-10 10:45:26 +08:00

..

[Code Simplification] remove cum_offsets (#3410 )

2025-08-18 20:21:25 +08:00

Remove redundant inplace outputs for append_attention (#4341 )

2025-10-10 10:45:26 +08:00

[Iluvatar GPU] Optimze attention and moe performance (#3234 )

2025-08-08 10:51:24 +08:00

【Fix bug] w4afp8 的nblock固定为256，并且fa3的append attn 增加mask参数 (#3771 ) (#3835 )

2025-09-03 19:36:45 +08:00

[XPU] Update XPU stable xvllm and xtdk version for 2.2 & Change CI Case (#3855 )

2025-09-03 19:33:06 +08:00

0001-DeepGEMM-95e81b3.patch

[feat] support fa3 backend for pd disaggregated (#2695 )

2025-07-03 22:33:27 +08:00

MANIFEST.in

[LLM] First commit the llm deployment code

2025-06-09 19:20:15 +08:00

setup_ops_cpu.py

polish code with new pre-commit rule (#2923 )

2025-07-19 23:19:27 +08:00

setup_ops.py

[FIX]Fix Machete compile via ENABLE_MACHETE (#3727 )

2025-08-30 17:50:17 +08:00