apps/FastDeploy
Mirror of https://github.com/PaddlePaddle/FastDeploy.git, synced 2025-12-24 13:28:13 +08:00
FastDeploy/custom_ops at commit 6289cbc43475c467227d981327f542b70aaa0caa
Latest commit: 4403a21d4b by Neil Zhu, 2025-12-10 17:15:17 +08:00
[Metax] refactor cutlass moe and optimize flash attention (#5361)
* [Metax] refactor moe and flash attention backend
Co-authored-by: zhangchenyi_dl <16219492+zhangchenyidl@user.noreply.gitee.com>
Name | Last commit | Last commit date
cpu_ops | c++ code format (#4527) | 2025-10-22 17:59:50 +08:00
gpu_ops | [Metax] refactor cutlass moe and optimize flash attention (#5361) | 2025-12-10 17:15:17 +08:00
iluvatar_ops | c++ code format (#4527) | 2025-10-22 17:59:50 +08:00
metax_ops | [Metax] refactor cutlass moe and optimize flash attention (#5361) | 2025-12-10 17:15:17 +08:00
third_party | [setup optimize]Support git submodule (#4033) | 2025-09-11 17:41:16 +08:00
utils | [Quantization] Support w4afp8 MoE dynamic quantization (#5282) | 2025-12-02 18:56:16 +08:00
xpu_ops | [XPU] add speculate_step_system_cache (#5397) | 2025-12-09 14:40:11 +08:00
0001-DeepGEMM-95e81b3.patch | [OP]Remove extra H2D in DeepGemm (#5262) | 2025-11-28 14:23:44 +08:00
MANIFEST.in | [LLM] First commit the llm deployment code | 2025-06-09 19:20:15 +08:00
setup_ops_cpu.py | polish code with new pre-commit rule (#2923) | 2025-07-19 23:19:27 +08:00
setup_ops.py | [Metax] refactor cutlass moe and optimize flash attention (#5361) | 2025-12-10 17:15:17 +08:00