This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2025-12-24 13:28:13 +08:00
Code
Issues
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
7272afe3dccc5de500b923b50b0eef832686d58c
FastDeploy
/
custom_ops
History
yangjianfengo1
9213a58a06
【Fix bug] w4afp8 的nblock固定为256,并且fa3的append attn 增加mask参数 (
#3771
) (
#3835
)
...
* fix w4afp8 * 增加集中式配置 * codestyle * fix fa3 append attn
2025-09-03 19:36:45 +08:00
..
cpu_ops
[Code Simplification] remove cum_offsets (
#3410
)
2025-08-18 20:21:25 +08:00
gpu_ops
【Fix bug] w4afp8 的nblock固定为256,并且fa3的append attn 增加mask参数 (
#3771
) (
#3835
)
2025-09-03 19:36:45 +08:00
iluvatar_ops
[Iluvatar GPU] Optimze attention and moe performance (
#3234
)
2025-08-08 10:51:24 +08:00
utils
【Fix bug] w4afp8 的nblock固定为256,并且fa3的append attn 增加mask参数 (
#3771
) (
#3835
)
2025-09-03 19:36:45 +08:00
xpu_ops
[XPU] Update XPU stable xvllm and xtdk version for 2.2 & Change CI Case (
#3855
)
2025-09-03 19:33:06 +08:00
0001-DeepGEMM-95e81b3.patch
…
MANIFEST.in
…
setup_ops_cpu.py
…
setup_ops.py
[FIX]Fix Machete compile via ENABLE_MACHETE (
#3727
)
2025-08-30 17:50:17 +08:00