Mirror of https://github.com/PaddlePaddle/FastDeploy.git, synced 2025-09-27 04:46:16 +08:00
release/2.2
FastDeploy/custom_ops/utils
yangjianfengo1
9213a58a06
[Fix bug] Fix the w4afp8 nblock at 256 and add a mask parameter to the fa3 append attn (#3771) (#3835)
* fix w4afp8 * add centralized configuration * codestyle * fix fa3 append attn
2025-09-03 19:36:45 +08:00
auto_gen_fp8_fp8_block_gemm_fused_kernels_sm90.py
polish code with new pre-commit rule (#2923)
2025-07-19 23:19:27 +08:00
auto_gen_fp8_fp8_dual_gemm_fused_kernels_sm90.py
polish code with new pre-commit rule (#2923)
2025-07-19 23:19:27 +08:00
auto_gen_fp8_fp8_dual_gemm_fused_kernels.py
polish code with new pre-commit rule (#2923)
2025-07-19 23:19:27 +08:00
auto_gen_fp8_fp8_gemm_fused_kernels_sm90.py
polish code with new pre-commit rule (#2923)
2025-07-19 23:19:27 +08:00
auto_gen_fp8_fp8_gemm_fused_kernels.py
polish code with new pre-commit rule (#2923)
2025-07-19 23:19:27 +08:00
auto_gen_visitor_fp8_gemm_fused_kernels.py
polish code with new pre-commit rule (#2923)
2025-07-19 23:19:27 +08:00
auto_gen_w4afp8_gemm_kernel.py
[Fix bug] Fix the w4afp8 nblock at 256 and add a mask parameter to the fa3 append attn (#3771) (#3835)
2025-09-03 19:36:45 +08:00
auto_gen_wfp8afp8_sparse_gemm_kernel.py
[New Feature] Support 2:4 sparsity for Fp8 group Gemm (#3463)
2025-08-19 02:54:47 -07:00