This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2025-10-12 20:11:20 +08:00
Code
Issues
Actions
5
Packages
Projects
Releases
Wiki
Activity
Files
62659a7a73f4a5d39db9f1f74baf757d577ed49d
FastDeploy
/
custom_ops
/
gpu_ops
/
cutlass_kernels
History
Yuan Xiaolan
7d87aaace8
optimize w4a8 decoding (
#3050
)
2025-07-28 22:20:13 +08:00
..
fp8_gemm_fused
polish code with new pre-commit rule (
#2923
)
2025-07-19 23:19:27 +08:00
fpA_intB_gemm
polish code with new pre-commit rule (
#2923
)
2025-07-19 23:19:27 +08:00
moe_gemm
Optimize the performance of moe_expert_ffn_wint2 (
#2990
)
2025-07-28 10:32:43 +08:00
w4a8_moe
optimize w4a8 decoding (
#3050
)
2025-07-28 22:20:13 +08:00
w8a8
…
cutlass_helper.h
…
cutlass_heuristic.cu
Feat/blackwell sm100 support (
#2670
)
2025-07-09 15:29:42 +08:00
cutlass_heuristic.h
…
cutlass_preprocessors.cu
…
cutlass_preprocessors.h
…
cutlass_type_conversion.h
…
weight_process_utils.h
…