apps/FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git, synced 2025-12-24 13:28:13 +08:00
888c4b992dd9081881a2dfeed445f4e630788eed
FastDeploy/custom_ops/gpu_ops/cutlass_kernels/w4a8_moe
co63oc  d6369b4d51  fix typos (#3684)  2025-09-01 17:50:17 +08:00
cutlass_extensions                        fix typos (#3684)               2025-09-01 17:50:17 +08:00
base64_encode.h                           …
compile_w4a8_moe.sh                       …
cuda_utils.h                              …
cutlass_heuristic_w4a4.h                  …
w4a4_gemm_configs.h                       …
w4a8_gemm_grouped.h                       optimize w4a8 decoding (#3050)  2025-07-28 22:20:13 +08:00
w4a8_moe_cutlass_kernel_template.cu       …
w4a8_moe_cutlass_kernel.h                 …
w4a8_moe_gemm_config_search.sh            optimize w4a8 decoding (#3050)  2025-07-28 22:20:13 +08:00
w4a8_moe_gemm_kernel_template.h           …
w4a8_moe_gemm_kernel.h                    …
w4a8_moe_gemm_test.cu                     optimize w4a8 decoding (#3050)  2025-07-28 22:20:13 +08:00
w4a8_moe_gemm_with_epilogue_visitor.h     …
weight_process_utils.h                    …