This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2025-12-24 13:28:13 +08:00
Code
Issues
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
0bef9b684f170a65375493b7e9a355fca1565a69
FastDeploy
/
custom_ops
/
gpu_ops
/
w4afp8_gemm
History
lizexu123
6d323769dd
fix w4afp8 (
#5634
)
2025-12-22 13:39:41 +08:00
..
kernel_traits.h
【New Feature】W4afp8 supports per group quantization (
#4987
)
2025-11-13 19:17:27 +08:00
mainloop_fwd.h
【New Feature】W4afp8 supports per group quantization (
#4987
)
2025-11-13 19:17:27 +08:00
utils.hpp
【New Feature】W4afp8 supports per group quantization (
#4987
)
2025-11-13 19:17:27 +08:00
w4afp8_gemm_kernel.hpp
fix w4afp8 (
#5634
)
2025-12-22 13:39:41 +08:00
w4afp8_gemm.cu
【New Feature】W4afp8 supports per group quantization (
#4987
)
2025-11-13 19:17:27 +08:00
w4afp8_gemm.h
【New Feature】W4afp8 supports per group quantization (
#4987
)
2025-11-13 19:17:27 +08:00
weight_kernel.hpp
【New Feature】W4afp8 supports per group quantization (
#4987
)
2025-11-13 19:17:27 +08:00
weight_scale_kernel.hpp
【New Feature】W4afp8 supports per group quantization (
#4987
)
2025-11-13 19:17:27 +08:00