This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2025-10-06 09:07:10 +08:00
Code
Issues
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
5be18dea000b9976b12b152f3bb2055e34e6efec
FastDeploy
/
custom_ops
/
gpu_ops
/
fp8_gemm_with_cutlass
History
jiangjiajun
149c79699d
[LLM] First commit the llm deployment code
2025-06-16 00:04:48 +08:00
..
fp8_common.h
[LLM] First commit the llm deployment code
2025-06-16 00:04:48 +08:00
fp8_fp8_fp8_dual_gemm.cu
[LLM] First commit the llm deployment code
2025-06-16 00:04:48 +08:00
fp8_fp8_half_block_gemm.cu
[LLM] First commit the llm deployment code
2025-06-16 00:04:48 +08:00
fp8_fp8_half_cuda_core_gemm.cu
[LLM] First commit the llm deployment code
2025-06-16 00:04:48 +08:00
fp8_fp8_half_cuda_core_gemm.h
[LLM] First commit the llm deployment code
2025-06-16 00:04:48 +08:00
fp8_fp8_half_gemm.cu
[LLM] First commit the llm deployment code
2025-06-16 00:04:48 +08:00
per_channel_fp8_fp8_half_gemm.cu
[LLM] First commit the llm deployment code
2025-06-16 00:04:48 +08:00