This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2025-10-01 06:42:23 +08:00
Code
Issues
Actions
4
Packages
Projects
Releases
Wiki
Activity
Files
copilot/fix-4157
FastDeploy
/
custom_ops
/
gpu_ops
/
fp8_gemm_with_cutlass
History
Zero Rains
25698d56d1
polish code with new pre-commit rule (
#2923
)
2025-07-19 23:19:27 +08:00
..
fp8_common.h
[LLM] First commit the llm deployment code
2025-06-09 19:20:15 +08:00
fp8_fp8_fp8_dual_gemm.cu
[LLM] First commit the llm deployment code
2025-06-09 19:20:15 +08:00
fp8_fp8_half_block_gemm.cu
Sync v2.0 version of code to github repo
2025-06-29 23:29:37 +00:00
fp8_fp8_half_cuda_core_gemm.cu
polish code with new pre-commit rule (
#2923
)
2025-07-19 23:19:27 +08:00
fp8_fp8_half_cuda_core_gemm.h
[LLM] First commit the llm deployment code
2025-06-09 19:20:15 +08:00
fp8_fp8_half_gemm.cu
[Optimize] Optimize tensorwise fp8 performance (
#2729
)
2025-07-07 20:06:28 +08:00
per_channel_fp8_fp8_half_gemm.cu
[LLM] First commit the llm deployment code
2025-06-09 19:20:15 +08:00