apps/FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-10-04 16:22:57 +08:00
e42dc8c694b25733f73b063ac9a40fbec19bee23
FastDeploy/custom_ops/gpu_ops/cutlass_kernels
Yuan Xiaolan 7d87aaace8 optimize w4a8 decoding (#3050)
2025-07-28 22:20:13 +08:00
Name                       Last commit                                                Date
fp8_gemm_fused             polish code with new pre-commit rule (#2923)               2025-07-19 23:19:27 +08:00
fpA_intB_gemm              polish code with new pre-commit rule (#2923)               2025-07-19 23:19:27 +08:00
moe_gemm                   Optimize the performance of moe_expert_ffn_wint2 (#2990)   2025-07-28 10:32:43 +08:00
w4a8_moe                   optimize w4a8 decoding (#3050)                             2025-07-28 22:20:13 +08:00
w8a8                       Sync v2.0 version of code to github repo                   2025-06-29 23:29:37 +00:00
cutlass_helper.h           Sync v2.0 version of code to github repo                   2025-06-29 23:29:37 +00:00
cutlass_heuristic.cu       Feat/blackwell sm100 support (#2670)                       2025-07-09 15:29:42 +08:00
cutlass_heuristic.h        [LLM] First commit the llm deployment code                 2025-06-09 19:20:15 +08:00
cutlass_preprocessors.cu   [LLM] First commit the llm deployment code                 2025-06-09 19:20:15 +08:00
cutlass_preprocessors.h    [LLM] First commit the llm deployment code                 2025-06-09 19:20:15 +08:00
cutlass_type_conversion.h  [LLM] First commit the llm deployment code                 2025-06-09 19:20:15 +08:00
weight_process_utils.h     [LLM] First commit the llm deployment code                 2025-06-09 19:20:15 +08:00