Logo
Explore Help
Sign In
apps/FastDeploy
1
0
Fork 0
You've already forked FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-10-12 20:11:20 +08:00
Code Issues Actions 5 Packages Projects Releases Wiki Activity
Files
62659a7a73f4a5d39db9f1f74baf757d577ed49d
FastDeploy/custom_ops/gpu_ops/cutlass_kernels
History
Yuan Xiaolan 7d87aaace8 optimize w4a8 decoding (#3050)
2025-07-28 22:20:13 +08:00
..
fp8_gemm_fused
polish code with new pre-commit rule (#2923)
2025-07-19 23:19:27 +08:00
fpA_intB_gemm
polish code with new pre-commit rule (#2923)
2025-07-19 23:19:27 +08:00
moe_gemm
Optimize the performance of moe_expert_ffn_wint2 (#2990)
2025-07-28 10:32:43 +08:00
w4a8_moe
optimize w4a8 decoding (#3050)
2025-07-28 22:20:13 +08:00
w8a8
…
cutlass_helper.h
…
cutlass_heuristic.cu
Feat/blackwell sm100 support (#2670)
2025-07-09 15:29:42 +08:00
cutlass_heuristic.h
…
cutlass_preprocessors.cu
…
cutlass_preprocessors.h
…
cutlass_type_conversion.h
…
weight_process_utils.h
…
Powered by Gitea Version: 1.24.5 Page: 4580ms Template: 446ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API