apps/FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-12-24 13:28:13 +08:00
40f7f3e0d8eed498e686ad09806190c847038394
FastDeploy/custom_ops/gpu_ops/cutlass_kernels
Yuan Xiaolan 7d87aaace8 optimize w4a8 decoding (#3050)
2025-07-28 22:20:13 +08:00
Name                       Last commit                                                 Date
fp8_gemm_fused             polish code with new pre-commit rule (#2923)                2025-07-19 23:19:27 +08:00
fpA_intB_gemm              polish code with new pre-commit rule (#2923)                2025-07-19 23:19:27 +08:00
moe_gemm                   Optimize the performance of moe_expert_ffn_wint2 (#2990)    2025-07-28 10:32:43 +08:00
w4a8_moe                   optimize w4a8 decoding (#3050)                              2025-07-28 22:20:13 +08:00
w8a8                       Sync v2.0 version of code to github repo                    2025-06-29 23:29:37 +00:00
cutlass_helper.h           Sync v2.0 version of code to github repo                    2025-06-29 23:29:37 +00:00
cutlass_heuristic.cu       Feat/blackwell sm100 support (#2670)                        2025-07-09 15:29:42 +08:00
cutlass_heuristic.h        …
cutlass_preprocessors.cu   …
cutlass_preprocessors.h    …
cutlass_type_conversion.h  …
weight_process_utils.h     …