FastDeploy/cutlass_kernels at e42dc8c694b25733f73b063ac9a40fbec19bee23 - FastDeploy - 子说镜像小站

apps/FastDeploy

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-10-04 16:22:57 +08:00

Files

History

Yuan Xiaolan 7d87aaace8 optimize w4a8 decoding (#3050 )

2025-07-28 22:20:13 +08:00

..

polish code with new pre-commit rule (#2923 )

2025-07-19 23:19:27 +08:00

polish code with new pre-commit rule (#2923 )

2025-07-19 23:19:27 +08:00

Optimize the performance of moe_expert_ffn_wint2 (#2990 )

2025-07-28 10:32:43 +08:00

optimize w4a8 decoding (#3050 )

2025-07-28 22:20:13 +08:00

Sync v2.0 version of code to github repo

2025-06-29 23:29:37 +00:00

cutlass_helper.h

Sync v2.0 version of code to github repo

2025-06-29 23:29:37 +00:00

cutlass_heuristic.cu

Feat/blackwell sm100 support (#2670 )

2025-07-09 15:29:42 +08:00

cutlass_heuristic.h

[LLM] First commit the llm deployment code

2025-06-09 19:20:15 +08:00

cutlass_preprocessors.cu

[LLM] First commit the llm deployment code

2025-06-09 19:20:15 +08:00

cutlass_preprocessors.h

[LLM] First commit the llm deployment code

2025-06-09 19:20:15 +08:00

cutlass_type_conversion.h

[LLM] First commit the llm deployment code

2025-06-09 19:20:15 +08:00

weight_process_utils.h

[LLM] First commit the llm deployment code

2025-06-09 19:20:15 +08:00