This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2025-12-24 13:28:13 +08:00
Code
Issues
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
9d6a42b334c8fef9d446fa5848d1e437c830a1c5
FastDeploy
/
custom_ops
/
gpu_ops
/
append_attn
/
template_instantiation
History
jiangjiajun
684703fd72
[LLM] First commit the llm deployment code
2025-06-09 19:20:15 +08:00
..
append_attention_c4_bfloat16_bfloat16_kernel.cu
…
append_attention_c4_bfloat16_fp8_kernel.cu
…
append_attention_c4_bfloat16_int8_kernel.cu
…
append_attention_c4_float16_float16_kernel.cu
…
append_attention_c4_float16_fp8_kernel.cu
…
append_attention_c4_float16_int8_kernel.cu
…
append_attention_c8_bfloat16_bfloat16_kernel.cu
…
append_attention_c8_bfloat16_fp8_kernel.cu
…
append_attention_c8_bfloat16_int8_kernel.cu
…
append_attention_c8_float16_float16_kernel.cu
…
append_attention_c8_float16_fp8_kerne.cu
…
append_attention_c8_float16_int8_kerne.cu
…
append_attention_c16_bfloat16_bfloat16_kernel.cu
…
append_attention_c16_bfloat16_fp8_kernel.cu
…
append_attention_c16_bfloat16_int8_kernel.cu
…
append_attention_c16_float16_float16_kernel.cu
…
append_attention_c16_float16_fp8_kernel.cu
…
append_attention_c16_float16_int8_kernel.cu
…
encoder_write_cache_with_rope_bfloat16_bfloat16_kernel.cu
…
encoder_write_cache_with_rope_bfloat16_int_kernel.cu
…
encoder_write_cache_with_rope_float16_float16_kernel.cu
…
encoder_write_cache_with_rope_float16_int_kernel.cu
…