Mirror of https://github.com/PaddlePaddle/FastDeploy.git (synced 2025-10-16 21:51:31 +08:00)
dafe02a7b947cbcce44684334b8311fa0c9f46c9
FastDeploy/custom_ops/gpu_ops/append_attn
lizhenyun01, 238766e403, "fix c4 prompt_cache", 2025-07-28 14:31:37 +08:00
template_instantiation
append_attention_c4_impl.cuh
append_attention_c8_impl.cuh
append_attention_c16_impl.cuh
append_attention_func.cuh
append_attention_kernel.h
decode_attention_func.cuh
decode_attention_kernel.cu
decoder_write_cache_with_rope_impl.cuh
decoder_write_cache_with_rope_kernel.cu
decoder_write_cache_with_rope_kernel.h
encoder_write_cache_with_rope_impl.cuh
encoder_write_cache_with_rope_kernel.h
get_block_shape_and_split_kv_block.cu
gqa_rope_write_cache.cu
mem_util.cuh
mla_cache_kernel.cu
mla_cache_kernel.cuh
mma_tensor_op.cuh
multi_head_latent_attention_kernel.h
pre_cache_len_concat.cu
speculate_write_cache_with_rope_impl.cuh
speculate_write_cache_with_rope_kernel.cu
speculate_write_cache_with_rope_kernel.h
utils.cuh