FastDeploy/custom_ops/gpu_ops/append_attn at 9e4eb339b8def173078d75e82c3523d8c67759b2 - FastDeploy - 子说镜像小站

apps/FastDeploy

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-12-24 13:28:13 +08:00

Files

History

…

..

template_instantiation

…

append_attention_c4_impl.cuh

…

append_attention_c8_impl.cuh

…

append_attention_c16_impl.cuh

…

append_attention_func.cuh

…

append_attention_kernel.h

…

decode_attention_func.cuh

…

decoder_mla_attention_kernel.cu

…

decoder_mla_attention_kernel.h

…

decoder_write_cache_with_rope_impl.cuh

…

decoder_write_cache_with_rope_kernel.cu

…

decoder_write_cache_with_rope_kernel.h

…

encoder_write_cache_with_rope_impl.cuh

…

encoder_write_cache_with_rope_kernel.h

…

get_block_shape_and_split_kv_block.cu

…

gqa_rope_write_cache.cu

…

mem_util.cuh

…

mla_cache_kernel.cu

…

mla_cache_kernel.cuh

…

mma_tensor_op.cuh

…

multiquery_attention_c4_impl.cuh

…

multiquery_attention_c4_kernel.h

…

multiquery_attention_c8_impl.cuh

…

multiquery_attention_c8_kernel.h

…

multiquery_attention_c16_impl.cuh

…

multiquery_attention_c16_kernel.h

…

multiquery_decoder_attention_impl.cuh

…

multiquery_decoder_attention_kernel.h

…

pre_cache_len_concat.cu

…

qwen3_rope.h

…

speculate_write_cache_with_rope_impl.cuh

…

speculate_write_cache_with_rope_kernel.cu

…

speculate_write_cache_with_rope_kernel.h

…

template_config.json

…

utils.cuh

…