mirror of https://github.com/PaddlePaddle/FastDeploy.git (synced 2025-12-24 13:28:13 +08:00)
73886204d428c992233ff2349abf8c0cb6d6939f
FastDeploy/fastdeploy/model_executor/layers/attention/ops
lizhenyun01  aba4fc657f  [Feature] support flash_mask_attention backend (#5134)
* [Feature] support flash_mask_attention backend
* fix unittest
* clean code
2025-11-28 10:12:16 +08:00
__init__.py                              [Feature] support flash_mask_attention backend (#5134)    2025-11-28 10:12:16 +08:00
append_attention.py                      …
flash_mask_attention.py                  [Feature] support flash_mask_attention backend (#5134)    2025-11-28 10:12:16 +08:00
get_block_shape_and_split_kv_block.py    …
gqa_rope_write_cache.py                  [Feature] support flash_mask_attention backend (#5134)    2025-11-28 10:12:16 +08:00
init_kv_signal_per_query.py              …
init_signal_layerwise.py                 …
open_shm_and_get_meta_signal.py          …
pre_cache_len_concat.py                  …