apps/FastDeploy
Mirror of https://github.com/PaddlePaddle/FastDeploy.git (synced 2025-12-24 13:28:13 +08:00)
dd877f38b15969d88f0520d8bcdfce3b410f4e01
FastDeploy/fastdeploy/model_executor/layers/attention
Latest commit: 6ccc10ad47 by YuanRisheng, 2025-07-28 10:51:52 +08:00
Unify server-side and model-side Config (Part1) (#3018)
* move cache config
* fix mtp
Name                             Last commit                                              Date
ops                              support chunk_prefill in fa3                             2025-07-23 12:19:20 +08:00
__init__.py                      [SOT] Mark dynamic dims by type annotations (#2771)      2025-07-22 00:23:52 -07:00
append_attn_backend.py           Unify server-side and model-side Config (Part1) (#3018)  2025-07-28 10:51:52 +08:00
attention_selecter.py            …
attention.py                     …
base_attention_backend.py        …
block_multihead_attn_backend.py  Unify server-side and model-side Config (Part1) (#3018)  2025-07-28 10:51:52 +08:00
flash_attn_backend.py            Unify server-side and model-side Config (Part1) (#3018)  2025-07-28 10:51:52 +08:00
iluvatar_attn_backend.py         Unify server-side and model-side Config (Part1) (#3018)  2025-07-28 10:51:52 +08:00
mla_attention_backend.py         Unify server-side and model-side Config (Part1) (#3018)  2025-07-28 10:51:52 +08:00
native_paddle_backend.py         …
utils.py                         …
xpu_attn_backend.py              Unify server-side and model-side Config (Part1) (#3018)  2025-07-28 10:51:52 +08:00