This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2025-10-06 00:57:33 +08:00
Code
Issues
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
ddb10ac509966c9587dd4da3da4c3ab3b7fcb43c
FastDeploy
/
fastdeploy
/
model_executor
/
layers
/
attention
/
ops
History
周周周
ddb10ac509
[Inference, rename] remove padding_offsets from atten use batch_id_per_token (
#2880
)
...
* remove padding_offsets from atten
2025-07-17 18:41:31 +08:00
..
__init__.py
[Feature][MTP] Support cacheKV transfer in per_chunk mode (
#2890
)
2025-07-17 17:58:08 +08:00
append_attention.py
[Inference, rename] remove padding_offsets from atten use batch_id_per_token (
#2880
)
2025-07-17 18:41:31 +08:00
get_block_shape_and_split_kv_block.py
[SOT] Remove breakgraph in post processing && fix datatype (
#2780
)
2025-07-10 11:26:00 +08:00
gqa_rope_write_cache.py
[feat] support fa3 backend for pd disaggregated (
#2695
)
2025-07-03 22:33:27 +08:00
init_kv_signal_per_query.py
[Feature][MTP] Support cacheKV transfer in per_chunk mode (
#2890
)
2025-07-17 17:58:08 +08:00
init_signal_layerwise.py
Sync v2.0 version of code to github repo
2025-06-29 23:29:37 +00:00
open_shm_and_get_meta_signal.py
Sync v2.0 version of code to github repo
2025-06-29 23:29:37 +00:00
pre_cache_len_concat.py
[feat] support fa3 backend for pd disaggregated (
#2695
)
2025-07-03 22:33:27 +08:00