FastDeploy

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-12-24 13:28:13 +08:00

Files

Sunny-bot1 930f7b781c [Optimization] Put get_block_shape_and_split_kv_block in cuda graph for append attention backend (#4443)

* get block in cuda graph

* fix sot

2025-10-17 10:59:56 +08:00

__init__.py               2025-07-19 23:19:27 +08:00
forward_meta.py           2025-10-13 20:35:00 +08:00
load_weight_utils.py      2025-09-23 19:36:00 +08:00
pre_and_post_process.py   2025-10-16 15:46:26 +08:00
utils.py                  V1 loader default (#4251)   2025-10-15 16:49:17 +08:00