FastDeploy

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-10-04 00:06:38 +08:00

Files

AIbin a7392a0ff9 【Inference Optimize】DeepSeek-V3-model MLA Optimize (#3886)

* support MLA chunk_size auto search & cuda_graph

2025-09-11 10:46:09 +08:00

2025-09-10 13:24:20 +08:00

2025-09-02 16:21:09 +08:00

2025-09-11 10:46:09 +08:00

cache feature (#3857)

2025-09-07 18:52:46 +08:00

2025-09-10 19:36:10 +08:00

fix cpu __ini__.py (#3448)

2025-08-17 12:38:54 +08:00

__init__.py

2025-07-19 23:19:27 +08:00

forward_meta.py

2025-09-11 10:46:09 +08:00

load_weight_utils.py

cache feature (#3857)

2025-09-07 18:52:46 +08:00

pre_and_post_process.py

2025-09-04 17:39:59 +08:00

utils.py

cache feature (#3857)

2025-09-07 18:52:46 +08:00