FastDeploy/fastdeploy/model_executor at 2cf55168ca99406094fc6eefd98163196f90473f - FastDeploy - 子说镜像小站

apps/FastDeploy

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-12-24 13:28:13 +08:00

Yuan Xiaolan 2cf55168ca load hadamard_block_size from config (#3797)

2025-09-05 17:07:58 +08:00

graph_optimization

[Executor] Fix bug of import paddle with RLHF (#3781)

2025-09-02 17:32:13 +08:00

guided_decoding

[Feature] mm and thinking model support structred output (#2749)

2025-09-02 16:21:09 +08:00

load hadamard_block_size from config (#3797)

2025-09-05 17:07:58 +08:00

[BugFix]fix dp&ep&tp and muti node infer (#3629)

2025-08-28 19:09:10 +08:00

【Inference Optimize】Update MergedReplicatedLinear for DSK qkv_a_proj_with_mqa. (#3673)

2025-09-04 21:16:05 -07:00

fix cpu __ini__.py (#3448)

2025-08-17 12:38:54 +08:00

__init__.py

polish code with new pre-commit rule (#2923)

2025-07-19 23:19:27 +08:00

forward_meta.py

enable dcu ci (#3402)

2025-08-29 10:23:08 +08:00

load_weight_utils.py

[v1loader]Reduce EB300B model loading time (#3700)

2025-09-02 19:13:57 +08:00

pre_and_post_process.py

support mtp in v1_scheduler mode (#3695)

2025-09-04 17:39:59 +08:00

utils.py

[v1loader]Reduce EB300B model loading time (#3700)

2025-09-02 19:13:57 +08:00