FastDeploy/fastdeploy/model_executor at c06a6234b94d922540860223101754da9a2d1adf - FastDeploy - 子说镜像小站

apps/FastDeploy

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-12-24 13:28:13 +08:00

Files

History

xiaozude c06a6234b9 [Metax] optimize mla attention (#5258 )

2025-12-09 11:18:19 +08:00

..

graph_optimization

[Iluvatar] add vl into ci and support v1 loader (#4774 )

2025-11-11 10:50:17 +08:00

guided_decoding

[Feature] Guided Decoding add LLguidance backend (#5124 )

2025-12-03 20:23:57 +08:00

[Metax] optimize mla attention (#5258 )

2025-12-09 11:18:19 +08:00

logits_processor

[Feature] support logits processors (#4515 )

2025-10-29 00:08:53 +08:00

[RL]Resolve shape mismatch problems in RL-related modules (#5032 )

2025-11-19 11:12:48 +08:00

[Metax] optimize mla attention (#5258 )

2025-12-09 11:18:19 +08:00

[CI]【Hackathon 9th Sprint No.13】NO.13 功能模块 fastdeploy/model_executor/ops/triton_ops/triton_utils.py 单测补充 (#5035 )

2025-11-17 11:43:31 +08:00

__init__.py

polish code with new pre-commit rule (#2923 )

2025-07-19 23:19:27 +08:00

forward_meta.py

[New][RL] Support Rollout Routing Replay (#5405 )

2025-12-05 22:06:26 +08:00

load_weight_utils.py

remove fastsafetensors (#5371 )

2025-12-04 19:22:04 +08:00

pre_and_post_process.py

[Feature] support reward model (#5301 )

2025-12-02 14:55:31 +08:00

utils.py

[BugFix]Set default OMP_NUM_THREADS=3 and fix extra GPU memory usage in DeepSeek (#5219 )

2025-11-28 14:22:04 +08:00

xpu_pre_and_post_process.py

[XPU]add enable_logprob (#5279 )

2025-12-02 15:32:28 +08:00