FastDeploy/fastdeploy/model_executor at 6ed9136a4e1c4e9970c554f4ceca966ed9a5f439 - FastDeploy - 子说镜像小站

apps/FastDeploy

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-12-24 13:28:13 +08:00

Files

History

Sunny-bot1 04035e4ebf support w4afp8 two stage (#5608 )

2025-12-22 15:13:05 +08:00

..

graph_optimization

[Others] add assert and only count the actual load in cuda_graph (#5445 )

2025-12-10 11:22:54 +08:00

guided_decoding

[Feature] Guided Decoding add LLguidance backend (#5124 )

2025-12-03 20:23:57 +08:00

support w4afp8 two stage (#5608 )

2025-12-22 15:13:05 +08:00

logits_processor

[Feature] support logits processors (#4515 )

2025-10-29 00:08:53 +08:00

[RL]Resolve shape mismatch problems in RL-related modules (#5032 )

2025-11-19 11:12:48 +08:00

[Iluvatar] Support V1_KVCACHE_SCHEDULER and paddleocr-vl rope mode (#5555 )

2025-12-18 02:14:25 -08:00

[Iluvatar] Support V1_KVCACHE_SCHEDULER and paddleocr-vl rope mode (#5555 )

2025-12-18 02:14:25 -08:00

__init__.py

polish code with new pre-commit rule (#2923 )

2025-07-19 23:19:27 +08:00

forward_meta.py

[Intel HPU] enable tensor_wise_fp8 (#5324 )

2025-12-17 16:45:03 +08:00

load_weight_utils.py

remove fastsafetensors (#5371 )

2025-12-04 19:22:04 +08:00

pre_and_post_process.py

[Iluvatar] Support V1_KVCACHE_SCHEDULER and paddleocr-vl rope mode (#5555 )

2025-12-18 02:14:25 -08:00

utils.py

[RL]Support loading weights via the load_weights function for RL (#5549 )

2025-12-18 02:27:05 -08:00

xpu_pre_and_post_process.py

[XPU]add enable_logprob (#5279 )

2025-12-02 15:32:28 +08:00