FastDeploy/model_executor at 1f15ca21e444c2a258adb46f64ac0862ed72fa88 - FastDeploy - 子说镜像小站

apps/FastDeploy

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-10-05 16:48:03 +08:00

Files

History

ming1753 1f15ca21e4

Deploy GitHub Pages / deploy (push) Has been cancelled

Details

[Feature] support prompt repetition_penalty (#2806 )

2025-07-17 12:05:52 +08:00

..

graph_optimization

[Executor] CUDA Graph support padding batch (#2844 )

2025-07-15 19:49:01 -07:00

guided_decoding

Sync v2.0 version of code to github repo

2025-06-29 23:29:37 +00:00

[Feature] support prompt repetition_penalty (#2806 )

2025-07-17 12:05:52 +08:00

fix and refine vl (#2866 )

2025-07-16 05:59:28 -07:00

refactor rl get_name_mappings_to_training (#2847 )

2025-07-15 07:31:42 -07:00

__init__.py

[LLM] First commit the llm deployment code

2025-06-09 19:20:15 +08:00

forward_meta.py

[Executor] CUDA Graph support padding batch (#2844 )

2025-07-15 19:49:01 -07:00

load_weight_utils.py

[Fix] Fix mm ep weight init. (#2855 )

2025-07-16 12:02:39 +08:00

model_loader.py

[vl]remove duplicated load logic (#2744 )

2025-07-13 07:36:26 +08:00

pre_and_post_process.py

Merge vl execution path into normal execution path (#2829 )

2025-07-15 22:20:03 +08:00