This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2025-10-05 16:48:03 +08:00
Code
Issues
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
1f15ca21e444c2a258adb46f64ac0862ed72fa88
FastDeploy
/
fastdeploy
/
model_executor
History
ming1753
1f15ca21e4
Some checks failed
Deploy GitHub Pages / deploy (push)
Has been cancelled
Details
[Feature] support prompt repetition_penalty (
#2806
)
2025-07-17 12:05:52 +08:00
..
graph_optimization
[Executor] CUDA Graph support padding batch (
#2844
)
2025-07-15 19:49:01 -07:00
guided_decoding
Sync v2.0 version of code to github repo
2025-06-29 23:29:37 +00:00
layers
[Feature] support prompt repetition_penalty (
#2806
)
2025-07-17 12:05:52 +08:00
models
fix and refine vl (
#2866
)
2025-07-16 05:59:28 -07:00
ops
refactor rl get_name_mappings_to_training (
#2847
)
2025-07-15 07:31:42 -07:00
__init__.py
[LLM] First commit the llm deployment code
2025-06-09 19:20:15 +08:00
forward_meta.py
[Executor] CUDA Graph support padding batch (
#2844
)
2025-07-15 19:49:01 -07:00
load_weight_utils.py
[Fix] Fix mm ep weight init. (
#2855
)
2025-07-16 12:02:39 +08:00
model_loader.py
[vl]remove duplicated load logic (
#2744
)
2025-07-13 07:36:26 +08:00
pre_and_post_process.py
Merge vl execution path into normal execution path (
#2829
)
2025-07-15 22:20:03 +08:00