This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2025-10-05 08:37:06 +08:00
Code
Issues
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
ece88596ede6ea99df882f4e4a6f556adfa30c3a
FastDeploy
/
fastdeploy
/
model_executor
History
bukejiyu
bad53c6b6e
Some checks failed
Deploy GitHub Pages / deploy (push)
Has been cancelled
Details
[vl]remove duplicated load logic (
#2744
)
2025-07-13 07:36:26 +08:00
..
graph_optimization
[Executor] Fix bug of logger.debug (
#2778
)
2025-07-09 04:13:43 -07:00
guided_decoding
Sync v2.0 version of code to github repo
2025-06-29 23:29:37 +00:00
layers
[Feature] support tensor-parallel-size>num_key_value_heads for qwen3 (
#2799
)
2025-07-11 15:09:43 +08:00
models
[vl]remove duplicated load logic (
#2744
)
2025-07-13 07:36:26 +08:00
ops
[GCU] Support gcu platform (
#2702
)
2025-07-08 13:00:52 +08:00
__init__.py
[LLM] First commit the llm deployment code
2025-06-09 19:20:15 +08:00
forward_meta.py
[Executor] Move forward_meta.py to fastdeploy/model_executor (
#2774
)
2025-07-10 20:36:51 +08:00
load_weight_utils.py
[feat] support fa3 backend for pd disaggregated (
#2695
)
2025-07-03 22:33:27 +08:00
model_loader.py
[vl]remove duplicated load logic (
#2744
)
2025-07-13 07:36:26 +08:00
pre_and_post_process.py
[Feature] Online Chat API Support Return logprobs (
#2777
)
2025-07-10 16:33:40 +08:00