apps/FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git, synced 2025-12-24 13:28:13 +08:00
cba185f1fe5b676bdfc8dcbf5a2fb13e0ea78e88
FastDeploy/fastdeploy/model_executor
ming1753 cba185f1fe [Feature] Optim PaddleOCR-VL (#4873) 2025-11-07 14:56:44 +08:00
* [Feature] Optim PaddleOCR-VL
* fix bug
graph_optimization: …
guided_decoding: …
layers: Revert "【New Feature】W4afp8 supports per group quantization (#4272)" (#4854), 2025-11-06 17:48:28 +08:00
logits_processor: …
model_loader: …
models: [Feature] Optim PaddleOCR-VL (#4873), 2025-11-07 14:56:44 +08:00
ops: …
__init__.py: …
forward_meta.py: …
load_weight_utils.py: [XPU] ep+tp all2all (#4836), 2025-11-06 17:26:14 +08:00
pre_and_post_process.py: remove seq_lens_this_time (#4821), 2025-11-06 11:06:28 +08:00
utils.py: [PD Disaggregation] Support Qwen3-MoE use PD + EP inference. (#4691), 2025-11-06 10:32:15 +08:00