Logo
Explore Help
Sign In
apps/FastDeploy
1
0
Fork 0
You've already forked FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-12-24 13:28:13 +08:00
Code Issues Actions 2 Packages Projects Releases Wiki Activity
Files
6871aad03dc8cb5408bcdf0323799bde5aa56262
FastDeploy/fastdeploy/model_executor
History
ming1753 cba185f1fe [Feature] Optim PaddleOCR-VL (#4873)
* [Feature] Optim PaddleOCR-VL

* fix bug
2025-11-07 14:56:44 +08:00
..
graph_optimization
[Graph Optimization] Refactor default capture list (#4617)
2025-10-28 21:31:02 +08:00
guided_decoding
[FDConfig]Remove reasoning_parser/guided_decoding_backend/disable_any_whitespace/device_ids in FDConfig (#4362)
2025-10-17 10:40:59 +08:00
layers
Revert "【New Feature】W4afp8 supports per group quantization (#4272)" (#4854)
2025-11-06 17:48:28 +08:00
logits_processor
[Feature] support logits processors (#4515)
2025-10-29 00:08:53 +08:00
model_loader
[Speculative Decoding][MTP]Support mtp in epdptp mode (#4614)
2025-10-28 16:02:47 +08:00
models
[Feature] Optim PaddleOCR-VL (#4873)
2025-11-07 14:56:44 +08:00
ops
delete useless code (#4544)
2025-10-23 13:40:34 +08:00
__init__.py
polish code with new pre-commit rule (#2923)
2025-07-19 23:19:27 +08:00
forward_meta.py
remove input_ids from ForwardMeta (#4793)
2025-11-05 11:55:51 +08:00
load_weight_utils.py
[XPU] ep+tp all2all (#4836)
2025-11-06 17:26:14 +08:00
pre_and_post_process.py
remove seq_lens_this_time (#4821)
2025-11-06 11:06:28 +08:00
utils.py
[PD Disaggregation] Support Qwen3-MoE use PD + EP inference. (#4691)
2025-11-06 10:32:15 +08:00
Powered by Gitea Version: 1.25.2 Page: 255ms Template: 7ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API