apps/FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-12-24 13:28:13 +08:00
6c316286c19b0da0c41132838e9b2ee837d8073b
FastDeploy/fastdeploy/model_executor
Ayakouji 831266da7a [Fix] fix ernie4_5_vl model torch format loadding (#4447)
* fix

* add test

* fix test

* fix test

* update
2025-11-06 14:34:21 +08:00
Name | Last commit | Date
graph_optimization | [Graph Optimization] Refactor default capture list (#4617) | 2025-10-28 21:31:02 +08:00
guided_decoding | [FDConfig]Remove reasoning_parser/guided_decoding_backend/disable_any_whitespace/device_ids in FDConfig (#4362) | 2025-10-17 10:40:59 +08:00
layers | [PD Disaggregation] Support Qwen3-MoE use PD + EP inference. (#4691) | 2025-11-06 10:32:15 +08:00
logits_processor | [Feature] support logits processors (#4515) | 2025-10-29 00:08:53 +08:00
model_loader | [Speculative Decoding][MTP]Support mtp in epdptp mode (#4614) | 2025-10-28 16:02:47 +08:00
models | [Fix] fix ernie4_5_vl model torch format loadding (#4447) | 2025-11-06 14:34:21 +08:00
ops | delete useless code (#4544) | 2025-10-23 13:40:34 +08:00
__init__.py | polish code with new pre-commit rule (#2923) | 2025-07-19 23:19:27 +08:00
forward_meta.py | remove input_ids from ForwardMeta (#4793) | 2025-11-05 11:55:51 +08:00
load_weight_utils.py | cache scale load (#4624) | 2025-10-31 11:58:33 +08:00
pre_and_post_process.py | remove seq_lens_this_time (#4821) | 2025-11-06 11:06:28 +08:00
utils.py | [PD Disaggregation] Support Qwen3-MoE use PD + EP inference. (#4691) | 2025-11-06 10:32:15 +08:00