Logo
Explore Help
Sign In
apps/FastDeploy
1
0
Fork 0
You've already forked FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-12-24 13:28:13 +08:00
Code Issues Actions 2 Packages Projects Releases Wiki Activity
Files
c18b177f2177811537be2aece9dd6cbb9a842d50
FastDeploy/fastdeploy/model_executor
History
Echo-Nie c18b177f21 fix the get_act_fn,_load_st_projector (#4824)
2025-11-06 16:13:35 +08:00
..
graph_optimization
[Graph Optimization] Refactor default capture list (#4617)
2025-10-28 21:31:02 +08:00
guided_decoding
[FDConfig]Remove reasoning_parser/guided_decoding_backend/disable_any_whitespace/device_ids in FDConfig (#4362)
2025-10-17 10:40:59 +08:00
layers
fix the get_act_fn,_load_st_projector (#4824)
2025-11-06 16:13:35 +08:00
logits_processor
[Feature] support logits processors (#4515)
2025-10-29 00:08:53 +08:00
model_loader
[Speculative Decoding][MTP]Support mtp in epdptp mode (#4614)
2025-10-28 16:02:47 +08:00
models
fix the get_act_fn,_load_st_projector (#4824)
2025-11-06 16:13:35 +08:00
ops
delete useless code (#4544)
2025-10-23 13:40:34 +08:00
__init__.py
polish code with new pre-commit rule (#2923)
2025-07-19 23:19:27 +08:00
forward_meta.py
remove input_ids from ForwardMeta (#4793)
2025-11-05 11:55:51 +08:00
load_weight_utils.py
cache scale load (#4624)
2025-10-31 11:58:33 +08:00
pre_and_post_process.py
remove seq_lens_this_time (#4821)
2025-11-06 11:06:28 +08:00
utils.py
[PD Disaggregation] Support Qwen3-MoE use PD + EP inference. (#4691)
2025-11-06 10:32:15 +08:00
Powered by Gitea Version: 1.25.2 Page: 1236ms Template: 131ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API