apps/FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-12-24 13:28:13 +08:00
83532e1d01fc645c767e530f9a6a3a0b11bb1abe
FastDeploy/fastdeploy/model_executor
Echo-Nie c18b177f21 fix the get_act_fn,_load_st_projector (#4824)
2025-11-06 16:13:35 +08:00
graph_optimization
[Graph Optimization] Refactor default capture list (#4617)
2025-10-28 21:31:02 +08:00
guided_decoding
[FDConfig]Remove reasoning_parser/guided_decoding_backend/disable_any_whitespace/device_ids in FDConfig (#4362)
2025-10-17 10:40:59 +08:00
layers
fix the get_act_fn,_load_st_projector (#4824)
2025-11-06 16:13:35 +08:00
logits_processor
[Feature] support logits processors (#4515)
2025-10-29 00:08:53 +08:00
model_loader
[Speculative Decoding][MTP]Support mtp in epdptp mode (#4614)
2025-10-28 16:02:47 +08:00
models
fix the get_act_fn,_load_st_projector (#4824)
2025-11-06 16:13:35 +08:00
ops
delete useless code (#4544)
2025-10-23 13:40:34 +08:00
__init__.py
polish code with new pre-commit rule (#2923)
2025-07-19 23:19:27 +08:00
forward_meta.py
remove input_ids from ForwardMeta (#4793)
2025-11-05 11:55:51 +08:00
load_weight_utils.py
cache scale load (#4624)
2025-10-31 11:58:33 +08:00
pre_and_post_process.py
remove seq_lens_this_time (#4821)
2025-11-06 11:06:28 +08:00
utils.py
[PD Disaggregation] Support Qwen3-MoE use PD + EP inference. (#4691)
2025-11-06 10:32:15 +08:00