Logo
Explore Help
Sign In
apps/FastDeploy
1
0
Fork 0
You've already forked FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-10-08 10:00:29 +08:00
Code Issues Actions 2 Packages Projects Releases Wiki Activity
Files
7bdc6f41e50836f3c2276e9df843d87fe0fd6103
FastDeploy/fastdeploy/model_executor/models
History
chen 7bdc6f41e5 fix glm all_reduce tp group (#4188)
2025-09-22 10:57:13 +08:00
..
ernie4_5_vl
[Bug Fix] VL Support w4a8/w4afp8 (#3686)
2025-08-28 21:38:35 +08:00
qwen2_5_vl
[Model]support qwen2_5_vl (#3557)
2025-08-29 18:28:39 +08:00
__init__.py
add input_processor plugin (#3657)
2025-08-28 22:53:57 +08:00
deepseek_v3.py
[Precision] Support lm_head layer running in float32 (#3597)
2025-08-27 11:34:53 +08:00
ernie4_5_moe.py
【CP】Compatible with EB 0.3B torch model arch (#3914)
2025-09-05 19:05:07 +08:00
ernie4_5_mtp.py
fix mtp (#4153)
2025-09-18 10:53:07 +08:00
glm4_moe.py
fix glm all_reduce tp group (#4188)
2025-09-22 10:57:13 +08:00
model_base.py
[plugin] Custom model_runner/model support (#3186)
2025-08-04 18:52:39 -07:00
qwen2.py
[Precision] Support lm_head layer running in float32 (#3597)
2025-08-27 11:34:53 +08:00
qwen3.py
check (#3639)
2025-08-27 14:32:13 +08:00
qwen3moe.py
fix deepcopy(tp_group) in spec (#3648)
2025-08-29 16:08:21 +08:00
tp_utils.py
Supports DP+TP+EP hybrid parallel deployment strategy (#3489)
2025-08-26 00:04:01 -07:00
utils.py
[V1 Loader] support weight_only (#3413)
2025-08-23 13:13:41 +08:00
Powered by Gitea Version: 1.24.5 Page: 1036ms Template: 17ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API