Logo
Explore Help
Sign In
apps/FastDeploy
1
0
Fork 0
You've already forked FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-12-24 13:28:13 +08:00
Code Issues Actions 2 Packages Projects Releases Wiki Activity
Files
05b6591c23a5fc1c300205d7bcad397c19e7f736
FastDeploy/fastdeploy/model_executor
History
gaoziyuan 05b6591c23 【BugFix】add moe noaux_tc tatics in trition backend (#3821)
* add moe noaux_tc tatics in trition backend

* fix

* add dp config
2025-09-03 13:28:44 +08:00
..
graph_optimization
[Executor] Fix bug of import paddle with RLHF (#3781) (#3817)
2025-09-02 21:42:59 +08:00
guided_decoding
rename ernie_xxx to ernie4_5_xxx (#3621)
2025-08-26 19:29:27 +08:00
layers
【BugFix】add moe noaux_tc tatics in trition backend (#3821)
2025-09-03 13:28:44 +08:00
model_loader
[BugFix]fix dp&ep&tp and muti node infer (#3629)
2025-08-28 19:09:10 +08:00
models
[Feature]support load eb 0.3B and 21B torch model (#3660)
2025-08-29 20:00:48 +08:00
ops
fix cpu __ini__.py (#3448)
2025-08-17 12:38:54 +08:00
__init__.py
polish code with new pre-commit rule (#2923)
2025-07-19 23:19:27 +08:00
forward_meta.py
enable dcu ci (#3402)
2025-08-29 10:23:08 +08:00
load_weight_utils.py
[v1loader]Reduce EB300B model loading time (#3700) (#3810)
2025-09-03 10:14:31 +08:00
pre_and_post_process.py
add dtype int32 (#3692)
2025-08-29 14:56:35 +08:00
utils.py
[v1loader]Reduce EB300B model loading time (#3700) (#3810)
2025-09-03 10:14:31 +08:00
Powered by Gitea Version: 1.25.2 Page: 1742ms Template: 6ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API