Logo
Explore Help
Sign In
apps/FastDeploy
1
0
Fork 0
You've already forked FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-11-03 11:02:01 +08:00
Code Issues Actions 5 Packages Projects Releases Wiki Activity
Files
8c0e7d6fe9caacd96afb7717124b1a02ce425be6
FastDeploy/fastdeploy/model_executor
History
yangjianfengo1 9213a58a06 【Fix bug] w4afp8 的nblock固定为256,并且fa3的append attn 增加mask参数 (#3771) (#3835)
* fix w4afp8

* 增加集中式配置

* codestyle

* fix fa3 append attn
2025-09-03 19:36:45 +08:00
..
graph_optimization
[Executor] Fix bug of import paddle with RLHF (#3781) (#3817)
2025-09-02 21:42:59 +08:00
guided_decoding
rename ernie_xxx to ernie4_5_xxx (#3621)
2025-08-26 19:29:27 +08:00
layers
【Fix bug] w4afp8 的nblock固定为256,并且fa3的append attn 增加mask参数 (#3771) (#3835)
2025-09-03 19:36:45 +08:00
model_loader
[BugFix]fix dp&ep&tp and muti node infer (#3629)
2025-08-28 19:09:10 +08:00
models
[Feature]support load eb 0.3B and 21B torch model (#3660)
2025-08-29 20:00:48 +08:00
ops
fix cpu __ini__.py (#3448)
2025-08-17 12:38:54 +08:00
__init__.py
polish code with new pre-commit rule (#2923)
2025-07-19 23:19:27 +08:00
forward_meta.py
enable dcu ci (#3402)
2025-08-29 10:23:08 +08:00
load_weight_utils.py
[v1loader]Reduce EB300B model loading time (#3700) (#3810)
2025-09-03 10:14:31 +08:00
pre_and_post_process.py
add dtype int32 (#3692)
2025-08-29 14:56:35 +08:00
utils.py
[v1loader]Reduce EB300B model loading time (#3700) (#3810)
2025-09-03 10:14:31 +08:00
Powered by Gitea Version: 1.24.7 Page: 526ms Template: 7ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API