apps/FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-12-24 13:28:13 +08:00
2ae7ab28d2637bd327f2db9293be3673290063f5
FastDeploy/fastdeploy/model_executor
Kane2011 2ae7ab28d2 [MetaxGPU] adapt to the latest fastdeploy on metax gpu (#3492)
2025-08-25 17:44:20 +08:00
..
graph_optimization
[Excutor] Change cudagraph hashkey from batch size to num_tokens (#3454)
2025-08-18 16:16:48 +08:00
guided_decoding
add error traceback info (#3419)
2025-08-19 19:32:04 +08:00
layers
[MetaxGPU] adapt to the latest fastdeploy on metax gpu (#3492)
2025-08-25 17:44:20 +08:00
model_loader
[V1 Loader] support weight_only (#3413)
2025-08-23 13:13:41 +08:00
models
support w4afp8 EP inference (#3044)
2025-08-25 11:27:45 +08:00
ops
…
__init__.py
…
forward_meta.py
[Excutor] Increase buffer size to prevent address corruption; add forward metadata debug tool (#3404)
2025-08-18 16:14:09 +08:00
load_weight_utils.py
…
pre_and_post_process.py
[MetaxGPU] adapt to the latest fastdeploy on metax gpu (#3492)
2025-08-25 17:44:20 +08:00
utils.py
[V1 Loader] support weight_only (#3413)
2025-08-23 13:13:41 +08:00