This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2025-10-06 09:07:10 +08:00
Code
Issues
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
31f639f10b81d9974418b943b4571d83431935d7
FastDeploy
/
fastdeploy
/
model_executor
History
Zero Rains
30b3f2dc07
Some checks failed
Deploy GitHub Pages / deploy (push)
Has been cancelled
Details
[BugFix][V1 Loader] fix the bug in creat weight for block_wise_fp8 (
#3486
)
2025-08-20 05:52:54 -07:00
..
graph_optimization
[Excutor] Change cudagraph hashkey from batch size to num_tokens (
#3454
)
2025-08-18 16:16:48 +08:00
guided_decoding
add error traceback info (
#3419
)
2025-08-19 19:32:04 +08:00
layers
[BugFix][V1 Loader] fix the bug in creat weight for block_wise_fp8 (
#3486
)
2025-08-20 05:52:54 -07:00
model_loader
[feat]add fast_weights_iterator (
#3258
)
2025-08-07 22:36:46 +08:00
models
【Inference Optimize】DeepSeek-v3 model inference performance optimization (
#3455
)
2025-08-19 10:42:42 +08:00
ops
fix cpu __ini__.py (
#3448
)
2025-08-17 12:38:54 +08:00
__init__.py
polish code with new pre-commit rule (
#2923
)
2025-07-19 23:19:27 +08:00
forward_meta.py
[Excutor] Increase buffer size to prevent address corruption; add forward metadata debug tool (
#3404
)
2025-08-18 16:14:09 +08:00
load_weight_utils.py
Move create_parameters to __init__ in FuseMOE for CultassBackend and TritonBackend (
#3148
)
2025-08-08 15:55:47 +08:00
pre_and_post_process.py
[Code Simplification] remove cum_offsets (
#3410
)
2025-08-18 20:21:25 +08:00