This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2025-12-24 13:28:13 +08:00
Code
Issues
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
abdcef30aa71f6fe2fbef7133ffe73868d1980a5
FastDeploy
/
fastdeploy
/
model_executor
/
layers
/
moe
History
Yuanle Liu
c3b2a60fb8
[BugFix] Fix the abnormal memory usage caused by shape errors in the triton moe backend (
#4026
)
...
* fix device_id to in * fix triton_moe bug
2025-09-09 20:05:54 -07:00
..
__init__.py
…
ep.py
[BugFix] fix error of import paddle.base.core.Config (
#3761
)
2025-09-02 17:23:27 +08:00
fused_moe_backend_base.py
V1 loader support ep (
#3801
)
2025-09-03 16:05:41 +08:00
fused_moe_cutlass_backend.py
cache feature (
#3857
)
2025-09-07 18:52:46 +08:00
fused_moe_deepgemm_backend.py
V1 loader support ep (
#3801
)
2025-09-03 16:05:41 +08:00
fused_moe_marlin_backend.py
…
fused_moe_triton_backend.py
[BugFix] Fix the abnormal memory usage caused by shape errors in the triton moe backend (
#4026
)
2025-09-09 20:05:54 -07:00
fused_moe_wint2_backend.py
【New Feature】集中式支持w4afp8 (
#3644
)
2025-08-28 10:53:24 +08:00
fused_moe_xpu_backend.py
…
moe.py
V1 loader support ep (
#3801
)
2025-09-03 16:05:41 +08:00
triton_moe_kernels.py
…