Logo
Explore Help
Sign In
apps/FastDeploy
1
0
Fork 0
You've already forked FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-12-24 13:28:13 +08:00
Code Issues Actions 2 Packages Projects Releases Wiki Activity
Files
7d87aaace855c81dc33dfabcd2d35a4355cf79d6
FastDeploy/fastdeploy/model_executor/layers/moe
History
Yuan Xiaolan b1d787a272 [fix] w4a8 model loading and hadamard config (#3013)
2025-07-28 18:17:59 +08:00
..
__init__.py
polish code with new pre-commit rule (#2923)
2025-07-19 23:19:27 +08:00
ep.py
[Feat] support mixed ep (#2969)
2025-07-25 15:29:30 +08:00
fused_moe_backend_base.py
fix arguement error (#3030)
2025-07-28 11:03:29 +08:00
fused_moe_cutlass_backend.py
[fix] w4a8 model loading and hadamard config (#3013)
2025-07-28 18:17:59 +08:00
fused_moe_deepgemm_backend.py
[Feature] Support_eplb (#2997)
2025-07-24 20:22:45 +08:00
fused_moe_marlin_backend.py
update flake8 version to support pre-commit in python3.12 (#3000)
2025-07-24 01:43:31 -07:00
fused_moe_triton_backend.py
update flake8 version to support pre-commit in python3.12 (#3000)
2025-07-24 01:43:31 -07:00
fused_moe_wint2_backend.py
【Inference Optimize】Update wint2 weight n-dim reorder (#3042)
2025-07-28 16:31:56 +08:00
fused_moe_xpu_backend.py
custom all reduce support cuda graph (#2938)
2025-07-21 22:52:03 +08:00
moe.py
[fix] w4a8 model loading and hadamard config (#3013)
2025-07-28 18:17:59 +08:00
triton_moe_kernels.py
polish code with new pre-commit rule (#2923)
2025-07-19 23:19:27 +08:00
Powered by Gitea Version: 1.25.2 Page: 65ms Template: 6ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API