FastDeploy

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-12-24 13:28:13 +08:00

Files

K11OntheBoat 62dfad4a5f [PD Disaggregation] Support Qwen3-MoE use PD + EP inference. (#4691 )

support Qwen-MoE PD/EP

2025-11-06 10:32:15 +08:00

__init__.py

2025-08-25 11:27:45 +08:00

ep.py

2025-11-04 16:35:40 +08:00

fused_moe_backend_base.py

2025-11-06 10:32:15 +08:00

fused_moe_cutlass_backend.py

2025-11-05 21:00:23 +08:00

fused_moe_deepgemm_backend.py

2025-11-06 10:32:15 +08:00

fused_moe_marlin_backend.py

2025-10-30 18:59:04 +08:00

fused_moe_triton_backend.py

2025-10-17 11:47:16 +08:00

fused_moe_wint2_backend.py

2025-11-05 21:00:23 +08:00

moe.py

2025-11-06 10:32:15 +08:00

triton_moe_kernels.py

2025-09-24 16:39:51 +08:00