Files
FastDeploy/fastdeploy/model_executor/layers/quantization
zhupengyang 8735cb5045 [XPU] refactor moe ffn (#5501)
- remove BKCL_DISPATCH_ALL_GATHER
- support sparse mode
- support moe quant_method
2025-12-18 14:14:05 +08:00
..
2025-11-11 21:30:39 +08:00
2025-12-18 14:14:05 +08:00
2025-09-03 10:57:26 +08:00
2025-12-18 14:14:05 +08:00
2025-11-11 21:30:39 +08:00
2025-10-31 15:44:14 +08:00