Files
FastDeploy/fastdeploy/model_executor/layers/quantization
Zero Rains ce1f353c70 Move create_parameters to __init__ in FuseMOE for CultassBackend and TritonBackend (#3148)
* w4a8 bug

* fix w4a8 bug

* remove code

* modify the triton backend

* fix ep

* fix the bug with tensor_wise_fp8 in triton backend

* fix the RL

* fix bug by merge

* fix the bug in w4a8

* fix the tensor_wise_fp8 bug

* fix RL
2025-08-08 15:55:47 +08:00
..
2025-07-24 12:00:52 +08:00
2025-07-31 19:58:05 +08:00
2025-07-31 19:58:05 +08:00
2025-08-06 14:45:27 +08:00
2025-08-06 14:45:27 +08:00
2025-08-06 14:45:27 +08:00
2025-08-06 14:45:27 +08:00