Files
FastDeploy/tests
fmiao2372 404cf0ece4 [Intel HPU] enable tensor_wise_fp8 (#5324)
* [Intel HPU] enable tensor_wise_fp8

* update code based on comments

* fix code style issue

* fix bug about RP 5138

* mv kv_cache modifications to HPU backend

* fix FP8 Precision Issues

* fix FP8 Precision Issues

* Add quantization UT

---------

Co-authored-by: yanfeich <yanfei.cheng@intel.com>
Co-authored-by: YuBaoku <49938469+EmmonsCurse@users.noreply.github.com>
2025-12-17 16:45:03 +08:00
..
2025-12-09 19:19:42 +08:00
2025-12-09 19:19:42 +08:00
2025-11-25 11:00:34 +08:00
2025-09-22 14:09:09 +08:00
2025-12-09 19:19:42 +08:00
2025-11-12 20:26:49 +08:00