Files
FastDeploy/fastdeploy
lizexu123 b0cf2c4b7a [Feature] Support prefill batch inference for pooling models. (#5436)
* fix multi-inputs

* fix threshold

* fix threshold

* fix

* support multi-batch

* add tests

* fix test

* test

* fix
2025-12-09 16:21:00 +08:00
..
2025-12-09 16:16:16 +08:00
2025-12-04 19:22:04 +08:00
2025-07-22 14:06:01 +08:00
2025-07-03 15:43:53 +08:00