FastDeploy/fastdeploy/worker
lizexu123 b0cf2c4b7a [Feature] Support prefill batch inference for pooling models. (#5436)
* fix multi-inputs

* fix threshold

* fix threshold

* fix

* support multi-batch

* add tests

* fix test

* test

* fix
2025-12-09 16:21:00 +08:00