FastDeploy/benchmarks/yaml/qwen25_7b-vl-32k-bf16.yaml at 19df1aec2bd586e5273eee2b593c8280f4856522 - FastDeploy - 子说镜像小站


mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-12-24 13:28:13 +08:00


xjkmfa 19df1aec2b [Docs] add Qwen25vl yaml (#4662)

* Add ci case for min token and max token

* [CI case] include total_tokens in the last packet of completion interface stream output

* [CE] add qwen25-vl

* [CE] add qwen25-vl

---------

Co-authored-by: xujing43 <xujing43@baidu.com>

2025-10-29 17:39:40 +08:00

6 lines

159 B

YAML


max_model_len: 32768
max_num_seqs: 128
gpu_memory_utilization: 0.85
tensor_parallel_size: 1
limit_mm_per_prompt: '{"image": 100, "video": 100}'
enable_mm: True
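
Note that `limit_mm_per_prompt` is a single-quoted YAML scalar, so a loader returns it as a string holding JSON rather than as a nested mapping. The sketch below shows one way a consumer might decode it. It is illustrative only and is not taken from FastDeploy: `parse_flat_yaml` is a hypothetical stand-in for a real YAML loader that handles only flat `key: value` files like this one, and how FastDeploy itself reads the file is not shown on this page.

```python
import json

# Contents of qwen25_7b-vl-32k-bf16.yaml, copied verbatim.
CONFIG_TEXT = """\
max_model_len: 32768
max_num_seqs: 128
gpu_memory_utilization: 0.85
tensor_parallel_size: 1
limit_mm_per_prompt: '{"image": 100, "video": 100}'
enable_mm: True
"""


def parse_flat_yaml(text):
    """Hypothetical helper: parse flat `key: value` lines only.

    This is a simplification for illustration, not a general YAML parser.
    """
    config = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        # Split at the first colon so colons inside the value are preserved.
        key, _, raw = line.partition(":")
        raw = raw.strip()
        if len(raw) >= 2 and raw[0] == raw[-1] == "'":
            value = raw[1:-1]  # single-quoted scalar stays a string
        elif raw in ("True", "true"):
            value = True
        elif raw in ("False", "false"):
            value = False
        else:
            try:
                value = int(raw)
            except ValueError:
                value = float(raw)
        config[key.strip()] = value
    return config


config = parse_flat_yaml(CONFIG_TEXT)

# The multimodal limits arrive as a JSON string and need a second decode.
mm_limits = json.loads(config["limit_mm_per_prompt"])

for name, value in config.items():
    print(f"{name} = {value!r}")
print("decoded multimodal limits:", mm_limits)
```

Keeping the limits as a quoted JSON string means the value can be passed through unchanged to anything that expects a JSON argument, at the cost of this extra decoding step when reading it in Python.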