Files
FastDeploy/benchmarks/yaml/qwen25_7b-vl-32k-bf16.yaml
xjkmfa 19df1aec2b [Docs] add Qwen25vl yaml (#4662)
* Add ci case for min token and max token

* 【CI case】include total_tokens in the last packet of completion interface stream output

* 【CE】add qwen25-vl

* 【CE】add qwen25-vl

---------

Co-authored-by: xujing43 <xujing43@baidu.com>
2025-10-29 17:39:40 +08:00

6 lines
159 B
YAML

max_model_len: 32768
max_num_seqs: 128
gpu_memory_utilization: 0.85
tensor_parallel_size: 1
limit_mm_per_prompt: '{"image": 100, "video": 100}'
enable_mm: True