[Docs] add Qwen25vl yaml (#4662)

* Add ci case for min token and max token

* 【CI case】include total_tokens in the last packet of completion interface stream output

* 【CE】add qwen25-vl

* 【CE】add qwen25-vl

---------

Co-authored-by: xujing43 <xujing43@baidu.com>
This commit is contained in:
xjkmfa
2025-10-29 17:39:40 +08:00
committed by GitHub
parent 8f40dfa9bf
commit 19df1aec2b
2 changed files with 14 additions and 0 deletions

View File

@@ -0,0 +1,6 @@
max_model_len: 32768
max_num_seqs: 128
gpu_memory_utilization: 0.85
tensor_parallel_size: 1
limit_mm_per_prompt: '{"image": 100, "video": 100}'
enable_mm: True

View File

@@ -0,0 +1,8 @@
top_p: 0.8
temperature: 0.7
metadata:
min_tokens: 1
max_tokens: 32768
repetition_penalty: 1.05
frequency_penalty: 0
presence_penalty: 0