[Graph Optimization][BugFix][CI] Fix 0size bug && add unitest (#5495)

This commit is contained in:
Ryan
2025-12-11 16:25:26 +08:00
committed by GitHub
parent e1347be4d9
commit e58fed3665
7 changed files with 25 additions and 4 deletions

View File

@@ -0,0 +1,8 @@
max_model_len: 32768
max_num_seqs: 128
tensor_parallel_size: 1
quantization: wint4
graph_optimization_config:
graph_opt_level: 2
sot_warmup_sizes: [2,16,32,64]
use_cudagraph: True

View File

@@ -91,7 +91,7 @@ def setup_and_run_server():
"--reasoning-parser",
"ernie-45-vl",
"--graph-optimization-config",
'{"graph_opt_level": 1, "use_cudagraph": true, "full_cuda_graph": false}',
'{"graph_opt_level": 2, "use_cudagraph": true, "full_cuda_graph": false}',
]
# Start subprocess in new process group