FastDeploy/benchmarks/yaml/eb45-32k-wint4-tp1-dp4_ep.yaml at 672620cdfeac535ff9a6fac23ed0a60472f12a5b - FastDeploy - 子说镜像小站

apps/FastDeploy

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-12-24 13:28:13 +08:00

Files

xiegegege b7e1e6c953 [CE]change yaml name

2025-12-04 19:14:11 +08:00

8 lines

162 B

YAML

Raw Blame History

 num_gpu_blocks_override: 1024
 max_model_len: 8192
 max_num_seqs: 64
 data_parallel_size: 4
 tensor_parallel_size: 1
 enable_expert_parallel: True
 quantization: wint4