default enable chunked prefill (#3731)

* add error traceback info

* update error msg

* update code

* default enable chunked prefill

* update code

* update code

* add envs

* update code

---------

Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com>
kevin
2025-08-31 13:15:13 +08:00
committed by GitHub
parent 98e03fb4ea
commit 753772ace8
4 changed files with 13 additions and 1 deletions

@@ -243,7 +243,7 @@ class EngineArgs:
     Ports for rdma communication.
     """
-    enable_chunked_prefill: bool = False
+    enable_chunked_prefill: bool = True
     """
     Flag to enable chunked prefilling.
     """
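
The practical effect of this hunk is that callers who never set `enable_chunked_prefill` now get chunked prefill, and callers who relied on the old behavior have to opt out explicitly. A minimal sketch of that change, assuming `EngineArgs` behaves like a plain dataclass; `EngineArgsSketch` is a hypothetical stand-in carrying only the field touched here, not the real class:

```python
from dataclasses import dataclass


@dataclass
class EngineArgsSketch:
    """Hypothetical stand-in for EngineArgs; only the field from this hunk."""

    # Default after this commit (it was False before).
    enable_chunked_prefill: bool = True


# Callers that pass nothing now get chunked prefill enabled.
default_args = EngineArgsSketch()

# Callers that depended on the old default must now opt out explicitly.
legacy_args = EngineArgsSketch(enable_chunked_prefill=False)

print(default_args.enable_chunked_prefill)
print(legacy_args.enable_chunked_prefill)
```

The other three changed files are not shown in this hunk, so any environment variable or command-line override they introduce is not modeled in the sketch.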