Mirror of https://github.com/PaddlePaddle/FastDeploy.git (synced 2025-10-11 19:40:25 +08:00)
default enable chunked prefill (#3731)
* add error traceback info
* update error msg
* update code
* default enable chunked prefill
* update code
* update code
* add envs
* update code

---------

Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com>
@@ -243,7 +243,7 @@ class EngineArgs:
     Ports for rdma communication.
     """
 
-    enable_chunked_prefill: bool = False
+    enable_chunked_prefill: bool = True
     """
     Flag to enable chunked prefilling.
     """
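A minimal sketch of what the default flip means for callers, under stated assumptions: `EngineArgsSketch` is a hypothetical stand-in that mirrors only the one field touched by this diff, and `plan_prefill` and `chunk_size` are illustrative names, not FastDeploy's scheduler or API. It shows only that code relying on the default now gets chunked prefill, and that the old behaviour needs an explicit opt-out.

```python
from dataclasses import dataclass


@dataclass
class EngineArgsSketch:
    """Hypothetical stand-in for EngineArgs; only the field changed in this diff."""

    # Default after this commit; it was False before.
    enable_chunked_prefill: bool = True


def plan_prefill(args, prompt_len, chunk_size):
    """Illustrative only: return the token count handled by each prefill step.

    With chunking disabled the whole prompt is one step; with it enabled the
    prompt is split into pieces of at most chunk_size tokens.
    """
    if not args.enable_chunked_prefill:
        return [prompt_len]
    return [
        min(chunk_size, prompt_len - start)
        for start in range(0, prompt_len, chunk_size)
    ]


# Callers that do not set the flag now get chunked prefill.
default_args = EngineArgsSketch()

# Callers that want the previous behaviour must opt out explicitly.
opt_out_args = EngineArgsSketch(enable_chunked_prefill=False)
```

In this sketch, `plan_prefill(default_args, 10, 4)` splits a 10-token prompt into three steps, while `plan_prefill(opt_out_args, 10, 4)` keeps it as a single step.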