Commit Graph

5 Commits

Author SHA1 Message Date
chenjian
cc8f5312f5 [Feature] Add timestamp for profiler (#4726)
* [Feature] Add timestamp for profiler

* fix bug for offine inference

* fix for ci

* fix

* fix ci
2025-11-05 12:04:59 +08:00
chen
1c3ca48128 [Feature][Executor] GPU Model Runner Supports prompt_logprobs and max_logprobs (#4769) 2025-11-05 10:43:25 +08:00
zhouchong
35286ce31a fix total_block_num init error in worker_process (#4687) 2025-10-30 19:53:09 +08:00
chen
5c63a089f6 [Feature] Support logprobs_mode (#4567) 2025-10-27 14:27:48 +08:00
zhouchong
dce988824d [Feature] Support AsyncLLM (#4458)
Some checks failed
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
* add async_llm

* apply review

* update engine config

* Adapt to latest engine.py changes

* add more unit tests

* Increase unit test coverage

---------

Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com>
2025-10-22 15:50:12 +08:00