Yuanle Liu
|
3dc0ffa46d
|
[TSP] Support qwen3 moe tsp + cudagraph (#4871)
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
* support qwen3_moe tsp mode
* fix
* fix
* update
* update
* update
* fix
* support external_rmsnorm
* update
* fix
|
2025-11-10 23:37:51 +08:00 |
|
chenjian
|
78895e2c7d
|
[Bug Fix] fix bug for PD EP (#4823)
* fix bug for PD EP
* fix
* optimize perf for engine worker queue
* fix bug
* fix internode ll two stage
* fix for ci
* fix bug
|
2025-11-10 15:33:29 +08:00 |
|
chenjian
|
cc8f5312f5
|
[Feature] Add timestamp for profiler (#4726)
* [Feature] Add timestamp for profiler
* fix bug for offine inference
* fix for ci
* fix
* fix ci
|
2025-11-05 12:04:59 +08:00 |
|
chen
|
1c3ca48128
|
[Feature][Executor] GPU Model Runner Supports prompt_logprobs and max_logprobs (#4769)
|
2025-11-05 10:43:25 +08:00 |
|
zhouchong
|
35286ce31a
|
fix total_block_num init error in worker_process (#4687)
|
2025-10-30 19:53:09 +08:00 |
|
chen
|
5c63a089f6
|
[Feature] Support logprobs_mode (#4567)
|
2025-10-27 14:27:48 +08:00 |
|
zhouchong
|
dce988824d
|
[Feature] Support AsyncLLM (#4458)
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
* add async_llm
* apply review
* update engine config
* Adapt to latest engine.py changes
* add more unit tests
* Increase unit test coverage
---------
Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com>
|
2025-10-22 15:50:12 +08:00 |
|