Juncai | 0925d44f18 | [PD Disaggregation] support different tp_size for prefill and decode (#5296) * up * up * up * fix | 2025-12-01 17:50:20 +08:00 |
Daci | 7dc06cac6e | [BugFix] race condition [is_fetching] causing multiple fetch requests (#5238) * RouterArgs port str -> int * fix race condition [is_fetching] causing multiple fetch requests * bugfix: Delete duplicate input_ids tensor creation | 2025-11-28 13:41:36 +08:00 |
Juncai | f9b0545a7f | [PD Disaggregation] [Refine] Refine splitwise deployment (#5151) * Refine splitwise deployment * up | 2025-11-21 15:30:24 +08:00 |
Juncai | 08ca0f6aea | [Feature] [PD] add simple router and refine splitwise deployment (#4709) * add simple router and refine splitwise deployment * fix | 2025-11-06 14:56:02 +08:00 |