mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced 2025-12-24 13:28:13 +08:00
[RL] Support Rollout Routing Replay (#5321)
* [RL] Support Rollout Routing Replay * add routing indices cache * fix config bug and moe forward bug * R3 Support GLM * support eb4.5 * fix merge bug * Apply suggestion from @Copilot Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Apply suggestion from @Copilot Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Apply suggestion from @Copilot Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Apply suggestion from @Copilot Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * add routing replay ci * support glm topk * support orther top_k * fix ci bug * pre-commit * only support chatcmpl --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Co-authored-by: Yuanle Liu <yuanlehome@163.com>
This commit is contained in:
@@ -142,6 +142,8 @@ class ForwardMeta:
|
||||
caches: Optional[list[paddle.Tensor]] = None
|
||||
# Flag of profile run
|
||||
is_dummy_or_profile_run: bool = False
|
||||
# Routing Replay table buffer
|
||||
routing_replay_table: Optional[paddle.Tensor] = None
|
||||
|
||||
# chunked MoE related
|
||||
moe_num_chunk: int = 1
|
||||
|
||||
Reference in New Issue
Block a user