[feature]2.2 custom_allreduce support cudagraph recapture (#4307)

* custom_allreduce support cudagraph recapture

* delete code

* add shut_down/restart default group
This commit is contained in:
chen
2025-09-29 18:14:21 +08:00
committed by GitHub
parent 9cc2c99539
commit 63a03ee152
7 changed files with 31 additions and 3 deletions

View File

@@ -66,6 +66,7 @@ class DynamicWeightManager:
paddle.device.cuda.empty_cache()
if not self.first_load:
paddle.distributed.restart_process_group()
paddle.distributed.restart_process_group(self.parallel_config.tp_group)
if self.parallel_config.enable_expert_parallel:
paddle.distributed.restart_process_group(self.parallel_config.ep_group)