Modified to support custom all reduce by default (#3538)

2025-12-24 13:28:13 +08:00 · 2025-08-22 16:59:05 +08:00
parent 27666ee586
commit df7c31012b
15 changed files with 18 additions and 30 deletions
--- a/docs/best_practices/ERNIE-4.5-0.3B-Paddle.md
+++ b/docs/best_practices/ERNIE-4.5-0.3B-Paddle.md
@@ -77,8 +77,7 @@ Add the following lines to the startup parameters
 ```
 Notes:
 1. Usually, no additional parameters need to be set, but CUDAGraph will generate some additional memory overhead, which may need to be adjusted in some scenarios with limited memory. For detailed parameter adjustments, please refer to [GraphOptimizationBackend](../features/graph_optimization.md) for related configuration parameter descriptions
-2. When CUDAGraph is enabled, if running with multi-GPUs TP>1, `--enable-custom-all-reduce` must be specified at the same time.
-3. When CUDAGraph is enabled, the scenario of `max-model-len > 32768` is not currently supported.
+2. When CUDAGraph is enabled, the scenario of `max-model-len > 32768` is not currently supported.

 #### 2.2.6 Rejection Sampling
 **Idea:**
--- a/docs/best_practices/ERNIE-4.5-21B-A3B-Paddle.md
+++ b/docs/best_practices/ERNIE-4.5-21B-A3B-Paddle.md
@@ -87,8 +87,7 @@ Add the following lines to the startup parameters
 ```
 Notes:
 1. Usually, no additional parameters need to be set, but CUDAGraph will generate some additional memory overhead, which may need to be adjusted in some scenarios with limited memory. For detailed parameter adjustments, please refer to [GraphOptimizationBackend](../features/graph_optimization.md) for related configuration parameter descriptions
-2. When CUDAGraph is enabled, if running with multi-GPUs TP>1, `--enable-custom-all-reduce` must be specified at the same time.
-3. When CUDAGraph is enabled, the scenario of `max-model-len > 32768` is not currently supported.
+2. When CUDAGraph is enabled, the scenario of `max-model-len > 32768` is not currently supported.

 #### 2.2.6 Rejection Sampling
 **Idea:**
--- a/docs/best_practices/ERNIE-4.5-300B-A47B-Paddle.md
+++ b/docs/best_practices/ERNIE-4.5-300B-A47B-Paddle.md
@@ -132,12 +132,10 @@ CUDAGraph is a GPU computing acceleration technology provided by NVIDIA. It achi
 Add the following lines to the startup parameters
 ```
 --use-cudagraph
--enable-custom-all-reduce
 ```
 Notes:
 1. Usually, no additional parameters need to be set, but CUDAGraph will generate some additional memory overhead, which may need to be adjusted in some scenarios with limited memory. For detailed parameter adjustments, please refer to [GraphOptimizationBackend](../features/graph_optimization.md) for related configuration parameter descriptions
-2. When CUDAGraph is enabled, if running with multi-GPUs TP>1, `--enable-custom-all-reduce` must be specified at the same time.
-3. When CUDAGraph is enabled, the scenario of `max-model-len > 32768` is not currently supported.
+2. When CUDAGraph is enabled, the scenario of `max-model-len > 32768` is not currently supported.

 ## FAQ
 If you encounter any problems during use, you can refer to [FAQ](./FAQ.md).