mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced 2025-10-10 02:50:19 +08:00
@@ -10,7 +10,7 @@ The following installation methods are available when your environment meets the
|
||||
|
||||
## 1. Pre-built Docker Installation (Recommended)
|
||||
|
||||
**Notice**: The pre-built image only supports SM80/90 GPU(e.g. H800/A800),if you are deploying on SM86/89GPU(L40/4090/L20), please reinstall ```fastdpeloy-gpu``` after you create the container.
|
||||
**Notice**: The pre-built image only supports SM80/90 GPU(e.g. H800/A800),if you are deploying on SM86/89GPU(L40/4090/L20), please reinstall ```fastdeploy-gpu``` after you create the container.
|
||||
|
||||
```shell
|
||||
docker pull ccr-2vdh3abv-pub.cnc.bj.baidubce.com/paddlepaddle/fastdeploy-cuda-12.6:2.2.0
|
||||
|
@@ -20,6 +20,6 @@ Below is an overview of the FastDeploy code structure and functionality organize
|
||||
- ```platforms```: Platform-specific modules for underlying hardware support.
|
||||
- ```scheduler```: Request scheduling module for large models.
|
||||
- ```metrics```: Core component for collecting, managing, and exporting Prometheus metrics, tracking key runtime performance data (e.g., request latency, resource utilization, successful request counts).
|
||||
- ```splitwise```: Modules related to PD disaggragation deployment.
|
||||
- ```splitwise```: Modules related to PD disaggregation deployment.
|
||||
- ```scripts```/```tools```: Utility scripts for FastDeploy operations (e.g., compilation, unit testing, code style fixes).
|
||||
- ```test```: Code for unit testing and validation.
|
||||
|
@@ -30,7 +30,7 @@ By default, logs are stored in the `log` directory under the execution path. To
|
||||
* `cache_transfer_manager.log` : Logs startup parameters and received request information.
|
||||
* `launch_cache_manager.log` : Records cache transfer startup parameters and error messages.
|
||||
|
||||
## PD Disaggragation Logs
|
||||
## PD Disaggregation Logs
|
||||
* `cache_messager.log` : Logs transmission protocols and messages used by the P instance.
|
||||
* `splitwise_connector.log` : Records data received from P/D instances and connection establishment details.
|
||||
|
||||
|
Reference in New Issue
Block a user