mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced 2025-12-24 13:28:13 +08:00
update wint2 doc
This commit is contained in:
@@ -13,7 +13,7 @@ export FD_MODEL_CACHE=/ssd1/download_models
|
||||
|
||||
| Model Name | Context Length | Quantization | Minimum Deployment Resources | Notes |
|
||||
| :--------- | :------------- | :----------- | :-------------------------- | :---- |
|
||||
| baidu/ERNIE-4.5-VL-424B-A47B-Paddle | 32K/128K | WINT2 | 1*96G GPU VRAM/1T RAM | Chunked Prefill required for 128K |
|
||||
| baidu/ERNIE-4.5-VL-424B-A47B-Paddle | 32K/128K | WINT2 | 1*141G GPU VRAM/1T RAM | Chunked Prefill required for 128K |
|
||||
| baidu/ERNIE-4.5-VL-424B-A47B-Paddle | 32K/128K | WINT4 | 4*80G GPU VRAM/1T RAM | Chunked Prefill required for 128K |
|
||||
| baidu/ERNIE-4.5-VL-424B-A47B-Paddle | 32K/128K | WINT8 | 8*80G GPU VRAM/1T RAM | Chunked Prefill required for 128K |
|
||||
| baidu/ERNIE-4.5-300B-A47B-Paddle | 32K/128K | WINT4 | 4*64G GPU VRAM/600G RAM | Chunked Prefill required for 128K |
|
||||
|
||||
Reference in New Issue
Block a user