add w4afp8 offline script (#3636)

This commit is contained in:
Yuan Xiaolan
2025-08-29 17:56:05 +08:00
committed by GitHub
parent f677c032c0
commit c71ee0831c
12 changed files with 163 additions and 37 deletions

View File

@@ -60,12 +60,13 @@ curl -i http://0.0.0.0:8180/health
Send requests to the service with the following command:
```shell
curl -X POST "http://0.0.0.0:8180/v1/chat/completions" \
curl -X POST "http://0.0.0.0:1822/v1/chat/completions" \
-H "Content-Type: application/json" \
-d '{
"messages": [
{"role": "user", "content": "Write me a poem about large language model."}
]
],
"stream": true
}'
```