Files
FastDeploy/examples/splitwise/test.sh
Juncai 08ca0f6aea [Feature] [PD] add simple router and refine splitwise deployment (#4709)
* add simple router and refine splitwise deployment

* fix
2025-11-06 14:56:02 +08:00

21 lines
515 B
Bash

#!/bin/bash
# using v0 version, the request must be sent to the decode instance
# using v1 version, the request can be sent to the prefill or decode instance
# using v2 version, the request must be sent to the router
port=${1:-9000}
echo "port: ${port}"
unset http_proxy && unset https_proxy
curl -X POST "http://0.0.0.0:${port}/v1/chat/completions" \
-H "Content-Type: application/json" \
-d '{
"messages": [
{"role": "user", "content": "Introduce shenzhen"}
],
"max_tokens": 20,
"stream": true
}'