Files
FastDeploy/docs/metrics.md
2025-06-16 00:04:48 +08:00

21 lines
1.0 KiB
Markdown
Raw Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

# FastDeploy 指标说明
| 指标名称 | 类型 | 说明 | 单位 |
| ------------------------------------------- | --------- |----------------| ---- |
| `fastdeploy:num_requests_running` | Gauge | 当前正在运行的请求数量 | 个 |
| `fastdeploy:num_requests_waiting` | Gauge | 当前等待中的请求数量 | 个 |
| `fastdeploy:time_to_first_token_seconds` | Histogram | 首 token 生成所需时间 | 秒 |
| `fastdeploy:time_per_output_token_seconds` | Histogram | 间隔输出 token 的生成时间 | 秒 |
| `fastdeploy:e2e_request_latency_seconds` | Histogram | 请求的端到端延迟分布 | 秒 |
| `fastdeploy:request_inference_time_seconds` | Histogram | 请求在RUNNING阶段耗时 | 秒 |
| `fastdeploy:request_queue_time_seconds` | Histogram | 请求在WAITING阶段耗时 | 秒 |
## 指标访问
- 访问地址:`http://localhost:8000/metrics`
- 指标类型Prometheus 格式