FastDeploy/fastdeploy/engine/sched
Yonghua Li 3672afb487 [Cherry-Pick] [Metrics] Update time_to_first_token to include tokenization & queue time, and remove redundant metrics (#5076)
* [update] update time_to_first_token to include queue time, and remove first_token_latency and infer_latency

* [doc] update docs

* [ci] fix test
2025-11-18 14:38:59 +08:00