This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2025-12-24 13:28:13 +08:00
Code
Issues
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
50100f98d7517c888cb4bb419ec19aa3b38ea075
FastDeploy
/
fastdeploy
/
entrypoints
History
GoldPancake
909059c60a
[Feature] Support for request-level speculative decoding metrics monitoring. (
#5518
)
...
* support spec metrics monitor per request * fix bug * remove debug log * fix ut bugs
2025-12-12 12:22:18 +08:00
..
cli
[CLI]Update parameters in bench latecy cli tool and fix collect-env cli tool (
#4558
)
2025-10-24 16:46:45 +08:00
openai
[Feature] Support for request-level speculative decoding metrics monitoring. (
#5518
)
2025-12-12 12:22:18 +08:00
__init__.py
polish code with new pre-commit rule (
#2923
)
2025-07-19 23:19:27 +08:00
api_server.py
add error traceback info (
#3419
)
2025-08-19 19:32:04 +08:00
chat_utils.py
[Feature] mm support prefix cache (
#4134
)
2025-10-27 17:39:51 +08:00
engine_client.py
[Optimization] support mm prefill batch (
#5313
)
2025-12-11 22:21:14 +08:00
llm.py
fix logprobs (
#5335
)
2025-12-04 10:38:51 +08:00