apps/FastDeploy
Mirror of https://github.com/PaddlePaddle/FastDeploy.git (synced 2025-12-24 13:28:13 +08:00)
gzy19990617-patch-1
FastDeploy/fastdeploy/entrypoints
GoldPancake | e56c4dd0a8 | [Cherry-Pick] Support for request-level speculative decoding metrics monitoring.(#5518) (#5614) | 2025-12-17 20:53:04 +08:00
* support spec metrics monitor per request
..
cli | [CLI]Update parameters in bench latecy cli tool and fix collect-env cli tool (#4558) | 2025-10-24 16:46:45 +08:00
openai | [Cherry-Pick] Support for request-level speculative decoding metrics monitoring.(#5518) (#5614) | 2025-12-17 20:53:04 +08:00
__init__.py | polish code with new pre-commit rule (#2923) | 2025-07-19 23:19:27 +08:00
api_server.py | add error traceback info (#3419) | 2025-08-19 19:32:04 +08:00
chat_utils.py | [Feature] mm support prefix cache (#4134) | 2025-10-27 17:39:51 +08:00
engine_client.py | [Optimization] Qwen2.5-VL support multi-batch prefill (#5269) | 2025-12-05 18:22:39 +08:00
llm.py | fix logprobs (#5335) | 2025-12-04 10:38:51 +08:00