This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2025-12-24 13:28:13 +08:00
Code
Issues
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
feature/online/unify_code_20250922
Add File
New File
Upload File
Apply Patch
FastDeploy
/
tests
/
e2e
History
K11OntheBoat
05b7800d80
Support limit thinking lengths (
#4244
)
...
Co-authored-by: K11OntheBoat <“
ruianmaidanglao@163.com
”>
2025-09-24 17:30:53 +08:00
..
test_EB_Lite_serving.py
add cache queue port (
#3904
)
2025-09-05 21:17:06 +08:00
test_EB_VL_Lite_serving.py
Support limit thinking lengths (
#4244
)
2025-09-24 17:30:53 +08:00
test_fake_Glm45_AIR_serving.py
[Feature] GLM-45-AIR Support Mix Quantization(Dense wfp8afp8 and wint8 triton_moe_backend) (
#4051
)
2025-09-11 20:08:09 +08:00
test_Qwen2-7B-Instruct_serving.py
[metrics] Add serveral observability metrics (
#3868
)
2025-09-08 14:13:13 +08:00