apps/FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git, synced 2025-12-24 13:28:13 +08:00
dd2e844ea3fdf8c66bf2d342a4f53c8fbafdc320
FastDeploy/tests/e2e
K11OntheBoat  4515ad21e9  Support limit thinking lengths (#4069)  2025-09-25 19:55:56 +08:00
Co-authored-by: K11OntheBoat <“ruianmaidanglao@163.com”>
test_EB_Lite_serving.py            add cache queue port (#3904)  2025-09-05 21:17:06 +08:00
test_EB_VL_Lite_serving.py         Support limit thinking lengths (#4069)  2025-09-25 19:55:56 +08:00
test_fake_Glm45_AIR_serving.py     [OPs] MoE support wfp8afp8(channelwise) and improve per_token_quant_fp8 (#4238)  2025-09-24 16:39:51 +08:00
test_Qwen2_5_VL_serving.py         [Model] Qwen2.5VL support --use-cudagraph and unit testing (#4087)  2025-09-24 19:45:01 +08:00
test_Qwen2-7B-Instruct_serving.py  [metrics] Add serveral observability metrics (#3868)  2025-09-08 14:13:13 +08:00