Logo
Explore Help
Sign In
apps/FastDeploy
1
0
Fork 0
You've already forked FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-12-24 13:28:13 +08:00
Code Issues Actions 2 Packages Projects Releases Wiki Activity
Files
c7b8f4f8c696dfe7318cfa14a9545485027ffa8b
FastDeploy/fastdeploy/engine
History
liyonghua0910 c7b8f4f8c6 [fix] fix ipc suffix, use port instead
2025-09-12 21:41:59 +08:00
..
sched
[Optimize] optimize prefix cache in develop (#3890)
2025-09-12 10:15:59 +08:00
__init__.py
polish code with new pre-commit rule (#2923)
2025-07-19 23:19:27 +08:00
args_utils.py
[Feature] GLM-45-AIR Support Mix Quantization(Dense wfp8afp8 and wint8 triton_moe_backend) (#4051)
2025-09-11 20:08:09 +08:00
common_engine.py
[fix] fix ipc suffix, use port instead
2025-09-12 21:41:59 +08:00
engine.py
[fix] fix ipc suffix, use port instead
2025-09-12 21:41:59 +08:00
expert_service.py
[feat] support clearing prefix cache (cherry-picked from release/2.1)
2025-09-12 21:41:55 +08:00
kv_cache_interface.py
polish code with new pre-commit rule (#2923)
2025-07-19 23:19:27 +08:00
request.py
fix response processsors (#3826)
2025-09-04 16:01:25 +08:00
resource_manager.py
[metrics] Add serveral observability metrics (#3868)
2025-09-08 14:13:13 +08:00
sampling_params.py
[Feature] mm and thinking model support structred output (#2749)
2025-09-02 16:21:09 +08:00
Powered by Gitea Version: 1.25.2 Page: 1018ms Template: 29ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API