Logo
Explore Help
Sign In
apps/FastDeploy
1
0
Fork 0
You've already forked FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-10-19 15:04:47 +08:00
Code Issues Actions 2 Packages Projects Releases Wiki Activity
Files
59071268b67c2596c58779899165f0286139968d
FastDeploy/fastdeploy/engine
History
yulangz 830de5a925 [XPU] Supports TP4 deployment on 4,5,6,7 (#2794)
* 支持通过 XPU_VISIBLE_DEVICES 指定 4,5,6,7 卡运行
* 修改 XPU 文档中多卡说明
2025-07-10 16:48:08 +08:00
..
__init__.py
[LLM] First commit the llm deployment code
2025-06-09 19:20:15 +08:00
args_utils.py
[Feature] Online Chat API Support Return logprobs (#2777)
2025-07-10 16:33:40 +08:00
config.py
[XPU] Supports TP4 deployment on 4,5,6,7 (#2794)
2025-07-10 16:48:08 +08:00
engine.py
[Feature] Online Chat API Support Return logprobs (#2777)
2025-07-10 16:33:40 +08:00
expert_service.py
[Bug fix] Add the missing pod_ip param to the launch_cache_manager function. (#2742)
2025-07-08 14:52:13 +08:00
kv_cache_interface.py
[LLM] First commit the llm deployment code
2025-06-09 19:20:15 +08:00
request.py
[Feature] Online Chat API Support Return logprobs (#2777)
2025-07-10 16:33:40 +08:00
resource_manager.py
Sync v2.0 version of code to github repo
2025-06-29 23:29:37 +00:00
sampling_params.py
[Feature] Online Chat API Support Return logprobs (#2777)
2025-07-10 16:33:40 +08:00
Powered by Gitea Version: 1.24.5 Page: 123ms Template: 4ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API