Logo
Explore Help
Sign In
apps/FastDeploy
1
0
Fork 0
You've already forked FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-10-01 23:02:36 +08:00
Code Issues Actions 2 Packages Projects Releases Wiki Activity
Files
a7392a0ff944a1f40a26023c53c80b10f263421f
FastDeploy/docs
History
AIbin a7392a0ff9 【Inference Optimize】DeepSeek-V3-model MLA Optimize (#3886)
* support MLA chunk_size auto search & cuda_graph
2025-09-11 10:46:09 +08:00
..
assets/images
更新文档 (#3975)
2025-09-08 16:53:37 +08:00
best_practices
add a3b-thinking doc (#3994)
2025-09-09 10:27:01 +08:00
features
Revert "【FIX】Change the name of sparse attn from moba to plas (#3845)" (#4001)
2025-09-09 11:08:23 +08:00
get_started
[xpu] add ep custom ops (#3911)
2025-09-10 12:22:50 +08:00
online_serving
Modify markdown (#3896)
2025-09-08 16:42:34 +08:00
quantization
【Inference Optimize】DeepSeek-V3-model MLA Optimize (#3886)
2025-09-11 10:46:09 +08:00
usage
update env docs for Machete (#3959)
2025-09-08 14:44:31 +08:00
zh
【Inference Optimize】DeepSeek-V3-model MLA Optimize (#3886)
2025-09-11 10:46:09 +08:00
benchmark.md
polish code with new pre-commit rule (#2923)
2025-07-19 23:19:27 +08:00
index.md
更新文档 (#3998)
2025-09-09 10:44:15 +08:00
offline_inference.md
rename ernie_xxx to ernie4_5_xxx (#3621)
2025-08-26 19:29:27 +08:00
parameters.md
更新文档 (#3975)
2025-09-08 16:53:37 +08:00
requirements.txt
Sync v2.0 version of code to github repo
2025-06-29 23:29:37 +00:00
supported_models.md
Update docs for thinking model support
2025-09-09 10:08:05 +08:00
Powered by Gitea Version: 1.24.5 Page: 117ms Template: 8ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API