Logo
Explore Help
Sign In
apps/FastDeploy
1
0
Fork 0
You've already forked FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-10-05 16:48:03 +08:00
Code Issues Actions 2 Packages Projects Releases Wiki Activity
Files
2f28f40d90796cb1c1170d0455b7771fa844f67c
FastDeploy/docs/zh
History
Sunny-bot1 c68c3c4b8b [Feature] bad words support v1 scheduler and specifiy token ids (#3608)
* support bad_words_token_ids

* docs

* fix test

* fix

* bad words support kvcache v1 and token ids

* fix
2025-08-25 20:14:51 -07:00
..
best_practices
Modified to support custom all reduce by default (#3538)
2025-08-22 16:59:05 +08:00
features
[Feature] bad words support v1 scheduler and specifiy token ids (#3608)
2025-08-25 20:14:51 -07:00
get_started
[MetaxGPU] adapt to the latest fastdeploy on metax gpu (#3492)
2025-08-25 17:44:20 +08:00
online_serving
[Feature] bad words support v1 scheduler and specifiy token ids (#3608)
2025-08-25 20:14:51 -07:00
quantization
polish code with new pre-commit rule (#2923)
2025-07-19 23:19:27 +08:00
usage
[Docs]Modify the gpu-memory-utilization of the 128K 8-card Wint4 model to 0.95 (#3428)
2025-08-15 18:34:37 +08:00
benchmark.md
Sync v2.0 version of code to github repo
2025-06-29 23:29:37 +00:00
index.md
Update README (#3426)
2025-08-15 18:46:28 +08:00
offline_inference.md
[Docs]fix sampling docs (#3113)
2025-08-11 20:42:27 +08:00
parameters.md
Modified to support custom all reduce by default (#3538)
2025-08-22 16:59:05 +08:00
supported_models.md
[Feature] multi source download (#3005)
2025-07-24 17:42:09 +08:00
Powered by Gitea Version: 1.24.5 Page: 534ms Template: 11ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API