ming1753
|
570ad54b51
|
[Docs] release 2.1 (#3441)
* [Docs] release 2.1
* sync gh-pages.yml
|
2025-08-15 19:32:29 +08:00 |
|
yinwei
|
8a15bdc0c8
|
[Doc]Release fastdeploy-xpu 2.1.0 (#3407)
* fix v1 schedule oom bug
* fix v1 schedule oom bug
* update release note
|
2025-08-14 19:11:16 +08:00 |
|
yinwei
|
5b9aec1f10
|
xpu release 2.0.3 (#3105)
|
2025-07-31 14:26:07 +08:00 |
|
Zero Rains
|
25698d56d1
|
polish code with new pre-commit rule (#2923)
|
2025-07-19 23:19:27 +08:00 |
|
yulangz
|
7dfd2ea052
|
[XPU][doc] Update minimal fastdeploy required (#2863)
* [XPU][doc] update minimal fastdeploy required
|
2025-07-17 11:33:22 +08:00 |
|
yulangz
|
17314ee126
|
[XPU] Update doc and add scripts for downloading dependencies (#2845)
* [XPU] update xvllm download
* update supported models
* fix xpu model runner in huge memory with small model
* update doc
|
2025-07-16 11:05:56 +08:00 |
|
Sunny-bot1
|
240d6236bc
|
[Fix]fix top_k_top_p sampling (#2801)
Deploy GitHub Pages / deploy (push) Has been cancelled
* fix topk-topp
* update
* add base_non_truncated
|
2025-07-10 22:35:10 +08:00 |
|
chen
|
888780ffde
|
[Feature] block_wise_fp8 support triton_moe_backend (#2767)
|
2025-07-09 19:22:47 +08:00 |
|
Jiang-Jia-Jun
|
92c2cfa2e7
|
Sync v2.0 version of code to github repo
|
2025-06-29 23:29:37 +00:00 |
|
jiangjiajun
|
4c07d198ba
|
add model zoo
|
2022-07-06 03:12:43 +00:00 |
|
jiangjiajun
|
9d87046d78
|
first commit
|
2022-07-05 09:30:15 +00:00 |
|