Sunny-bot1
|
fa2369271d
|
update env docs for Machete (#3960)
|
2025-09-08 14:44:52 +08:00 |
|
周周周
|
17b414c2df
|
MoE Default use triton's blockwise fp8 in TP Case (#3678)
|
2025-08-29 11:07:30 +08:00 |
|
Mattheliu
|
108d989d9d
|
[Docs] add fastdeploy_unit_test_guide.md (#3484)
* docs:add fastdeploy_unit_test_guide.md
* docs:fix fastdeploy_unit_test_guide.md
* docs: add FastDeploy unit test spec (EN) and update usage nav
* fix codestyle
|
2025-08-28 14:12:25 +08:00 |
|
yinwei
|
354575b6d1
|
[Docs]Modify the gpu-memory-utilization of the 128K 8-card Wint4 model to 0.95 (#3428)
* XPU Update 2.1 Release Documentation
* code style check
* Modify the gpu-memory-utilization of the 128K 8-card Wint4 model to 0.95
|
2025-08-15 18:34:37 +08:00 |
|
yinwei
|
fbb6dcb9e4
|
[Docs]XPU Update 2.1 Release Documentation (#3423)
* XPU Update 2.1 Release Documentation
* code style check
|
2025-08-15 14:07:47 +08:00 |
|
Yuanle Liu
|
9571c458f0
|
enhance eos_tokens (#3274)
* enhance eos_tokens
* update
* update
|
2025-08-11 14:47:52 +08:00 |
|
hong19860320
|
93a1731891
|
[Doc] Update deps and fix dead links (#3252)
|
2025-08-07 11:04:31 +08:00 |
|
yinwei
|
5b9aec1f10
|
xpu release 2.0.3 (#3105)
|
2025-07-31 14:26:07 +08:00 |
|
Zero Rains
|
25698d56d1
|
polish code with new pre-commit rule (#2923)
|
2025-07-19 23:19:27 +08:00 |
|
yulangz
|
7dfd2ea052
|
[XPU][doc] Update minimal fastdeploy required (#2863)
* [XPU][doc] update minimal fastdeploy required
|
2025-07-17 11:33:22 +08:00 |
|
yulangz
|
17314ee126
|
[XPU] Update doc and add scripts for downloading dependencies (#2845)
* [XPU] update xvllm download
* update supported models
* fix xpu model runner in huge memory with small model
* update doc
|
2025-07-16 11:05:56 +08:00 |
|
Sunny-bot1
|
240d6236bc
|
[Fix]fix top_k_top_p sampling (#2801)
Deploy GitHub Pages / deploy (push) Has been cancelled
* fix topk-topp
* update
* add base_non_truncated
|
2025-07-10 22:35:10 +08:00 |
|
chen
|
888780ffde
|
[Feature] block_wise_fp8 support triton_moe_backend (#2767)
|
2025-07-09 19:22:47 +08:00 |
|
Jiang-Jia-Jun
|
92c2cfa2e7
|
Sync v2.0 version of code to github repo
|
2025-06-29 23:29:37 +00:00 |
|
jiangjiajun
|
4c07d198ba
|
add model zoo
|
2022-07-06 03:12:43 +00:00 |
|
jiangjiajun
|
9d87046d78
|
first commit
|
2022-07-05 09:30:15 +00:00 |
|