LiqinruiG
|
4ccd1696ab
|
[Doc] modify offline inference docs (#2747)
* modify reasoning_output docs
* modify offline inference docs
* modify offline inference docs
|
2025-07-09 20:53:26 +08:00 |
|
chen
|
888780ffde
|
[Feature] block_wise_fp8 support triton_moe_backend (#2767)
|
2025-07-09 19:22:47 +08:00 |
|
lifulll
|
1f28bdf994
|
dcu adapter ernie45t (#2756)
Co-authored-by: lifu <lifu@sugon.com>
Co-authored-by: yongqiangma <xing.wo@163.com>
|
2025-07-09 18:56:27 +08:00 |
|
zhink
|
b89180f1cd
|
[Feature] support custom all-reduce (#2758)
* [Feature] support custom all-reduce
* add vllm adapted
|
2025-07-09 16:00:27 +08:00 |
|
EnflameGCU
|
d0f4d6ba3a
|
[GCU] Support gcu platform (#2702)
baseline: e7fa57ebae
Co-authored-by: yongqiangma <xing.wo@163.com>
|
2025-07-08 13:00:52 +08:00 |
|
chen
|
66b321d9ec
|
Update eb45-0.3B cuda memory (#2686)
|
2025-07-07 11:31:15 +08:00 |
|
LiqinruiG
|
b38823bc66
|
modify reasoning_output docs (#2696)
|
2025-07-04 11:30:02 +08:00 |
|
kevin
|
3d3bccdf79
|
[doc] update docs (#2690)
|
2025-07-03 19:33:19 +08:00 |
|
Jiang-Jia-Jun
|
d222248d00
|
Update README.md
|
2025-07-03 15:28:28 +08:00 |
|
liddk1121
|
865e856a94
|
update iluvatar gpu fastdeploy whl (#2675)
|
2025-07-02 14:47:21 +08:00 |
|
freeliuzc
|
2b7f74d427
|
fix docs (#2669)
Co-authored-by: liuzichang01 <liuzichang01@baidu.com>
|
2025-07-01 18:02:44 +08:00 |
|
Jiang-Jia-Jun
|
164b83ab0b
|
[Doc] Update nvidia gpu installation description
|
2025-07-01 15:22:19 +08:00 |
|
hong19860320
|
8e335db645
|
Update kunlunxin_xpu.md (#2662)
|
2025-07-01 15:10:45 +08:00 |
|
AIbin
|
1bb296c5ad
|
update quantization doc (#2659)
|
2025-07-01 15:05:02 +08:00 |
|
hong19860320
|
92428a5ae4
|
Update kunlunxin_xpu.md (#2657)
Deploy GitHub Pages / deploy (push) Has been cancelled
|
2025-07-01 12:28:49 +08:00 |
|
hong19860320
|
6b95b42986
|
Update kunlunxin_xpu.md
|
2025-06-30 15:49:32 +08:00 |
|
hong19860320
|
b0d3a630ba
|
Merge branch 'develop' of https://github.com/hong19860320/FastDeploy into hongming/fix_xpu_doc
|
2025-06-30 15:42:29 +08:00 |
|
hong19860320
|
ef72873695
|
Update kunlunxin_xpu.md
|
2025-06-30 15:27:48 +08:00 |
|
kevin
|
4f7b42ce3e
|
update docs
|
2025-06-30 14:45:41 +08:00 |
|
changwenbin
|
634d3c3642
|
update wint2 doc
|
2025-06-30 11:36:15 +08:00 |
|
Jiang-Jia-Jun
|
866946de0d
|
Update quick_start.md
|
2025-06-30 08:57:02 +08:00 |
|
Jiang-Jia-Jun
|
47299dbc54
|
Update supported models
|
2025-06-30 08:50:44 +08:00 |
|
Jiang-Jia-Jun
|
92c2cfa2e7
|
Sync v2.0 version of code to github repo
|
2025-06-29 23:29:37 +00:00 |
|