LiqinruiG
|
4ccd1696ab
|
[Doc] modify offline inference docs (#2747)
* modify reasoning_output docs
* modify offline inference docs
* modify offline inference docs
|
2025-07-09 20:53:26 +08:00 |
|
chen
|
888780ffde
|
[Feature] block_wise_fp8 support triton_moe_backend (#2767)
|
2025-07-09 19:22:47 +08:00 |
|
lifulll
|
1f28bdf994
|
dcu adapter ernie45t (#2756)
Co-authored-by: lifu <lifu@sugon.com>
Co-authored-by: yongqiangma <xing.wo@163.com>
|
2025-07-09 18:56:27 +08:00 |
|
zhink
|
b89180f1cd
|
[Feature] support custom all-reduce (#2758)
* [Feature] support custom all-reduce
* add vllm adapted
|
2025-07-09 16:00:27 +08:00 |
|
EnflameGCU
|
d0f4d6ba3a
|
[GCU] Support gcu platform (#2702)
baseline: e7fa57ebae
Co-authored-by: yongqiangma <xing.wo@163.com>
|
2025-07-08 13:00:52 +08:00 |
|
chen
|
66b321d9ec
|
Update eb45-0.3B cuda memory (#2686)
|
2025-07-07 11:31:15 +08:00 |
|
LiqinruiG
|
b38823bc66
|
modify reasoning_output docs (#2696)
|
2025-07-04 11:30:02 +08:00 |
|
kevin
|
3d3bccdf79
|
[doc] update docs (#2690)
|
2025-07-03 19:33:19 +08:00 |
|
Jiang-Jia-Jun
|
d222248d00
|
Update README.md
|
2025-07-03 15:28:28 +08:00 |
|
Jiang-Jia-Jun
|
e5b94d4117
|
Update README.md
|
2025-07-03 15:28:05 +08:00 |
|
handiz
|
b8a8a19689
|
add wint2 performance (#2673)
|
2025-07-02 17:10:01 +08:00 |
|
Jiang-Jia-Jun
|
97ac82834f
|
Update nvidia_gpu.md
|
2025-07-02 16:54:14 +08:00 |
|
Jiang-Jia-Jun
|
685265a97d
|
Update nvidia_gpu.md
|
2025-07-02 15:43:35 +08:00 |
|
Jiang-Jia-Jun
|
fc4d643634
|
Update nvidia_gpu.md
|
2025-07-02 15:39:48 +08:00 |
|
liddk1121
|
865e856a94
|
update iluvatar gpu fastdeploy whl (#2675)
|
2025-07-02 14:47:21 +08:00 |
|
Jiang-Jia-Jun
|
9f4a65d817
|
Update README.md
|
2025-07-02 10:04:58 +08:00 |
|
freeliuzc
|
2b7f74d427
|
fix docs (#2669)
Co-authored-by: liuzichang01 <liuzichang01@baidu.com>
|
2025-07-01 18:02:44 +08:00 |
|
Jiang-Jia-Jun
|
164b83ab0b
|
[Doc] Update nvidia gpu installation description
|
2025-07-01 15:22:19 +08:00 |
|
Jiang-Jia-Jun
|
01d5d66d95
|
[Doc] Update nvidia gpu installation description
|
2025-07-01 15:20:40 +08:00 |
|
Jiang-Jia-Jun
|
8f1dddcf35
|
[Doc] Update nvidia gpu installation description
|
2025-07-01 15:20:21 +08:00 |
|
hong19860320
|
8e335db645
|
Update kunlunxin_xpu.md (#2662)
|
2025-07-01 15:10:45 +08:00 |
|
AIbin
|
1bb296c5ad
|
update quantization doc (#2659)
|
2025-07-01 15:05:02 +08:00 |
|
hong19860320
|
92428a5ae4
|
Update kunlunxin_xpu.md (#2657)
|
2025-07-01 12:28:49 +08:00 |
|
hong19860320
|
6b95b42986
|
Update kunlunxin_xpu.md
|
2025-06-30 15:49:32 +08:00 |
|
hong19860320
|
b0d3a630ba
|
Merge branch 'develop' of https://github.com/hong19860320/FastDeploy into hongming/fix_xpu_doc
|
2025-06-30 15:42:29 +08:00 |
|
hong19860320
|
ef72873695
|
Update kunlunxin_xpu.md
|
2025-06-30 15:27:48 +08:00 |
|
kevin
|
4f7b42ce3e
|
update docs
|
2025-06-30 14:45:41 +08:00 |
|
qingqing01
|
90a5b18742
|
Update disaggregated.md
|
2025-06-30 11:57:12 +08:00 |
|
qingqing01
|
7c43500060
|
Update disaggregated.md
|
2025-06-30 11:56:33 +08:00 |
|
Jiang-Jia-Jun
|
ea29b01a68
|
Update quick_start.md
|
2025-06-30 11:52:05 +08:00 |
|
yongqiangma
|
f9431106d8
|
Merge branch 'develop' into doc
|
2025-06-30 11:42:43 +08:00 |
|
mayongqiang
|
0d39e23ab9
|
fix format
|
2025-06-30 11:39:59 +08:00 |
|
changwenbin
|
634d3c3642
|
update wint2 doc
|
2025-06-30 11:36:15 +08:00 |
|
Jiang-Jia-Jun
|
50c5bc1e9d
|
Update nvidia_gpu.md
|
2025-06-30 08:59:41 +08:00 |
|
Jiang-Jia-Jun
|
187a5ae592
|
Update quick_start.md
|
2025-06-30 08:57:25 +08:00 |
|
Jiang-Jia-Jun
|
866946de0d
|
Update quick_start.md
|
2025-06-30 08:57:02 +08:00 |
|
Jiang-Jia-Jun
|
72c768168c
|
Update ernie-4.5-vl.md
|
2025-06-30 08:56:27 +08:00 |
|
Jiang-Jia-Jun
|
f0b7e99f05
|
Update ernie-4.5.md
|
2025-06-30 08:56:08 +08:00 |
|
Jiang-Jia-Jun
|
b40633cbbd
|
Update quick_start.md
|
2025-06-30 08:55:29 +08:00 |
|
Jiang-Jia-Jun
|
f14f361c23
|
Add README.md for quick start
|
2025-06-30 08:55:05 +08:00 |
|
Jiang-Jia-Jun
|
47299dbc54
|
Update supported models
|
2025-06-30 08:50:44 +08:00 |
|
Jiang-Jia-Jun
|
6cb1a75663
|
Update supported models
|
2025-06-30 08:50:21 +08:00 |
|
Jiang-Jia-Jun
|
92c2cfa2e7
|
Sync v2.0 version of code to github repo
|
2025-06-29 23:29:37 +00:00 |
|
Jiang-Jia-Jun
|
8dba1d90a1
|
Rename filepath
|
2025-06-30 01:32:01 +08:00 |
|
Jiang-Jia-Jun
|
135489dd30
|
Create README.md
|
2025-06-30 01:29:45 +08:00 |
|
Jiang-Jia-Jun
|
f57422e3c1
|
Update serving parameters description
|
2025-06-10 19:27:38 +08:00 |
|
jiangjiajun
|
684703fd72
|
[LLM] First commit the llm deployment code
|
2025-06-09 19:20:15 +08:00 |
|
Zheng-Bicheng
|
4e9bcc3718
|
[Backend] Update RKNN Runtime libs to version "1.5.1b19" (#2156)
* update rknpu2 runtime libs and rknntoolkit2
* update rknpu2 runtime libs
|
2023-08-10 13:27:56 +08:00 |
|
Zeref996
|
24f32d10a7
|
[Docs] refresh fd version 1.0.7 (#1986)
|
2023-05-24 17:35:26 +08:00 |
|
seyosum
|
df8dd3e3ac
|
【Hackthon_4th 180】Support HORIZON BPU Backend for FastDeploy (#1822)
* add horizon backend and PPYOLOE examples
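* Change the coding conventions of the horizon header files
* Change the coding conventions of the horizon header files
* Change the coding conventions of the horizon header files
* Add download and automatic installation of horizon packages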
* Add UseHorizonNPUBackend Method
* Remove redundant header files left after compiling the FD SDK, and update some conventions
* Update horizon.md
* Update horizon.md
---------
Co-authored-by: DefTruth <31974251+DefTruth@users.noreply.github.com>
|
2023-05-06 16:10:37 +08:00 |
|