Yuan Xiaolan
c71ee0831c
add w4afp8 offline script ( #3636 )
2025-08-29 17:56:05 +08:00
Kane2011
2ae7ab28d2
[MetaxGPU] adapt to the latest fastdeploy on metax gpu ( #3492 )
2025-08-25 17:44:20 +08:00
yongqiangma
5703d7aa0f
update installation readme ( #3429 )
2025-08-15 19:09:41 +08:00
yangjianfengo1
615930bc05
Update README ( #3426 )
...
* Modify README
* code style
* code style
2025-08-15 18:46:28 +08:00
JYChen
6f11171478
fix some docs error ( #3439 )
2025-08-15 18:45:27 +08:00
ming1753
d4e3a20300
[Docs] Release 2.1 docs and fix some description ( #3424 )
2025-08-15 14:27:19 +08:00
yinwei
fbb6dcb9e4
[Docs]XPU Update 2.1 Release Documentation ( #3423 )
...
* XPU Update 2.1 Release Documentation
* code style check
2025-08-15 14:07:47 +08:00
JYChen
562e01c979
update docs ( #3420 )
2025-08-15 13:00:08 +08:00
yzwu
ce9180241e
[Iluvatar GPU] Modify the names of some variables ( #3273 )
2025-08-13 11:38:02 +08:00
yzwu
fbdd6b0663
[Iluvatar GPU] Optimize attention and moe performance ( #3234 )
2025-08-08 10:51:24 +08:00
hong19860320
93a1731891
[Doc] Update deps and fix dead links ( #3252 )
2025-08-07 11:04:31 +08:00
ApplEOFDiscord
b71cbb466d
[Feature] remove dependency on enable_mm and refine multimodal's code ( #3014 )
...
* remove dependency on enable_mm
* fix codestyle check error
* fix codestyle check error
* update docs
* resolve conflicts on model config
* fix unit test error
* fix code style check error
---------
Co-authored-by: shige <1021937542@qq.com >
Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com >
2025-08-01 20:01:18 +08:00
LiqinruiG
25005fee30
[Doc] add chat_template_kwagrs and update params docs ( #3103 )
...
* add chat_template_kwagrs and update params docs
* add chat_template_kwagrs and update params docs
* update enable_thinking
* pre-commit
* update test case
---------
Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com >
2025-07-31 19:44:06 +08:00
yinwei
5b9aec1f10
xpu release 2.0.3 ( #3105 )
2025-07-31 14:26:07 +08:00
李泳桦
b242150f94
[feat] extra parameters are all passed directly via http payload now, or in extra_body if using openai client ( #3058 )
...
* [feat] extra parameters are all passed directly via http payload now, or in extra_body if using openai client
* [fix] delete ci test case for enable_thinking
* [fix] add reasoning_parser when server starts
* [fix] fix ci consistency test error with reasoning parser
* [doc] update docs related to metadata
* [fix] cancel enable_thinking default value
2025-07-30 19:25:20 +08:00
Jiang-Jia-Jun
286802a070
Update ernie-4.5.md
2025-07-29 10:10:09 +08:00
Zero Rains
25698d56d1
polish code with new pre-commit rule ( #2923 )
2025-07-19 23:19:27 +08:00
yulangz
c8c280c4d3
[XPU][Doc] fix typo ( #2892 )
2025-07-17 19:13:54 +08:00
yulangz
17314ee126
[XPU] Update doc and add scripts for downloading dependencies ( #2845 )
...
* [XPU] update xvllm download
* update supported models
* fix xpu model runner in huge memory with small model
* update doc
2025-07-16 11:05:56 +08:00
yulangz
830de5a925
[XPU] Supports TP4 deployment on 4,5,6,7 ( #2794 )
...
* Support running on cards 4,5,6,7 specified via XPU_VISIBLE_DEVICES
* Update the multi-card instructions in the XPU documentation
2025-07-10 16:48:08 +08:00
lifulll
1f28bdf994
dcu adapter ernie45t ( #2756 )
...
Co-authored-by: lifu <lifu@sugon.com >
Co-authored-by: yongqiangma <xing.wo@163.com >
2025-07-09 18:56:27 +08:00
EnflameGCU
d0f4d6ba3a
[GCU] Support gcu platform ( #2702 )
...
baseline: e7fa57ebae
Co-authored-by: yongqiangma <xing.wo@163.com >
2025-07-08 13:00:52 +08:00
Jiang-Jia-Jun
97ac82834f
Update nvidia_gpu.md
2025-07-02 16:54:14 +08:00
Jiang-Jia-Jun
685265a97d
Update nvidia_gpu.md
2025-07-02 15:43:35 +08:00
Jiang-Jia-Jun
fc4d643634
Update nvidia_gpu.md
2025-07-02 15:39:48 +08:00
liddk1121
865e856a94
update iluvatar gpu fastdeploy whl ( #2675 )
2025-07-02 14:47:21 +08:00
Jiang-Jia-Jun
01d5d66d95
[Doc] Update nvidia gpu installation description
2025-07-01 15:20:40 +08:00
Jiang-Jia-Jun
8f1dddcf35
[Doc] Update nvidia gpu installation description
2025-07-01 15:20:21 +08:00
hong19860320
8e335db645
Update kunlunxin_xpu.md ( #2662 )
2025-07-01 15:10:45 +08:00
hong19860320
92428a5ae4
Update kunlunxin_xpu.md ( #2657 )
2025-07-01 12:28:49 +08:00
hong19860320
6b95b42986
Update kunlunxin_xpu.md
2025-06-30 15:49:32 +08:00
hong19860320
b0d3a630ba
Merge branch 'develop' of https://github.com/hong19860320/FastDeploy into hongming/fix_xpu_doc
2025-06-30 15:42:29 +08:00
hong19860320
ef72873695
Update kunlunxin_xpu.md
2025-06-30 15:27:48 +08:00
kevin
4f7b42ce3e
update docs
2025-06-30 14:45:41 +08:00
Jiang-Jia-Jun
ea29b01a68
Update quick_start.md
2025-06-30 11:52:05 +08:00
mayongqiang
0d39e23ab9
fix format
2025-06-30 11:39:59 +08:00
Jiang-Jia-Jun
50c5bc1e9d
Update nvidia_gpu.md
2025-06-30 08:59:41 +08:00
Jiang-Jia-Jun
187a5ae592
Update quick_start.md
2025-06-30 08:57:25 +08:00
Jiang-Jia-Jun
72c768168c
Update ernie-4.5-vl.md
2025-06-30 08:56:27 +08:00
Jiang-Jia-Jun
f0b7e99f05
Update ernie-4.5.md
2025-06-30 08:56:08 +08:00
Jiang-Jia-Jun
b40633cbbd
Update quick_start.md
2025-06-30 08:55:29 +08:00
Jiang-Jia-Jun
f14f361c23
Add README.md for quick start
2025-06-30 08:55:05 +08:00
Jiang-Jia-Jun
92c2cfa2e7
Sync v2.0 version of code to github repo
2025-06-29 23:29:37 +00:00
Jiang-Jia-Jun
8dba1d90a1
Rename filepath
2025-06-30 01:32:01 +08:00