yzwu
ac013803f3
[Iluvatar] Support V1_KVCACHE_SCHEDULER and paddleocr-vl rope mode ( #5555 )
2025-12-18 02:14:25 -08:00
Echo-Nie
e1347be4d9
[Docs] Fix nvidia_gpu.md, add sm80 in precompiled ( #5462 )
...
* Update supported GPU architectures in installation guide
* Update supported architectures in GPU installation guide
* Update GPU architectures support in installation guide
2025-12-11 14:41:50 +08:00
Jiang-Jia-Jun
0eb799a324
Update installation requirements for Kunlunxin XPU
2025-12-03 10:04:29 +08:00
CSWYF3634076
051b82b4c8
[Docs] add qwen25-vl docs ( #5243 )
...
* [Docs] add qwen25-vl docs
* [Docs] add qwen25-vl docs
* [Docs] add qwen25-vl docs
2025-11-27 15:05:57 +08:00
ApplEOFDiscord
287751f19d
[Docs] add docs of base64 or local file mm inputs ( #5193 )
2025-11-26 14:41:43 +08:00
FocusLuo
c2c1942db9
[INTEL_HPU] [CI] enabled fastdeploy PR testing ( #4596 )
...
* [INTEL HPU] added hpu ci work flow support
Signed-off-by: Luo, Focus <focus.luo@intel.com>
* [INTEL HPU] added run ci hpu test scripts
Signed-off-by: Luo, Focus <focus.luo@intel.com>
* [INTEL HPU] enabled HPU ernie test case
Signed-off-by: Luo, Focus <focus.luo@intel.com>
* [INTEL HPU] updated Intel Gaudi Readme with Warmup disable cmdline
Signed-off-by: Luo, Focus <focus.luo@intel.com>
* Modify paddlepaddle installation command
Updated paddlepaddle installation command to use a specific index URL.
* Update run_ci_hpu.sh
* Rename json directory to nlohmann_json
Rename extracted json directory to nlohmann_json.
* Update ci_hpu.yml
* Set pip global index URL to Tsinghua mirror
* Update CI workflow to use self-hosted runner and paths
* Update Docker image in CI workflow
* Modify HPU installation URLs in run_ci_hpu.sh
Updated the installation URL for paddle_intel_hpu and added paddlenlp_ops installation.
* Fix paddle_intel_hpu installation URL
Corrected the URL for paddle_intel_hpu wheel installation.
---------
Signed-off-by: Luo, Focus <focus.luo@intel.com>
Co-authored-by: plusNew001 <95567040+plusNew001@users.noreply.github.com>
2025-11-17 19:24:41 +08:00
Jiang-Jia-Jun
d41cf643f8
Update nvidia_gpu.md
2025-11-14 18:26:08 +08:00
Echo-Nie
ee1ea43e36
[Docs] Fix broken commitID ( #5008 )
...
* fix commitID
* Update nvidia_gpu.md
2025-11-14 10:39:41 +08:00
Echo-Nie
a5e949d9d0
[Feature] Enhance build script, add pre_wheel logic ( #4729 )
...
* Enhance build script, add pre_wheel logic
Updated copyright year and added precompiled wheel installation logic.
* update the nvidia_gpu.md, add pre_wheel description
* fix zh .md
* update the url, automatically detect CUDA and SM
* Fix GPU architecture string formatting in build.sh
* Change default for FD_USE_PRECOMPILED to 0
* fix build.sh
* add ./dist, pre-wheel path
* simplify the process, just save the whl
* del pre_wheel dir
* fix function name, extract_ops_from_precompiled_wheel
* fix docs
* add default commitID in docs
---------
Co-authored-by: plusNew001 <95567040+plusNew001@users.noreply.github.com>
2025-11-13 19:03:52 +08:00
Jiang-Jia-Jun
8329338d37
Update nvidia_gpu.md
2025-11-13 10:25:22 +08:00
JYChen
a1218076dc
remove load default_v1 since already been as default ( #4980 )
2025-11-12 16:49:48 +08:00
yzwu
08b96baa4a
[Iluvatar][Doc] Add ERNIE-4.5-VL-28B-A3B-Thinking doc ( #4955 )
2025-11-11 19:15:19 +08:00
yinwei
215cda2f80
[XPU][Doc]Update XPU release2.3 note ( #4939 )
...
* update doc
* update
* update
* update
2025-11-11 11:57:49 +08:00
yzwu
3707af7a4f
[Iluvatar] add vl into ci and support v1 loader ( #4774 )
2025-11-11 10:50:17 +08:00
LiqinruiG
3f74281496
[Docs] add ERNIE-4.5-VL-28B-A3B-Thinking instruction ( #4937 )
...
* [Docs] Improve reasoning_out docs
* [Docs] Improve reasoning_out docs
* [Docs] Improve reasoning_out docs
* [Docs] add ERNIE-4.5-VL-28B-A3B-Thinking instruction
* [Docs] add ERNIE-4.5-VL-28B-A3B-Thinking instruction
* [Docs] add ERNIE-4.5-VL-28B-A3B-Thinking instruction
---------
Co-authored-by: liqinrui <liqinrui@baidu.com>
2025-11-11 10:43:44 +08:00
yangjianfengo1
d7f14dba8b
update docs ( #4938 )
...
Co-authored-by: root <root@yq02-inf-sci-k8s-a100-aa2ni5-0018.yq02.baidu.com>
2025-11-11 10:28:46 +08:00
qw86972190
07b21d241d
[XPU]Update documentation ( #4917 )
...
* [XPU]Update documentation
* [XPU]Update documentation
* [XPU]Update documentation
* [XPU]Update documentation
* [XPU][Docs] Update documentation
* [XPU][Docs] Update documentation
* [XPU][Docs] Update documentation
* [XPU][Docs] Update documentation
* [XPU][Docs] Update documentation
* [XPU][Docs] Update documentation
2025-11-10 19:11:42 +08:00
Ding
6c316286c1
fix: correct typo in nvidia_gpu.md ( #4848 )
2025-11-06 16:03:02 +08:00
Jiang-Jia-Jun
aec1a84886
[Doc] Update docs for v2.3.0rc0 ( #4828 )
...
* [Doc] Update docs for v2.3.0rc0
* [Doc] Update docs for v2.3.0rc0
* [Doc] Update docs for v2.3.0rc0
* Update README_CN.md
* Add deployment guide link for FastDeploy v2.3-rc0
Updated release note for FastDeploy v2.3-rc0 to include deployment guide link.
* Add Deployment Guide link for FastDeploy v2.3-rc0
Updated the news section to include a link to the Deployment Guide for FastDeploy v2.3-rc0.
---------
Co-authored-by: Jiang-Jia-Jun <jiangjiajun@baidu.com>
2025-11-05 19:45:53 +08:00
yyssys
b8bf57138f
[Docs]Update XPU document version to 2.3.0 ( #4741 )
...
* [Doc]Update XPU document version to 2.3.0
* update paddle doc version
* update applicable version
2025-11-03 09:54:51 +08:00
yangjianfengo1
329d074326
[Docx] fix the broken link ( #4479 )
...
* Modify docs
* Modify docs
2025-10-17 18:28:50 +08:00
yzwu
4b661512ca
[Iluvatar GPU] Adapt VL model ( #4313 )
2025-10-17 16:13:38 +08:00
yangjianfengo1
ba5c2b7e37
[Docx] add language (en/cn) switch links ( #4470 )
...
* add install docs
* Modify docs
* Modify docs
2025-10-17 15:47:41 +08:00
LiqinruiG
4251ac5e95
[Fix] remove text_after_process & raw_prediction ( #4421 )
...
* remove text_after_process & raw_prediction
* remove text_after_process & raw_prediction
2025-10-16 19:00:18 +08:00
yangjianfengo1
c1a2e78b18
add install docs ( #4414 )
2025-10-14 20:17:29 +08:00
YuBaoku
425205b03c
[Doc] fix the port conflict issue in the usage example ( #4379 )
2025-10-13 20:17:06 +08:00
yyssys
3cb4b4d7d4
[Doc] Update xpu fastdeploy version to 2.2.1 ( #4338 )
2025-10-09 20:14:07 +08:00
yangjianfengo1
b650867fff
Modify docs ( #4339 )
2025-10-09 20:10:58 +08:00
fmiao2372
f1b5392e20
[Intel HPU] Support intel hpu platform ( #4161 )
...
* [Intel HPU] Support intel hpu platform
* fix some issues
* apply precommit and move AttentionBackend_HPU
* fix format issue
* correct ops import
* fix ci issue
* update code in layers
* fix code style issue
* remove dense tp moe ep mode
* fix enc_dec_block_num
* fix rebase issue
* rename hpu to gaudi in readme
* rename ForwardMeta_HPU to HPUForwardMeta
2025-09-24 12:27:50 +08:00
bukejiyu
62d1c48363
[v1 loader]code style ( #4204 )
...
* code style
* update
2025-09-23 19:36:00 +08:00
yzwu
504461b6b5
[Iluvatar GPU] Optimize attention performance and fix moe load ckpt error ( #3651 )
2025-09-22 21:13:59 +08:00
zhupengyang
9d0074a91a
[xpu] add ep custom ops ( #3911 )
2025-09-10 12:22:50 +08:00
bukejiyu
08b3153661
update doc ( #3990 )
...
Co-authored-by: root <root@tjdm-inf-sci-k8s-hzz2-h12ni8-0214.tjdm.baidu.com>
2025-09-08 21:04:26 +08:00
yinwei
7e0bfd024f
update release note ( #3986 )
2025-09-08 19:03:14 +08:00
yangjianfengo1
9ead10e1bc
Update docs ( #3975 )
2025-09-08 16:53:37 +08:00
Jiang-Jia-Jun
18e5d355a1
Update version in docs
2025-09-02 19:21:10 +08:00
Kane2011
2ae7ab28d2
[MetaxGPU] adapt to the latest fastdeploy on metax gpu ( #3492 )
2025-08-25 17:44:20 +08:00
yongqiangma
5703d7aa0f
update installation readme ( #3429 )
2025-08-15 19:09:41 +08:00
yangjianfengo1
615930bc05
Update README ( #3426 )
...
* Modify README
* code style
* code style
2025-08-15 18:46:28 +08:00
JYChen
6f11171478
fix some docs error ( #3439 )
2025-08-15 18:45:27 +08:00
ming1753
d4e3a20300
[Docs] Release 2.1 docs and fix some description ( #3424 )
2025-08-15 14:27:19 +08:00
yinwei
fbb6dcb9e4
[Docs]XPU Update 2.1 Release Documentation ( #3423 )
...
* XPU Update 2.1 Release Documentation
* code style check
2025-08-15 14:07:47 +08:00
JYChen
562e01c979
update docs ( #3420 )
2025-08-15 13:00:08 +08:00
yangjianfengo1
b808c49585
[Doc] Add Chinese/English language switch ( #3318 )
...
* Add Chinese/English language switch
* Add Chinese/English language switch
* Modify README
2025-08-12 11:20:45 +08:00
hong19860320
93a1731891
[Doc] Update deps and fix dead links ( #3252 )
2025-08-07 11:04:31 +08:00
ApplEOFDiscord
b71cbb466d
[Feature] remove dependency on enable_mm and refine multimodal's code ( #3014 )
...
* remove dependency on enable_mm
* fix codestyle check error
* fix codestyle check error
* update docs
* resolve conflicts on model config
* fix unit test error
* fix code style check error
---------
Co-authored-by: shige <1021937542@qq.com>
Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com>
2025-08-01 20:01:18 +08:00
LiqinruiG
25005fee30
[Doc] add chat_template_kwargs and update params docs ( #3103 )
...
* add chat_template_kwargs and update params docs
* add chat_template_kwargs and update params docs
* update enable_thinking
* pre-commit
* update test case
---------
Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com>
2025-07-31 19:44:06 +08:00
yinwei
5b9aec1f10
xpu release 2.0.3 ( #3105 )
2025-07-31 14:26:07 +08:00
李泳桦
b242150f94
[feat] extra parameters are all passed directly via http payload now, or in extra_body if using openai client ( #3058 )
...
* [feat] extra parameters are all passed directly via http payload now, or in extra_body if using openai client
* [fix] delete ci test case for enable_thinking
* [fix] add reasoning_parser when server starts
* [fix] fix ci consistency test error with reasoning parser
* [doc] update docs related to metadata
* [fix] cancel enable_thinking default value
2025-07-30 19:25:20 +08:00
Zero Rains
25698d56d1
polish code with new pre-commit rule ( #2923 )
2025-07-19 23:19:27 +08:00