FastDeploy

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-12-24 13:28:13 +08:00

Author	SHA1	Message	Date
Zhang Yulong	1a543bca29	Fix test_EB_Lite_serving.py (#3119 ) * Fix test_EB_Lite_serving.py * fix test_EB_Lite_serving.py	2025-07-31 20:15:25 +08:00
LiqinruiG	25005fee30	[Doc] add chat_template_kwagrs and update params docs (#3103 ) * add chat_template_kwagrs and update params docs * add chat_template_kwagrs and update params docs * update enable_thinking * pre-commit * update test case --------- Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com>	2025-07-31 19:44:06 +08:00
YUNSHEN XIE	583eae2fd1	fix ci (#3106 ) * fix ci * disable test_non_streaming_chat_with_min_tokens	2025-07-31 17:25:08 +08:00
Jiang-Jia-Jun	0616c208d2	[Feature] Support include_stop_str_in_output in completion api (#3096 ) * [Feature] Support include_stop_str_in_output in completion api * Fix ci test --------- Co-authored-by: Jiang-Jia-Jun <jiangjiajun@baidu.com>	2025-07-30 22:18:48 +08:00
李泳桦	b242150f94	[feat] extra parameters are all passed directly via http payload now, or in extra_body if using openai client (#3058 ) * [feat] extra parameters are all passed directly via http payload now, or in extra_body if using openai client * [fix] delete ci test case for enable_thinking * [fix] add reasoning_parser when server starts * [fix] fix ci consistency test error with reasoning parser * [doc] update docs related to metadata * [fix] cancel enable_thinking default value	2025-07-30 19:25:20 +08:00
Sunny-bot1	74aa31d15b	[Feature] support bad_words (#3055 ) * support bad_words * support online infer bad_words * update * add CI test * update * update * update --------- Co-authored-by: Yuanle Liu <yuanlehome@163.com>	2025-07-30 09:31:29 +08:00
zhuzixuan	ad7bb52a28	修复传入max_tokens=1时的报错 (#3068 ) * 修复传入max_tokens=1时的报错 * 修复传入max_tokens=1时的报错 * 修复传入max_tokens=1时的报错 * 修复传入max_tokens=1时的报错 * 修复传入max_tokens=1时的报错 * 修复传入max_tokens=1时的报错	2025-07-29 23:49:28 +08:00
李泳桦	69996a40da	[feat] add disable_chat_template in chat api as a substitute for previous raw_request (#3020 ) * [feat] add disable_chat_template in chat api as a substitute for previous raw_request * [fix] pre-commit code check	2025-07-25 20:57:32 +08:00
EnflameGCU	7634ffb709	[GCU] Add CI (#3006 )	2025-07-25 10:59:29 +08:00
Zero Rains	0fb37ab7e4	update flake8 version to support pre-commit in python3.12 (#3000 ) * update flake8 version to support pre-commit in python3.12 * polish code	2025-07-24 01:43:31 -07:00
李泳桦	8a619e9db5	[Feature] Add return_token_ids, prompt_token_ids, and delete training, raw_request in request body (#2940 ) * [feat] add return_token_ids, prompt_token_ids, delete raw_request in request body * [fix] return_token_ids not working in curl request * [test] improve some test cases of return_token_ids and prompt_token_ids * [fix] the server responds ok even if request.messages is an empty list	2025-07-21 19:31:14 +08:00
Yuanle Liu	2f74e93d7e	use dist.all_reduce(min) to sync num_blocks_local (#2933 ) * pre-commit all files check * reduce min num_blocks_local * fix nranks=1 * pre-commit when commit-msg	2025-07-21 01:23:36 -07:00
gaoziyuan	95a214ae43	support trainer_degree in name_mapping (#2935 )	2025-07-20 23:12:55 -07:00
liddk1121	17c5d3a241	[Iluvatar GPU] Add CI scripts (#2876 )	2025-07-21 09:44:42 +08:00
Zero Rains	25698d56d1	polish code with new pre-commit rule (#2923 )	2025-07-19 23:19:27 +08:00
ZhangYulongg	b8676d71a8	update ci cases Some checks failed Deploy GitHub Pages / deploy (push) Has been cancelled Details	2025-07-18 21:44:07 +08:00
ZhangYulongg	43976138de	update ci cases	2025-07-18 21:44:07 +08:00
ZhangYulongg	e546e6b1b0	update ci cases	2025-07-18 21:44:07 +08:00
ZhangYulongg	eb77b1be6d	update ci cases	2025-07-18 21:44:07 +08:00
Jiang-Jia-Jun	fbe3547c95	[Feature] Support include_stop_str_in_output in chat/completion (#2910 ) * [Feature] Support include_stop_str_in_output in chat/completion * Add ci test for include_stop_str_in_output * Update version of openai * Fix ci test --------- Co-authored-by: Jiang-Jia-Jun <jiangjiajun@baidu.com>	2025-07-18 16:59:18 +08:00
RAM	0fad10b35a	[Executor] CUDA Graph support padding batch (#2844 ) * cuda graph support padding batch * Integrate the startup parameters for the graph optimization backend and provide support for user - defined capture sizes. * Do not insert max_num_seqs when the user specifies a capture list * Support set graph optimization config from YAML file * update cuda graph ci * fix ci bug * fix ci bug	2025-07-15 19:49:01 -07:00
xiegegege	16940822a7	add result save for ci (#2824 ) Some checks failed Deploy GitHub Pages / deploy (push) Has been cancelled Details LGTM	2025-07-12 23:34:46 +08:00
xiegetest	f6ffbc3cbd	add precision check for ci (#2732 ) * add precision check for ci * add precision check for ci * add precision check for ci * add precision check for ci --------- Co-authored-by: xiegegege <xiege01@baidu.com>	2025-07-08 18:43:53 +08:00
YuBaoku	dacc46f04c	[CI] Add validation for MTP and CUDAGraph (#2710 ) * set git identity to avoid merge failure in CI * add ci cases * [CI] Add validation for MTP and CUDAGraph	2025-07-04 18:13:54 +08:00
LQX	11cfdf5d89	添加XPU CI, test=model (#2701 ) * 添加XPU CI, test=model * 添加XPU CI, test=model * 添加XPU CI, test=model * 添加XPU CI, test=model * 添加XPU CI, test=model * 添加XPU CI, test=model * 添加XPU CI, test=model * 添加XPU CI, test=model * 添加XPU CI, test=model	2025-07-04 16:16:06 +08:00
YuBaoku	bb880c8d7c	Update CI test cases (#2671 ) * set git identity to avoid merge failure in CI * add ci cases	2025-07-02 15:08:39 +08:00
YUNSHEN XIE	d5af78945b	Add ci (#2650 ) Some checks failed Deploy GitHub Pages / deploy (push) Has been cancelled Details * add ci ut and workflow * Automatically cancel any previous CI runs for the ci.yml workflow, keeping only the latest one active	2025-06-30 20:20:49 +08:00
Jiang-Jia-Jun	92c2cfa2e7	Sync v2.0 version of code to github repo	2025-06-29 23:29:37 +00:00
XieYunshen	0825146538	add ci ut and workflow	2025-06-16 02:18:00 +08:00

29 Commits