ltd0924 
							
						 
					 
					
						
						
							
						
						f75697c2d1 
					 
					
						
						
							
							[Feature] support clear data ( #4185 )  
						
						... 
						
						
	
		
			
	 
	
	
		
	
	
		
			
				
	CE Compile Job / ce_job_pre_check (push) Has been cancelled 
				
			 
		
			
				
	CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled 
				
			 
		
			
				
	CE Compile Job / FD-Clone-Linux (push) Has been cancelled 
				
			 
		
			
				
	CE Compile Job / Show Code Archive Output (push) Has been cancelled 
				
			 
		
			
				
	CE Compile Job / BUILD_SM8090 (push) Has been cancelled 
				
			 
		
			
				
	CE Compile Job / BUILD_SM8689 (push) Has been cancelled 
				
			 
		
			
				
	CE Compile Job / CE_UPLOAD (push) Has been cancelled 
				
			 
		
		
	 
 
	 
						
						* fix
* fix
* fix
* [Feature] support clear data
* update
* fix
* fix
* fix
* fix 
						
						
					 
					
						2025-09-21 20:41:27 +08:00 
						 
				 
			
				
					
						
							
							
								ltd0924 
							
						 
					 
					
						
						
							
						
						749f074e44 
					 
					
						
						
							
							Update multi_api_server.py ( #4023 )  
						
						
						
						
					 
					
						2025-09-10 17:15:01 +08:00 
						 
				 
			
				
					
						
							
							
								chenjian 
							
						 
					 
					
						
						
							
						
						8915c8411d 
					 
					
						
						
							
							Revert "[Feature] Setting number of apiserver workers automatically ( #3794 )" ( #3918 )  
						
						... 
						
						
						
						This reverts commit d1d063e4af 
						
						
					 
					
						2025-09-05 21:06:50 +08:00 
						 
				 
			
				
					
						
							
							
								ltd0924 
							
						 
					 
					
						
						
							
						
						173e4df982 
					 
					
						
						
							
							[Fix] mv connection_manager init ( #3902 )  
						
						... 
						
						
	
		
			
	 
	
	
		
	
	
		
			
				
	CE Compile Job / ce_job_pre_check (push) Has been cancelled 
				
			 
		
			
				
	CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled 
				
			 
		
			
				
	CE Compile Job / FD-Clone-Linux (push) Has been cancelled 
				
			 
		
			
				
	CE Compile Job / Show Code Archive Output (push) Has been cancelled 
				
			 
		
			
				
	CE Compile Job / BUILD_SM8090 (push) Has been cancelled 
				
			 
		
			
				
	CE Compile Job / BUILD_SM8689 (push) Has been cancelled 
				
			 
		
			
				
	CE Compile Job / CE_UPLOAD (push) Has been cancelled 
				
			 
		
		
	 
 
	 
						
						* Update serving_chat.py
* Update serving_completion.py
* Update serving_completion.py
* mv connection_manager init
---------
Co-authored-by: Yuanle Liu <yuanlehome@163.com > 
						
						
					 
					
						2025-09-05 17:42:36 +08:00 
						 
				 
			
				
					
						
							
							
								ltd0924 
							
						 
					 
					
						
						
							
						
						55ebe855c0 
					 
					
						
						
							
							[Feature] support controller port in multi api server ( #3895 )  
						
						... 
						
						
						
						* fix scheduler bug
* fix
* Update api_server.py
* Update multi_api_server.py 
						
						
					 
					
						2025-09-05 13:38:58 +08:00 
						 
				 
			
				
					
						
							
							
								luukunn 
							
						 
					 
					
						
						
							
						
						b8d0f1c081 
					 
					
						
						
							
							[bug] fix finish reason ( #3858 )  
						
						... 
						
						
						
						* add reasoning parser plugin
* fix finish reason
---------
Co-authored-by: Yuanle Liu <yuanlehome@163.com > 
						
						
					 
					
						2025-09-04 14:36:03 +08:00 
						 
				 
			
				
					
						
							
							
								ltd0924 
							
						 
					 
					
						
						
							
						
						8550e19008 
					 
					
						
						
							
							[bugfix] scheduler ( #3871 )  
						
						... 
						
						
						
						* fix scheduler bug
* fix
* Update api_server.py 
						
						
					 
					
						2025-09-04 11:34:12 +08:00 
						 
				 
			
				
					
						
							
							
								SunLei 
							
						 
					 
					
						
						
							
						
						8c0e7d6fe9 
					 
					
						
						
							
							Support for async processor added. ( #3870 )  
						
						... 
						
						
						
						* Support for async processor added.
* remove yappi code 
						
						
					 
					
						2025-09-04 10:35:08 +08:00 
						 
				 
			
				
					
						
							
							
								ltd0924 
							
						 
					 
					
						
						
							
						
						37cb37b7f2 
					 
					
						
						
							
							[BugFix] fix scheduler ( #3818 )  
						
						... 
						
						
						
						* fix scheduler bug
* fix 
						
						
					 
					
						2025-09-03 11:16:49 +08:00 
						 
				 
			
				
					
						
							
							
								ltd0924 
							
						 
					 
					
						
						
							
						
						cd09384a14 
					 
					
						
						
							
							[BugFix] fix max streaming tokens invalid ( #3799 )  
						
						... 
						
						
						
						* Update serving_chat.py
* Update serving_completion.py
* Update serving_completion.py 
						
						
					 
					
						2025-09-02 21:03:13 +08:00 
						 
				 
			
				
					
						
							
							
								Jiang-Jia-Jun 
							
						 
					 
					
						
						
							
						
						d1d063e4af 
					 
					
						
						
							
							[Feature] Setting number of apiserver workers automatically ( #3794 )  
						
						... 
						
						
						
						Co-authored-by: Jiang-Jia-Jun <jiangjiajun@baidu.com > 
						
						
					 
					
						2025-09-02 17:19:07 +08:00 
						 
				 
			
				
					
						
							
							
								SunLei 
							
						 
					 
					
						
						
							
						
						b9af95cf1c 
					 
					
						
						
							
							[Feature] Add AsyncTokenizerClient&ChatResponseProcessor with remote encode&decode support. ( #3674 )  
						
						... 
						
						
						
						* [Feature] add AsyncTokenizerClient
* add decode_image
* Add response_processors with remote decode support.
* [Feature] add tokenizer_base_url startup argument
* Revert comment removal and restore original content.
* [Feature] Non-streaming requests now support remote image decoding.
* Fix parameter type issue in decode_image call.
* Keep completion_token_ids when return_token_ids = False.
* add copyright 
						
						
					 
					
						2025-08-30 17:06:26 +08:00 
						 
				 
			
				
					
						
							
							
								luukunn 
							
						 
					 
					
						
						
							
						
						9a7c231f2c 
					 
					
						
						
							
							[Feature]support chat_template.jinja ( #3721 )  
						
						... 
						
						
						
						* add support chat_template.jinja
* add support chat_template.jinja 
						
						
					 
					
						2025-08-30 17:05:34 +08:00 
						 
				 
			
				
					
						
							
							
								李泳桦 
							
						 
					 
					
						
						
							
						
						88297240e7 
					 
					
						
						
							
							[feat] completion api supports passing input token ids in either prompt or prompt_token_ids ( #3311 )  
						
						... 
						
						
						
						* [feat] completion api supports passing input token ids in either `prompt` or `prompt_token_ids`
* [fix] update comment
* [fix] fix type error
* [test] add a unittest file for serving api test
* [test] try to fix ci error
* [chore] rename test function names
* [test] try to fix ci error
* [test] try to fix ci error
* [test] add tests for qwen 
						
						
					 
					
						2025-08-29 14:19:42 +08:00 
						 
				 
			
				
					
						
							
							
								Yuanle Liu 
							
						 
					 
					
						
						
							
						
						4957908275 
					 
					
						
						
							
							add input_processor plugin ( #3657 )  
						
						... 
						
						
						
						* add input_processor plugin
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update 
						
						
					 
					
						2025-08-28 22:53:57 +08:00 
						 
				 
			
				
					
						
							
							
								ltd0924 
							
						 
					 
					
						
						
							
						
						e5015eea05 
					 
					
						
						
							
							[BugFix] fix logger ( #3666 )  
						
						
						
						
					 
					
						2025-08-28 17:08:00 +08:00 
						 
				 
			
				
					
						
							
							
								gaoziyuan 
							
						 
					 
					
						
						
							
						
						82e64b13e1 
					 
					
						
						
							
							[NewFeature]Support dp multi api server && Fix some bug in mixed ep && merge develop ( #3598 )  
						
						... 
						
						
						
						* [Feature] update ep
* fix ci
* fix ci
* fix ci
* fix ci
* fix ci
* fix ci
* fix ci
* fix queue ports idx
* fix ci
* fix ci
* fix ci
* fix ci
* fix ci
* fix ci
* fix ci
* fix ci
* Update engine.py
* fix ci
* fix some bug in mixed ep
* add server fix and op fix
* rm some log
* fix code style
* ltd fix
* fix
* fix
* fix some bug
* fix bug
* fix bug
* fix style
* Update config.py
* Update splitwise_connector.py
* Update cache_messager.py
* Update __init__.py
* merge and fix
* Update engine.py
* Update common_engine.py
* Update run_ci_xpu.sh
* Update ernie_processor.py
* Update ernie_processor.py
---------
Co-authored-by: ltd0924 <ltd0924@sina.com >
Co-authored-by: ltd0924 <32387785+ltd0924@users.noreply.github.com > 
						
						
					 
					
						2025-08-26 19:59:02 +08:00 
						 
				 
			
				
					
						
							
							
								SunLei 
							
						 
					 
					
						
						
							
						
						2f28f40d90 
					 
					
						
						
							
							fix: replace list * n initialization with list comprehension to avoid shared references ( #3618 )  
						
						
						
						
					 
					
						2025-08-26 17:53:31 +08:00 
						 
				 
			
				
					
						
							
							
								Sunny-bot1 
							
						 
					 
					
						
						
							
						
						c68c3c4b8b 
					 
					
						
						
							
							[Feature] bad words support v1 scheduler and specifiy token ids ( #3608 )  
						
						... 
						
						
						
						* support bad_words_token_ids
* docs
* fix test
* fix
* bad words support kvcache v1 and token ids
* fix 
						
						
					 
					
						2025-08-25 20:14:51 -07:00 
						 
				 
			
				
					
						
							
							
								ltd0924 
							
						 
					 
					
						
						
							
						
						66c5addce4 
					 
					
						
						
							
							[Bugfix] fix api server control signal bugs ( #3531 )  
						
						... 
						
						
	
		
			
	 
	
	
		
	
	
		
			
				
	CE Compile Job / ce_job_pre_check (push) Has been cancelled 
				
			 
		
			
				
	CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled 
				
			 
		
			
				
	CE Compile Job / FD-Clone-Linux (push) Has been cancelled 
				
			 
		
			
				
	CE Compile Job / Show Code Archive Output (push) Has been cancelled 
				
			 
		
			
				
	CE Compile Job / BUILD_SM8090 (push) Has been cancelled 
				
			 
		
			
				
	CE Compile Job / BUILD_SM8689 (push) Has been cancelled 
				
			 
		
			
				
	CE Compile Job / CE_UPLOAD (push) Has been cancelled 
				
			 
		
			
				
	Deploy GitHub Pages / deploy (push) Has been cancelled 
				
			 
		
		
	 
 
	 
						
						* Update serving_chat.py
* Update serving_completion.py
* Update serving_completion.py 
						
						
					 
					
						2025-08-25 21:13:04 +08:00 
						 
				 
			
				
					
						
							
							
								chen 
							
						 
					 
					
						
						
							
						
						9cab3f47ff 
					 
					
						
						
							
							[Feature] Add temp_scaled_logprobs and top_p_normalized_logprobs parameters for logits and logprobs post processing ( #3552 )  
						
						... 
						
						
						
						* [feature] Add temp_scaled_logprobs and top_p_normalized_logprobs parameters for logits and logprobs post processing
* infer engine support temp_scaled_logprobs and top_p_normalized_logprobs
* delete some code
* code check
* code check and add doc
* fix tokenizer.decoder(-1), return 'Invalid Token'
* add ci for temp_scaled and top_p logprobs
* check test
* check seq len time shape
* logprob clip inf
---------
Co-authored-by: sunlei1024 <sunlei5788@gmail.com > 
						
						
					 
					
						2025-08-25 14:11:49 +08:00 
						 
				 
			
				
					
						
							
							
								李泳桦 
							
						 
					 
					
						
						
							
						
						8bea4b1e25 
					 
					
						
						
							
							[fix] fix output tokens count in streaming completion api ( #3507 )  
						
						
						
						
					 
					
						2025-08-21 18:19:13 +08:00 
						 
				 
			
				
					
						
							
							
								李泳桦 
							
						 
					 
					
						
						
							
						
						e4f0b755b4 
					 
					
						
						
							
							[fix] setting disable_chat_template while passing prompt_token_ids led to response error ( #3228 )  
						
						... 
						
						
						
						* [fix] setting disable_chat_template while passing prompt_token_ids led to response error
* [fix] code syntax
* [test] add test case for this bug
* [test] add test case for empty message list
* [test] fix test case for empty message list 
						
						
					 
					
						2025-08-21 17:30:51 +08:00 
						 
				 
			
				
					
						
							
							
								luukunn 
							
						 
					 
					
						
						
							
						
						371fb3f853 
					 
					
						
						
							
							[Feature] add tool parser ( #3483 )  
						
						... 
						
						
						
						* add tool parser
* add x1 enable_thinking
* restart ci
* fix vl reasoning parser
* modify call style
* modify call style
* add offline enablethinking
* fix completion
* fix
* fix unit test
* fix unit test
* fix unit test
* fix vl reasoning parser
* fix vl reasoning parser 
						
						
					 
					
						2025-08-21 17:25:44 +08:00 
						 
				 
			
				
					
						
							
							
								Yzc216 
							
						 
					 
					
						
						
							
						
						466cbb5a99 
					 
					
						
						
							
							[Feature] Models api ( #3073 )  
						
						... 
						
						
						
						* add v1/models interface related
* add model parameters
* default model verification
* unit test
* check model err_msg
* unit test
* type annotation
* model parameter in response
* modify document description
* modify document description
* unit test
* verification
* verification update
* model_name
* pre-commit
* update test case
* update test case
* Update tests/entrypoints/openai/test_serving_models.py
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
* Update tests/entrypoints/openai/test_serving_models.py
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
* Update tests/entrypoints/openai/test_serving_models.py
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
* Update tests/entrypoints/openai/test_serving_models.py
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
* Update fastdeploy/entrypoints/openai/serving_models.py
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com >
---------
Co-authored-by: LiqinruiG <37392159+LiqinruiG@users.noreply.github.com >
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com > 
						
						
					 
					
						2025-08-21 17:02:56 +08:00 
						 
				 
			
				
					
						
							
							
								ltd0924 
							
						 
					 
					
						
						
							
						
						51f68ae593 
					 
					
						
						
							
							[Feature] add dealer manager to reuse the connection ( #3471 )  
						
						... 
						
						
						
						* [BugFix] fix control signal release failed
* [BugFix] fix control signal release failed
* update
* update
* update
* [Feature] add dealer manager to reuse the connection
* fix
* fix
* fix
* fix
* fix
* fix
* Create test_dealer_connection_manager.py
* Delete test/entrypoints/openai directory
* Update test_dealer_connection_manager.py
* Update test_dealer_connection_manager.py 
						
						
					 
					
						2025-08-21 13:11:13 +08:00 
						 
				 
			
				
					
						
							
							
								memoryCoderC 
							
						 
					 
					
						
						
							
						
						31f639f10b 
					 
					
						
						
							
							[Feature] add prompt_tokens and completion_tokens ( #3504 )  
						
						
	
		
			
	 
	
	
		
	
	
		
			
				
	Deploy GitHub Pages / deploy (push) Has been cancelled 
				
			 
		
		
	 
 
	 
						
						
					 
					
						2025-08-21 10:23:27 +08:00 
						 
				 
			
				
					
						
							
							
								kevin 
							
						 
					 
					
						
						
							
						
						67298cf4c0 
					 
					
						
						
							
							add error traceback info ( #3419 )  
						
						... 
						
						
	
		
			
	 
	
	
		
	
	
		
			
				
	Deploy GitHub Pages / deploy (push) Has been cancelled 
				
			 
		
		
	 
 
	 
						
						* add error traceback info
* update error msg
* update code
---------
Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com > 
						
						
					 
					
						2025-08-19 19:32:04 +08:00 
						 
				 
			
				
					
						
							
							
								ltd0924 
							
						 
					 
					
						
						
							
						
						bca8905b40 
					 
					
						
						
							
							[BugFix] fix control signal release failed ( #3390 )  
						
						... 
						
						
						
						* [BugFix] fix control signal release failed
* [BugFix] fix control signal release failed
* update
* update
* update 
						
						
					 
					
						2025-08-19 13:51:38 +08:00 
						 
				 
			
				
					
						
							
							
								zhuzixuan 
							
						 
					 
					
						
						
							
						
						c95b3395e9 
					 
					
						
						
							
							【BugFix】completion接口echo回显支持 ( #3245 )  
						
						... 
						
						
						
						* wenxin-tools-511,修复v1/completion无法回显的问题。
* 支持多prompt的回显
* 支持多prompt情况下的流式回显
* 补充了 completion 接口支持 echo 的单元测试
* pre-commit
* 移除了多余的test文件
* 修复了completion接口echo支持的单测方法
* 补充了单元测试文件
* 补充单测
* unittest
* 补充单测
* 修复单测
* 删除不必要的assert.
* 重新提交
* 更新测试方法
* ut
* 验证是否是正确思路单测
* 验证是否是正确思路单测
* 验证是否是正确思路单测3
* 优化单测代码,有针对性地缩小单测范围。
* 优化单测代码2,有针对性地缩小单测范围。
* 优化单测代码3,有针对性地缩小单测范围。
* support 'echo' in chat/completion.
* update
* update
* update
* update
* update
* update
* 补充了关于tokenid的单元测试
* update
* 修正index错误
* 修正index错误 
						
						
					 
					
						2025-08-19 10:41:51 +08:00 
						 
				 
			
				
					
						
							
							
								luukunn 
							
						 
					 
					
						
						
							
						
						9c129813f9 
					 
					
						
						
							
							[Feature] add custom chat template ( #3251 )  
						
						... 
						
						
						
						* add custom chat_template
* add custom chat_template
* add unittest
* fix
* add docs
* fix comment
* add offline chat
* fix unit test
* fix unit test
* fix
* fix pre commit
* fix unit test
* add unit test
* add unit test
* add unit test
* fix pre_commit
* fix enable_thinking
* fix pre commit
* fix pre commit
* fix unit test
* add requirements 
						
						
					 
					
						2025-08-18 16:34:08 +08:00 
						 
				 
			
				
					
						
							
							
								gaoziyuan 
							
						 
					 
					
						
						
							
						
						6fdd83da10 
					 
					
						
						
							
							fix some bug ( #3434 )  
						
						
						
						
					 
					
						2025-08-18 14:39:13 +08:00 
						 
				 
			
				
					
						
							
							
								xiaolei373 
							
						 
					 
					
						
						
							
						
						d4f610e4cd 
					 
					
						
						
							
							feat(log):add_request_and_response_log ( #3373 )  
						
						
	
		
			
	 
	
	
		
	
	
		
			
				
	Deploy GitHub Pages / deploy (push) Has been cancelled 
				
			 
		
		
	 
 
	 
						
						
					 
					
						2025-08-13 23:27:41 +08:00 
						 
				 
			
				
					
						
							
							
								luukunn 
							
						 
					 
					
						
						
							
						
						eda83ca672 
					 
					
						
						
							
							add Tool Parser ( #3272 )  
						
						... 
						
						
	
		
			
	 
	
	
		
	
	
		
			
				
	Deploy GitHub Pages / deploy (push) Has been cancelled 
				
			 
		
		
	 
 
	 
						
						* add tool-parser
* add tool-parser
* add tool parser
* add tool parser
* fix
* add offline
* add offline
* fix
* parsers:tool&reasoning
* 修改tool parser名称·
* update
* fix reasoning-parser
* add requirements
* fix finish reason
* fix
* fix reasoning-parser
* fix
* fix
* fix
* fix
* fix
---------
Co-authored-by: zhuzixuan <zhuzixuan@baidu.com > 
						
						
					 
					
						2025-08-13 01:06:55 +08:00 
						 
				 
			
				
					
						
							
							
								memoryCoderC 
							
						 
					 
					
						
						
							
						
						2d1a4cacdf 
					 
					
						
						
							
							Completion add raw_prediction/text_after_process ( #3356 )  
						
						
						
						
					 
					
						2025-08-12 23:06:45 +08:00 
						 
				 
			
				
					
						
							
							
								memoryCoderC 
							
						 
					 
					
						
						
							
						
						c575611a5b 
					 
					
						
						
							
							[BugFix] v1/completions add finish_reason ( #3246 )  
						
						... 
						
						
						
						* [BugFix] v1/completions add finish_reason
* update TestOpenAIServingCompletion for merge
---------
Co-authored-by: YUNSHEN XIE <1084314248@qq.com > 
						
						
					 
					
						2025-08-12 19:40:26 +08:00 
						 
				 
			
				
					
						
							
							
								ltd0924 
							
						 
					 
					
						
						
							
						
						31d4fcb425 
					 
					
						
						
							
							[BugFix] fix too many open files problem ( #3256 )  
						
						... 
						
						
	
		
			
	 
	
	
		
	
	
		
			
				
	Deploy GitHub Pages / deploy (push) Has been cancelled 
				
			 
		
		
	 
 
	 
						
						* Update cache_messager.py
* fix too many open files problem
* fix too many open files problem
* fix too many open files problem
* fix ci bugs
* Update api_server.py
* add parameter
* format
* format
* format
* format
* Update parameters.md
* Update parameters.md
* Update serving_completion.py
* Update serving_chat.py
* Update envs.py
---------
Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com > 
						
						
					 
					
						2025-08-08 20:10:11 +08:00 
						 
				 
			
				
					
						
							
							
								李泳桦 
							
						 
					 
					
						
						
							
						
						09cc4e2802 
					 
					
						
						
							
							[fix] fix completion stream api output_tokens not in usage ( #3247 )  
						
						
						
						
					 
					
						2025-08-07 10:36:00 +08:00 
						 
				 
			
				
					
						
							
							
								sg263 
							
						 
					 
					
						
						
							
						
						841e831575 
					 
					
						
						
							
							[Trace]add trace when fd start  ( #3174 )  
						
						... 
						
						
	
		
			
	 
	
	
		
	
	
		
			
				
	Deploy GitHub Pages / deploy (push) Has been cancelled 
				
			 
		
		
	 
 
	 
						
						* add opentelemetry
* add opentelemetry
* add opentelemetry on dequeue
* add opentelemetry on dequeue
* add opentelemetry on dequeue
* fix annotation
* fix annotation when add opentelemetry
* fix opentelemetry-instrumentation-fastapi
* fix pentelemetry-bootstrap
* fix opentelemetry can not work in uvicorn
* move conf to env
* fd start add trace
* fix pre-commit
* fix pre-commit
* change FD_JOB_ID
---------
Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com >
Co-authored-by: shige <shige@baidu.com > 
						
						
					 
					
						2025-08-05 21:18:27 +08:00 
						 
				 
			
				
					
						
							
							
								lizhenyun01 
							
						 
					 
					
						
						
							
						
						fe540f6caa 
					 
					
						
						
							
							[plugin] Custom model_runner/model support ( #3186 )  
						
						... 
						
						
						
						* support custom model&&model_runner
* fix merge
* add test && update doc
* fix codestyle
* fix unittest
* load model in rl 
						
						
					 
					
						2025-08-04 18:52:39 -07:00 
						 
				 
			
				
					
						
							
							
								ApplEOFDiscord 
							
						 
					 
					
						
						
							
						
						b71cbb466d 
					 
					
						
						
							
							[Feature] remove dependency on enable_mm and refine multimodal's code ( #3014 )  
						
						... 
						
						
						
						* remove dependency on enable_mm
* fix codestyle check error
* fix codestyle check error
* update docs
* resolve conflicts on model config
* fix unit test error
* fix code style check error
---------
Co-authored-by: shige <1021937542@qq.com >
Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com > 
						
						
					 
					
						2025-08-01 20:01:18 +08:00 
						 
				 
			
				
					
						
							
							
								SunLei 
							
						 
					 
					
						
						
							
						
						dade19d7a4 
					 
					
						
						
							
							[Feature] General support for logprobs ( #2974 )  
						
						... 
						
						
						
						* [Feature] support logprobs in chat/completions and completions endpoints
* Temporarily comment out text_offset due to incorrect logic
* Clean up temporary debug prints
* [Feature] support logprobs in offline mode via SamplingParams
* fix: serialize Logprob as dict before zmq send to fix msgpack error
* refactor: remove redundant methods to simplify codebase
* Fix missing fields in CompletionOutput.to_dict affecting msgpack serialization
* refactor: centralize param validation in engine_client to reduce duplication
* revert: rollback changes in offline_demo.py
* revert: rollback changes in offline_demo.py
* [bugfix] fix parameter validation for logprobs
* [bugfix] fix parameter validation for logprobs
* [bugfix] fix parameter validation for logprobs
* [bugfix] fix parameter validation for logprobs
---------
Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com > 
						
						
					 
					
						2025-07-31 20:25:56 +08:00 
						 
				 
			
				
					
						
							
							
								LiqinruiG 
							
						 
					 
					
						
						
							
						
						25005fee30 
					 
					
						
						
							
							[Doc]  add chat_template_kwagrs and update params docs ( #3103 )  
						
						... 
						
						
						
						* add chat_template_kwagrs and update params docs
* add chat_template_kwagrs and update params docs
* update enable_thinking
* pre-commit
* update test case
---------
Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com > 
						
						
					 
					
						2025-07-31 19:44:06 +08:00 
						 
				 
			
				
					
						
							
							
								Jiang-Jia-Jun 
							
						 
					 
					
						
						
							
						
						0616c208d2 
					 
					
						
						
							
							[Feature] Support include_stop_str_in_output in completion api ( #3096 )  
						
						... 
						
						
						
						* [Feature] Support include_stop_str_in_output in completion api
* Fix ci test
---------
Co-authored-by: Jiang-Jia-Jun <jiangjiajun@baidu.com > 
						
						
					 
					
						2025-07-30 22:18:48 +08:00 
						 
				 
			
				
					
						
							
							
								李泳桦 
							
						 
					 
					
						
						
							
						
						b242150f94 
					 
					
						
						
							
							[feat] extra parameters are all passed directly via http payload now, or in extra_body if using openai client ( #3058 )  
						
						... 
						
						
						
						* [feat] extra parameters are all passed directly via http payload now, or in extra_body if using openai client
* [fix] delete ci test case for enable_thinking
* [fix] add reasoning_parser when server starts
* [fix] fix ci consistency test error with reasoning parser
* [doc] update docs related to metadata
* [fix] cancel enable_thinking default value 
						
						
					 
					
						2025-07-30 19:25:20 +08:00 
						 
				 
			
				
					
						
							
							
								Sunny-bot1 
							
						 
					 
					
						
						
							
						
						74aa31d15b 
					 
					
						
						
							
							[Feature] support bad_words ( #3055 )  
						
						... 
						
						
						
						* support bad_words
* support online infer bad_words
* update
* add CI test
* update
* update
* update
---------
Co-authored-by: Yuanle Liu <yuanlehome@163.com > 
						
						
					 
					
						2025-07-30 09:31:29 +08:00 
						 
				 
			
				
					
						
							
							
								李泳桦 
							
						 
					 
					
						
						
							
						
						69996a40da 
					 
					
						
						
							
							[feat] add disable_chat_template in chat api as a substitute for previous raw_request ( #3020 )  
						
						... 
						
						
						
						* [feat] add disable_chat_template in chat api as a substitute for previous raw_request
* [fix] pre-commit code check 
						
						
					 
					
						2025-07-25 20:57:32 +08:00 
						 
				 
			
				
					
						
							
							
								Zero Rains 
							
						 
					 
					
						
						
							
						
						0fb37ab7e4 
					 
					
						
						
							
							update flake8 version to support pre-commit in python3.12 ( #3000 )  
						
						... 
						
						
						
						* update flake8 version to support pre-commit in python3.12
* polish code 
						
						
					 
					
						2025-07-24 01:43:31 -07:00 
						 
				 
			
				
					
						
							
							
								ltd0924 
							
						 
					 
					
						
						
							
						
						f935d6f862 
					 
					
						
						
							
							[BugFix] fix multinode deployment ( #2977 )  
						
						
						
						
					 
					
						2025-07-24 15:04:04 +08:00 
						 
				 
			
				
					
						
							
							
								Yzc216 
							
						 
					 
					
						
						
							
						
						e14587a954 
					 
					
						
						
							
							[Feature] multi-source download ( #2986 )  
						
						... 
						
						
						
						* multi-source download
* multi-source download
* huggingface download revision
* requirement
* style
* add revision arg
* test
* pre-commit 
						
						
					 
					
						2025-07-24 14:26:37 +08:00