ming1753 
							
						 
					 
					
						
						
							
						
						5acde4eb43 
					 
					
						
						
							
							[Feature] Multimodal Scheduler V1 ( #3019 )  
						
						... 
						
						
						
						* [Feature] Support multimodal scheduler v1
* remove debug log
* fix bug
* fix format
* modify code
* fix bug
* fix bug
* fix bug
* modify code 
						
						
					 
					
						2025-07-30 16:05:55 +08:00 
						 
				 
			
				
					
						
							
							
								YuanRisheng 
							
						 
					 
					
						
						
							
						
						502ee92a0a 
					 
					
						
						
							
							Unify server-side and model-side Config (Part3)  ( #3047 )  
						
						... 
						
						
						
						* merge model config
* fix arch
* fix rl 
						
						
					 
					
						2025-07-29 17:07:44 +08:00 
						 
				 
			
				
					
						
							
							
								JYChen 
							
						 
					 
					
						
						
							
						
						dafe02a7b9 
					 
					
						
						
							
							[stop sequence] support stop sequence ( #3025 )  
						
						... 
						
						
						
						* stop seqs in multi-ends
* unittest for gpu stop op
* kernel tid==0 
						
						
					 
					
						2025-07-29 14:17:37 +08:00 
						 
				 
			
				
					
						
							
							
								Zero Rains 
							
						 
					 
					
						
						
							
						
						0fb37ab7e4 
					 
					
						
						
							
							update flake8 version to support pre-commit in python3.12 ( #3000 )  
						
						... 
						
						
						
						* update flake8 version to support pre-commit in python3.12
* polish code 
						
						
					 
					
						2025-07-24 01:43:31 -07:00 
						 
				 
			
				
					
						
							
							
								Sunny-bot1 
							
						 
					 
					
						
						
							
						
						7c5e34e72d 
					 
					
						
						
							
							[FIX]fix rejection sampling when topp=0 using _SAMPLING_EPS ( #2967 )  
						
						... 
						
						
						
						* fix rejection sampling when topp=0
* fix 
						
						
					 
					
						2025-07-22 05:53:37 -07:00 
						 
				 
			
				
					
						
							
							
								luukunn 
							
						 
					 
					
						
						
							
						
						920e6b3f60 
					 
					
						
						
							
							[Fix]fix empty prompt_token_ids,update the parser's triggering condit… ( #2891 )  
						
						
						
						
					 
					
						2025-07-22 16:13:05 +08:00 
						 
				 
			
				
					
						
							
							
								李泳桦 
							
						 
					 
					
						
						
							
						
						2a8a2c06de 
					 
					
						
						
							
							[fix] non-streaming api now returns full output ids if return_token_ids is enabled ( #2951 )  
						
						
						
						
					 
					
						2025-07-22 14:35:56 +08:00 
						 
				 
			
				
					
						
							
							
								李泳桦 
							
						 
					 
					
						
						
							
						
						8a619e9db5 
					 
					
						
						
							
							[Feature] Add return_token_ids, prompt_token_ids, and delete training, raw_request in request body ( #2940 )  
						
						... 
						
						
						
						* [feat] add return_token_ids, prompt_token_ids, delete raw_request in request body
* [fix] return_token_ids not working in curl request
* [test] improve some test cases of return_token_ids and prompt_token_ids
* [fix] the server responds ok even if request.messages is an empty list 
						
						
					 
					
						2025-07-21 19:31:14 +08:00 
						 
				 
			
				
					
						
							
							
								Zero Rains 
							
						 
					 
					
						
						
							
						
						25698d56d1 
					 
					
						
						
							
							polish code with new pre-commit rule ( #2923 )  
						
						
						
						
					 
					
						2025-07-19 23:19:27 +08:00 
						 
				 
			
				
					
						
							
							
								Jiang-Jia-Jun 
							
						 
					 
					
						
						
							
						
						fbe3547c95 
					 
					
						
						
							
							[Feature] Support include_stop_str_in_output in chat/completion ( #2910 )  
						
						... 
						
						
						
						* [Feature] Support include_stop_str_in_output in chat/completion
* Add ci test for include_stop_str_in_output
* Update version of openai
* Fix ci test
---------
Co-authored-by: Jiang-Jia-Jun <jiangjiajun@baidu.com > 
						
						
					 
					
						2025-07-18 16:59:18 +08:00 
						 
				 
			
				
					
						
							
							
								gaoziyuan 
							
						 
					 
					
						
						
							
						
						6efad14b95 
					 
					
						
						
							
							support vl ori_vacab_size ( #2900 )  
						
						
						
						
					 
					
						2025-07-18 16:26:14 +08:00 
						 
				 
			
				
					
						
							
							
								Xintong Yu 
							
						 
					 
					
						
						
							
						
						273efba76f 
					 
					
						
						
							
							[Fix] remove misleading variables ( #2841 )  
						
						... 
						
						
						
						Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com > 
						
						
					 
					
						2025-07-17 16:49:14 +08:00 
						 
				 
			
				
					
						
							
							
								LokeZhou 
							
						 
					 
					
						
						
							
						
						f50c25178b 
					 
					
						
						
							
							[MM_PROCESS] add _extract_labels ( #2879 )  
						
						
						
						
					 
					
						2025-07-17 14:20:01 +08:00 
						 
				 
			
				
					
						
							
							
								LiqinruiG 
							
						 
					 
					
						
						
							
						
						ce5adec877 
					 
					
						
						
							
							[Doc] modify offline-inerence docs ( #2800 )  
						
						... 
						
						
						
						* modify offline-inerence docs
* [bug] remove tool_call_content 
						
						
					 
					
						2025-07-10 19:41:12 +08:00 
						 
				 
			
				
					
						
							
							
								chen 
							
						 
					 
					
						
						
							
						
						d33105baeb 
					 
					
						
						
							
							[Feature] Online Chat API Support Return logprobs ( #2777 )  
						
						... 
						
						
						
						* online chat support logprobs
* check xpu
* check vl_gpu_model_runner and xpu_model_runner
* get_worker() check platform 
						
						
					 
					
						2025-07-10 16:33:40 +08:00 
						 
				 
			
				
					
						
							
							
								Ting 
							
						 
					 
					
						
						
							
						
						a6e9161045 
					 
					
						
						
							
							fix bug. ( #2718 )  
						
						
	
		
			
	 
	
	
		
	
	
		
			
				
	Deploy GitHub Pages / deploy (push) Has been cancelled 
				
			 
		
		
	 
 
	 
						
						
					 
					
						2025-07-05 08:19:19 +08:00 
						 
				 
			
				
					
						
							
							
								Ting 
							
						 
					 
					
						
						
							
						
						90ef28d982 
					 
					
						
						
							
							spec token map lazy. ( #2715 )  
						
						
	
		
			
	 
	
	
		
	
	
		
			
				
	Deploy GitHub Pages / deploy (push) Has been cancelled 
				
			 
		
		
	 
 
	 
						
						
					 
					
						2025-07-05 00:14:54 +08:00 
						 
				 
			
				
					
						
							
							
								Jiang-Jia-Jun 
							
						 
					 
					
						
						
							
						
						05c670e593 
					 
					
						
						
							
							[Sync] Update to latest code ( #2679 )  
						
						... 
						
						
						
						* [Sync] Update to latest code
* Add new code files
* Add new code files
* update code
* Try to fix build.sh
* Try to fix build.sh
* Update code
* Update requirements.txt
* Update code
---------
Co-authored-by: Jiang-Jia-Jun <jiangjiajun@baidu.com > 
						
						
					 
					
						2025-07-03 15:43:53 +08:00 
						 
				 
			
				
					
						
							
							
								Jiang-Jia-Jun 
							
						 
					 
					
						
						
							
						
						92c2cfa2e7 
					 
					
						
						
							
							Sync v2.0 version of code to github repo  
						
						
						
						
					 
					
						2025-06-29 23:29:37 +00:00 
						 
				 
			
				
					
						
							
							
								jiangjiajun 
							
						 
					 
					
						
						
							
						
						684703fd72 
					 
					
						
						
							
							[LLM] First commit the llm deployment code  
						
						
						
						
					 
					
						2025-06-09 19:20:15 +08:00