| 
							
							
								 yinwei | 354575b6d1 | [Docs]Modify the gpu-memory-utilization of the 128K 8-card Wint4 model to 0.95 (#3428) * XPU Update 2.1 Release Documentation
* code style check
* Modify the gpu-memory-utilization of the 128K 8-card Wint4 model to 0.95 | 2025-08-15 18:34:37 +08:00 |  | 
			
				
					| 
							
							
								 yinwei | fbb6dcb9e4 | [Docs]XPU Update 2.1 Release Documentation (#3423) * XPU Update 2.1 Release Documentation
* code style check | 2025-08-15 14:07:47 +08:00 |  | 
			
				
					| 
							
							
								 Yuanle Liu | 9571c458f0 | enhance eos_tokens (#3274) * enhance eos_tokens
* update
* update | 2025-08-11 14:47:52 +08:00 |  | 
			
				
					| 
							
							
								 hong19860320 | 93a1731891 | [Doc] Update deps and fix dead links (#3252) | 2025-08-07 11:04:31 +08:00 |  | 
			
				
					| 
							
							
								 yinwei | 5b9aec1f10 | xpu release 2.0.3 (#3105) | 2025-07-31 14:26:07 +08:00 |  | 
			
				
					| 
							
							
								 Zero Rains | 25698d56d1 | polish code with new pre-commit rule (#2923) | 2025-07-19 23:19:27 +08:00 |  | 
			
				
					| 
							
							
								 yulangz | 7dfd2ea052 | [XPU][doc] Update minimal fastdeploy required (#2863) * [XPU][doc] update minimal fastdeploy required | 2025-07-17 11:33:22 +08:00 |  | 
			
				
					| 
							
							
								 yulangz | 17314ee126 | [XPU] Update doc and add scripts for downloading dependencies (#2845) * [XPU] update xvllm download
* update supported models
* fix xpu model runner in huge memory with small model
* update doc | 2025-07-16 11:05:56 +08:00 |  | 
			
				
					| 
							
							
								 Sunny-bot1 | 240d6236bc | [Fix]fix top_k_top_p sampling (#2801) 
		
	
	
		
			
				
	
				Deploy GitHub Pages / deploy (push) Has been cancelled * fix topk-topp
* update
* add base_non_truncated | 2025-07-10 22:35:10 +08:00 |  | 
			
				
					| 
							
							
								 chen | 888780ffde | [Feature] block_wise_fp8 support triton_moe_backend (#2767) | 2025-07-09 19:22:47 +08:00 |  | 
			
				
					| 
							
							
								 Jiang-Jia-Jun | 92c2cfa2e7 | Sync v2.0 version of code to github repo | 2025-06-29 23:29:37 +00:00 |  | 
			
				
					| 
							
							
								 jiangjiajun | 4c07d198ba | add model zoo | 2022-07-06 03:12:43 +00:00 |  | 
			
				
					| 
							
							
								 jiangjiajun | 9d87046d78 | first commit | 2022-07-05 09:30:15 +00:00 |  |