| 
							
							
								 co63oc | d6369b4d51 | fix typos (#3684) | 2025-09-01 17:50:17 +08:00 |  | 
			
				
					| 
							
							
								 周周周 | 17b414c2df | MoE Default use triton's blockwise fp8 in TP Case (#3678) | 2025-08-29 11:07:30 +08:00 |  | 
			
				
					| 
							
							
								 Mattheliu | 108d989d9d | [Docs] add fastdeploy_unit_test_guide.md (#3484) * docs:add fastdeploy_unit_test_guide.md
* docs:fix fastdeploy_unit_test_guide.md
* docs: add FastDeploy unit test spec (EN) and update usage nav
* fix codestyle | 2025-08-28 14:12:25 +08:00 |  | 
			
				
					| 
							
							
								 yinwei | 354575b6d1 | [Docs]Modify the gpu-memory-utilization of the 128K 8-card Wint4 model to 0.95 (#3428) * XPU Update 2.1 Release Documentation
* code style check
* Modify the gpu-memory-utilization of the 128K 8-card Wint4 model to 0.95 | 2025-08-15 18:34:37 +08:00 |  | 
			
				
					| 
							
							
								 yinwei | fbb6dcb9e4 | [Docs]XPU Update 2.1 Release Documentation (#3423) * XPU Update 2.1 Release Documentation
* code style check | 2025-08-15 14:07:47 +08:00 |  | 
			
				
					| 
							
							
								 Yuanle Liu | 9571c458f0 | enhance eos_tokens (#3274) * enhance eos_tokens
* update
* update | 2025-08-11 14:47:52 +08:00 |  | 
			
				
					| 
							
							
								 hong19860320 | 93a1731891 | [Doc] Update deps and fix dead links (#3252) | 2025-08-07 11:04:31 +08:00 |  | 
			
				
					| 
							
							
								 yinwei | 5b9aec1f10 | xpu release 2.0.3 (#3105) | 2025-07-31 14:26:07 +08:00 |  | 
			
				
					| 
							
							
								 Zero Rains | 25698d56d1 | polish code with new pre-commit rule (#2923) | 2025-07-19 23:19:27 +08:00 |  | 
			
				
					| 
							
							
								 yulangz | 7dfd2ea052 | [XPU][doc] Update minimal fastdeploy required (#2863) * [XPU][doc] update minimal fastdeploy required | 2025-07-17 11:33:22 +08:00 |  | 
			
				
					| 
							
							
								 yulangz | 17314ee126 | [XPU] Update doc and add scripts for downloading dependencies (#2845) * [XPU] update xvllm download
* update supported models
* fix xpu model runner in huge memory with small model
* update doc | 2025-07-16 11:05:56 +08:00 |  | 
			
				
					| 
							
							
								 Sunny-bot1 | 240d6236bc | [Fix]fix top_k_top_p sampling (#2801) 
		
	
	
		
			
				
	
				Deploy GitHub Pages / deploy (push) Has been cancelled * fix topk-topp
* update
* add base_non_truncated | 2025-07-10 22:35:10 +08:00 |  | 
			
				
					| 
							
							
								 chen | 888780ffde | [Feature] block_wise_fp8 support triton_moe_backend (#2767) | 2025-07-09 19:22:47 +08:00 |  | 
			
				
					| 
							
							
								 Jiang-Jia-Jun | 92c2cfa2e7 | Sync v2.0 version of code to github repo | 2025-06-29 23:29:37 +00:00 |  | 
			
				
					| 
							
							
								 jiangjiajun | 4c07d198ba | add model zoo | 2022-07-06 03:12:43 +00:00 |  | 
			
				
					| 
							
							
								 jiangjiajun | 9d87046d78 | first commit | 2022-07-05 09:30:15 +00:00 |  |