| 
							
							
								 yzwu | fbdd6b0663 | [Iluvatar GPU] Optimze attention and moe performance (#3234) | 2025-08-08 10:51:24 +08:00 |  | 
			
				
					| 
							
							
								 Zero Rains | 0fb37ab7e4 | update flake8 version to support pre-commit in python3.12 (#3000) * update flake8 version to support pre-commit in python3.12
* polish code | 2025-07-24 01:43:31 -07:00 |  | 
			
				
					| 
							
							
								 lifulll | 2c6a9e887e | native top_p_sampling (#2901) | 2025-07-22 14:09:59 +08:00 |  | 
			
				
					| 
							
							
								 lizexu123 | 67990e0572 | [Feature] support min_p_sampling (#2872) 
		
	
	
		
			
				
	
				Deploy GitHub Pages / deploy (push) Has been cancelled * Fastdeploy support min_p
* add test_min_p
* fix
* min_p_sampling
* update
* delete vl_gpu_model_runner.py
* fix
* Align usage of min_p with vLLM
* fix
* modified unit test
* fix test_min_sampling
* pre-commit all files
* fix
* fix
* fix
* fix xpu_model_runner.py | 2025-07-20 23:17:59 -07:00 |  | 
			
				
					| 
							
							
								 Zero Rains | 25698d56d1 | polish code with new pre-commit rule (#2923) | 2025-07-19 23:19:27 +08:00 |  | 
			
				
					| 
							
							
								 ming1753 | 1f15ca21e4 | [Feature] support prompt repetition_penalty (#2806) 
		
	
	
		
			
				
	
				Deploy GitHub Pages / deploy (push) Has been cancelled | 2025-07-17 12:05:52 +08:00 |  | 
			
				
					| 
							
							
								 freeliuzc | 7cdd8d290d | [MTP] optimize mtp infer speed (#2840) 
		
	
	
		
			
				
	
				Deploy GitHub Pages / deploy (push) Has been cancelled | 2025-07-14 19:50:22 +08:00 |  | 
			
				
					| 
							
							
								 Sunny-bot1 | 240d6236bc | [Fix]fix top_k_top_p sampling (#2801) 
		
	
	
		
			
				
	
				Deploy GitHub Pages / deploy (push) Has been cancelled * fix topk-topp
* update
* add base_non_truncated | 2025-07-10 22:35:10 +08:00 |  | 
			
				
					| 
							
							
								 Sunny-bot1 | 1e2319cbef | Rename top_p_sampling to top_k_top_p_sampling (#2791) | 2025-07-10 00:09:25 -07:00 |  | 
			
				
					| 
							
							
								 Sunny-bot1 | e45050cae3 | [Feature] support top_k_top_p sampling (#2753) * support top_k_top_p sampling
* fix
* add api param
* add api para
* fix
* fix
* fix
* fix
* fix
* fix
* fix | 2025-07-09 20:58:58 -07:00 |  | 
			
				
					| 
							
							
								 EnflameGCU | d0f4d6ba3a | [GCU] Support gcu platform (#2702) baseline: e7fa57ebaeCo-authored-by: yongqiangma <xing.wo@163.com> | 2025-07-08 13:00:52 +08:00 |  | 
			
				
					| 
							
							
								 liddk1121 | 1b54a2831e | Adapt for iluvatar gpu (#2684) | 2025-07-07 16:53:14 +08:00 |  | 
			
				
					| 
							
							
								 Jiang-Jia-Jun | 92c2cfa2e7 | Sync v2.0 version of code to github repo | 2025-06-29 23:29:37 +00:00 |  | 
			
				
					| 
							
							
								 jiangjiajun | 684703fd72 | [LLM] First commit the llm deployment code | 2025-06-09 19:20:15 +08:00 |  |