| 
							
							
								 Zero Rains | 42af0b4b64 | [V1 Loader] Support DeepSeekV3(bf16) (#3294) * Support new loader for DeepSeekV3(bf16)
* update paddle version
* remove useless attr | 2025-08-11 13:39:28 +08:00 |  | 
			
				
					| 
							
							
								 bukejiyu | 20839abccf | qwen3_moe (#3084) | 2025-08-06 14:45:27 +08:00 |  | 
			
				
					| 
							
							
								 bukejiyu | db698bda01 | qwen loader (#3057) | 2025-07-30 19:09:38 +08:00 |  | 
			
				
					| 
							
							
								 Zero Rains | 25698d56d1 | polish code with new pre-commit rule (#2923) | 2025-07-19 23:19:27 +08:00 |  | 
			
				
					| 
							
							
								 Yuanle Liu | 61b3997b85 | refactor rl get_name_mappings_to_training (#2847) 
		
	
	
		
			
				
	
				Deploy GitHub Pages / deploy (push) Has been cancelled * refactor rl get_name_mappings_to_training
* fix tp>1
* change variable name(ffn1->up_gate_proj/ffn2->down_proj)
* change variable name(linear_weight->weight/linear_bias->bias)
* add rl names mapping for vl
* fix ernie 0.3B error
* fix develop code
* fix | 2025-07-15 07:31:42 -07:00 |  | 
			
				
					| 
							
							
								 bukejiyu | bad53c6b6e | [vl]remove duplicated load logic (#2744) 
		
	
	
		
			
				
	
				Deploy GitHub Pages / deploy (push) Has been cancelled | 2025-07-13 07:36:26 +08:00 |  | 
			
				
					| 
							
							
								 liddk1121 | 1b54a2831e | Adapt for iluvatar gpu (#2684) | 2025-07-07 16:53:14 +08:00 |  | 
			
				
					| 
							
							
								 Jiang-Jia-Jun | 05c670e593 | [Sync] Update to latest code (#2679) * [Sync] Update to latest code
* Add new code files
* Add new code files
* update code
* Try to fix build.sh
* Try to fix build.sh
* Update code
* Update requirements.txt
* Update code
---------
Co-authored-by: Jiang-Jia-Jun <jiangjiajun@baidu.com> | 2025-07-03 15:43:53 +08:00 |  | 
			
				
					| 
							
							
								 Jiang-Jia-Jun | 92c2cfa2e7 | Sync v2.0 version of code to github repo | 2025-06-29 23:29:37 +00:00 |  | 
			
				
					| 
							
							
								 jiangjiajun | 684703fd72 | [LLM] First commit the llm deployment code | 2025-06-09 19:20:15 +08:00 |  |