YuanRisheng 
							
						 
					 
					
						
						
							
						
						c389a4013c 
					 
					
						
						
							
							Unify server-side and model-side Config(Part-5) ( #3497 )  
						
						... 
						
						
	
		
			
	 
	
	
		
	
	
		
			
				
	CE Compile Job / ce_job_pre_check (push) Has been cancelled 
				
			 
		
			
				
	CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled 
				
			 
		
			
				
	CE Compile Job / FD-Clone-Linux (push) Has been cancelled 
				
			 
		
			
				
	CE Compile Job / Show Code Archive Output (push) Has been cancelled 
				
			 
		
			
				
	CE Compile Job / BUILD_SM8090 (push) Has been cancelled 
				
			 
		
			
				
	CE Compile Job / BUILD_SM8689 (push) Has been cancelled 
				
			 
		
			
				
	CE Compile Job / CE_UPLOAD (push) Has been cancelled 
				
			 
		
			
				
	Deploy GitHub Pages / deploy (push) Has been cancelled 
				
			 
		
			
				
	Publish Job / publish_pre_check (push) Has been cancelled 
				
			 
		
			
				
	Publish Job / print_publish_pre_check_outputs (push) Has been cancelled 
				
			 
		
			
				
	Publish Job / FD-Clone-Linux (push) Has been cancelled 
				
			 
		
			
				
	Publish Job / Show Code Archive Output (push) Has been cancelled 
				
			 
		
			
				
	Publish Job / BUILD_SM8090 (push) Has been cancelled 
				
			 
		
			
				
	Publish Job / BUILD_SM8689 (push) Has been cancelled 
				
			 
		
			
				
	Publish Job / PADDLE_PYPI_UPLOAD_8090 (push) Has been cancelled 
				
			 
		
			
				
	Publish Job / PADDLE_PYPI_UPLOAD_8689 (push) Has been cancelled 
				
			 
		
			
				
	Publish Job / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled 
				
			 
		
			
				
	Publish Job / Run FastDeploy LogProb Tests (push) Has been cancelled 
				
			 
		
			
				
	Publish Job / Extracted partial CE model tasks to run in CI. (push) Has been cancelled 
				
			 
		
			
				
	Publish Job / Run Base Tests (push) Has been cancelled 
				
			 
		
			
				
	Publish Job / Run Accuracy Tests (push) Has been cancelled 
				
			 
		
		
	 
 
	 
						
						* move config
* fix xpu
* fix
* fix vl
* fix vl
* fix unitest
* fix args
* add unitest
* fix test 
						
						
					 
					
						2025-08-21 19:00:21 +08:00 
						 
				 
			
				
					
						
							
							
								Zero Rains 
							
						 
					 
					
						
						
							
						
						30b3f2dc07 
					 
					
						
						
							
							[BugFix][V1 Loader] fix the bug in creat weight for block_wise_fp8 ( #3486 )  
						
						
	
		
			
	 
	
	
		
	
	
		
			
				
	Deploy GitHub Pages / deploy (push) Has been cancelled 
				
			 
		
		
	 
 
	 
						
						
					 
					
						2025-08-20 05:52:54 -07:00 
						 
				 
			
				
					
						
							
							
								Ryan 
							
						 
					 
					
						
						
							
						
						bcdfc1d6b9 
					 
					
						
						
							
							Add custom op declaration for all_reduce ( #3473 )  
						
						... 
						
						
						
						* add custom op declaration
* roll back try except 
						
						
					 
					
						2025-08-20 20:29:58 +08:00 
						 
				 
			
				
					
						
							
							
								kevin 
							
						 
					 
					
						
						
							
						
						67298cf4c0 
					 
					
						
						
							
							add error traceback info ( #3419 )  
						
						... 
						
						
	
		
			
	 
	
	
		
	
	
		
			
				
	Deploy GitHub Pages / deploy (push) Has been cancelled 
				
			 
		
		
	 
 
	 
						
						* add error traceback info
* update error msg
* update code
---------
Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com > 
						
						
					 
					
						2025-08-19 19:32:04 +08:00 
						 
				 
			
				
					
						
							
							
								Zero Rains 
							
						 
					 
					
						
						
							
						
						fef447e350 
					 
					
						
						
							
							[V1 Loader] Support MOE parameters create and load for DeepGemm and marlin backend ( #3447 )  
						
						... 
						
						
						
						* support deepgemm backend
* support marlin backend
* remove print
* fix process_prequanted_weights 
						
						
					 
					
						2025-08-19 14:15:53 +08:00 
						 
				 
			
				
					
						
							
							
								Zero Rains 
							
						 
					 
					
						
						
							
						
						8b12c80f90 
					 
					
						
						
							
							[FixBug] compute early stopping with real batch size ( #3418 )  
						
						... 
						
						
						
						* [FixBug] compute early stopping with real batch size
* update
* fix test_sampler 
						
						
					 
					
						2025-08-18 22:09:21 -07:00 
						 
				 
			
				
					
						
							
							
								AIbin 
							
						 
					 
					
						
						
							
						
						beec24fd89 
					 
					
						
						
							
							【Inference Optimize】DeepSeek-v3 model inference performance optimization ( #3455 )  
						
						... 
						
						
						
						* DSK_OPT_01
* update FA3 
						
						
					 
					
						2025-08-19 10:42:42 +08:00 
						 
				 
			
				
					
						
							
							
								lizexu123 
							
						 
					 
					
						
						
							
						
						32b39620bc 
					 
					
						
						
							
							[Code Simplification] remove cum_offsets ( #3410 )  
						
						
	
		
			
	 
	
	
		
	
	
		
			
				
	Deploy GitHub Pages / deploy (push) Has been cancelled 
				
			 
		
			
				
	Publish Job / publish_pre_check (push) Has been cancelled 
				
			 
		
			
				
	Publish Job / print_publish_pre_check_outputs (push) Has been cancelled 
				
			 
		
			
				
	Publish Job / FD-Clone-Linux (push) Has been cancelled 
				
			 
		
			
				
	Publish Job / Show Code Archive Output (push) Has been cancelled 
				
			 
		
			
				
	Publish Job / BUILD_SM8090 (push) Has been cancelled 
				
			 
		
			
				
	Publish Job / BUILD_SM8689 (push) Has been cancelled 
				
			 
		
			
				
	Publish Job / PADDLE_PYPI_UPLOAD_8090 (push) Has been cancelled 
				
			 
		
			
				
	Publish Job / PADDLE_PYPI_UPLOAD_8689 (push) Has been cancelled 
				
			 
		
			
				
	Publish Job / Run FastDeploy Unit Tests and Coverage (push) Has been cancelled 
				
			 
		
			
				
	Publish Job / Run FastDeploy LogProb Tests (push) Has been cancelled 
				
			 
		
			
				
	Publish Job / Extracted partial CE model tasks to run in CI. (push) Has been cancelled 
				
			 
		
			
				
	Publish Job / Run Base Tests (push) Has been cancelled 
				
			 
		
			
				
	Publish Job / Run Accuracy Tests (push) Has been cancelled 
				
			 
		
		
	 
 
	 
						
						
					 
					
						2025-08-18 20:21:25 +08:00 
						 
				 
			
				
					
						
							
							
								Jundong Liu 
							
						 
					 
					
						
						
							
						
						70ee910cd5 
					 
					
						
						
							
							[Excutor] Change cudagraph hashkey from batch size to num_tokens  ( #3454 )  
						
						
						
						
					 
					
						2025-08-18 16:16:48 +08:00 
						 
				 
			
				
					
						
							
							
								Jundong Liu 
							
						 
					 
					
						
						
							
						
						ea4a3b479c 
					 
					
						
						
							
							[Excutor] Increase buffer size to prevent address corruption; add forward metadata debug tool ( #3404 )  
						
						... 
						
						
						
						* 修复buffer申请不够大,增加打印forwardmetadata的工具
* fix mistake
* Make CPU tensor in CPUPlace
* Add test about forward_meta_str and Add unitest_requirement
---------
Co-authored-by: RAM <gstian5555@outlook.com > 
						
						
					 
					
						2025-08-18 16:14:09 +08:00 
						 
				 
			
				
					
						
							
							
								chen 
							
						 
					 
					
						
						
							
						
						5585cf7aa5 
					 
					
						
						
							
							fix mtp_rej_topp input ( #3450 )  
						
						
						
						
					 
					
						2025-08-18 16:12:42 +08:00 
						 
				 
			
				
					
						
							
							
								chen 
							
						 
					 
					
						
						
							
						
						e88f5552db 
					 
					
						
						
							
							fix cpu __ini__.py ( #3448 )  
						
						
	
		
			
	 
	
	
		
	
	
		
			
				
	Deploy GitHub Pages / deploy (push) Has been cancelled 
				
			 
		
		
	 
 
	 
						
						
					 
					
						2025-08-17 12:38:54 +08:00 
						 
				 
			
				
					
						
							
							
								chen 
							
						 
					 
					
						
						
							
						
						f0f00a6025 
					 
					
						
						
							
							[OPs] Universal optimization and Fix early_stop cuda 700 ( #3375 )  
						
						... 
						
						
	
		
			
	 
	
	
		
	
	
		
			
				
	Deploy GitHub Pages / deploy (push) Has been cancelled 
				
			 
		
		
	 
 
	 
						
						* delete nonzero
* delete setup_ops_base.py
* check if
* check gcp infer_seed.cpu()
* fix repetition_early_stopper_kernel cuda 700 
						
						
					 
					
						2025-08-14 22:40:44 +08:00 
						 
				 
			
				
					
						
							
							
								YuanRisheng 
							
						 
					 
					
						
						
							
						
						09c979f3dd 
					 
					
						
						
							
							[V1 Loader] Support Ernie text(moe and dense) ( #3110 )  
						
						... 
						
						
						
						* new loader support 0.3B
* fix weight
* support parallel load
* support parallel load
* fix slice
* support moe
* delete code
* perfect code
* perfect code 
						
						
					 
					
						2025-08-14 20:25:28 +08:00 
						 
				 
			
				
					
						
							
							
								lzy 
							
						 
					 
					
						
						
							
						
						1e06b9fa6d 
					 
					
						
						
							
							make append_attn supports mask_offset ( #3138 )  
						
						... 
						
						
						
						* make append_attn supports mask_offset
* add unittest 
						
						
					 
					
						2025-08-14 03:40:55 -07:00 
						 
				 
			
				
					
						
							
							
								lizexu123 
							
						 
					 
					
						
						
							
						
						7b596d0877 
					 
					
						
						
							
							[BugFix] fix real_bsz in ep ( #3366 )  
						
						... 
						
						
						
						* Your commit message here
* fix ep
* delete cuda_graph 
						
						
					 
					
						2025-08-14 17:31:19 +08:00 
						 
				 
			
				
					
						
							
							
								Jiang-Jia-Jun 
							
						 
					 
					
						
						
							
						
						666ab65a51 
					 
					
						
						
							
							[Polish Code] Remove useless notes  
						
						
						
						
					 
					
						2025-08-14 14:04:52 +08:00 
						 
				 
			
				
					
						
							
							
								Zero Rains 
							
						 
					 
					
						
						
							
						
						be94bdd0b0 
					 
					
						
						
							
							[Loader V1] modify layername for DeepSeekV3 ( #3336 )  
						
						... 
						
						
						
						Co-authored-by: Yuanle Liu <yuanlehome@163.com >
Co-authored-by: YUNSHEN XIE <1084314248@qq.com > 
						
						
					 
					
						2025-08-13 15:47:06 +08:00 
						 
				 
			
				
					
						
							
							
								EnflameGCU 
							
						 
					 
					
						
						
							
						
						d1a92e3e17 
					 
					
						
						
							
							[GCU] Enable gcu CI ( #3190 )  
						
						... 
						
						
	
		
			
	 
	
	
		
	
	
		
			
				
	Deploy GitHub Pages / deploy (push) Has been cancelled 
				
			 
		
		
	 
 
	 
						
						* [GCU] Update to the latest version
* [GCU] Enable CI 
						
						
					 
					
						2025-08-13 11:48:24 +08:00 
						 
				 
			
				
					
						
							
							
								yzwu 
							
						 
					 
					
						
						
							
						
						ce9180241e 
					 
					
						
						
							
							[Iluvatar GPU] Modify the names of some variables ( #3273 )  
						
						
						
						
					 
					
						2025-08-13 11:38:02 +08:00 
						 
				 
			
				
					
						
							
							
								Kane2011 
							
						 
					 
					
						
						
							
						
						b4fef2cf29 
					 
					
						
						
							
							[MetaxGPU] Support FastDeploy on metax gpu  ( #3241 )  
						
						... 
						
						
						
						* [MetaxGPU] Support FastDeploy on metax gpu
* Update metax_worker.py
1. change worker log;
2. remove custom allreduce, adapt it later;
3. remove cuda graph;
* Update __init__.py
1. remove metax's key work comment
* Update __init__.py
1. remove metax's key word comment;
2. add fused_moe_kernel_paddle import
---------
Co-authored-by: yongqiangma <xing.wo@163.com > 
						
						
					 
					
						2025-08-13 11:11:54 +08:00 
						 
				 
			
				
					
						
							
							
								RichardWooSJTU 
							
						 
					 
					
						
						
							
						
						283da92bfa 
					 
					
						
						
							
							fix ep lm head ( #3244 )  
						
						... 
						
						
						
						Co-authored-by: yuanxiaolan <yuanxiaolan01@baidu.com > 
						
						
					 
					
						2025-08-12 15:38:28 +08:00 
						 
				 
			
				
					
						
							
							
								Jiang-Jia-Jun 
							
						 
					 
					
						
						
							
						
						c56c99837a 
					 
					
						
						
							
							Revert "[BugFix] num_seqs ( #3291 )" ( #3316 )  
						
						... 
						
						
						
						This reverts commit e0aeac58e1 
						
						
					 
					
						2025-08-11 16:16:51 +08:00 
						 
				 
			
				
					
						
							
							
								Yuanle Liu 
							
						 
					 
					
						
						
							
						
						9571c458f0 
					 
					
						
						
							
							enhance eos_tokens ( #3274 )  
						
						... 
						
						
						
						* enhance eos_tokens
* update
* update 
						
						
					 
					
						2025-08-11 14:47:52 +08:00 
						 
				 
			
				
					
						
							
							
								Zero Rains 
							
						 
					 
					
						
						
							
						
						42af0b4b64 
					 
					
						
						
							
							[V1 Loader] Support DeepSeekV3(bf16) ( #3294 )  
						
						... 
						
						
						
						* Support new loader for DeepSeekV3(bf16)
* update paddle version
* remove useless attr 
						
						
					 
					
						2025-08-11 13:39:28 +08:00 
						 
				 
			
				
					
						
							
							
								lizexu123 
							
						 
					 
					
						
						
							
						
						e0aeac58e1 
					 
					
						
						
							
							[BugFix] num_seqs ( #3291 )  
						
						... 
						
						
						
						* fix num_seqs
* merge develop 
						
						
					 
					
						2025-08-11 13:38:55 +08:00 
						 
				 
			
				
					
						
							
							
								gaoziyuan 
							
						 
					 
					
						
						
							
						
						a799d14df1 
					 
					
						
						
							
							[Bugfix] Fix model accuracy in some ops ( #3231 )  
						
						... 
						
						
						
						* fix noaux_tc op
* fix
* update
* fix qk norm
* fix linear for prequant loader
* test
* fix
* fix
* rm some print
* fix noaux_tc op
* test
* Fix the confused enable_early_stop when only set early_stop_config (#3214 )
* fix the confused early_stop_config when only set early_stop_config
* pre-commit
* write a general method
* Add ci case for min token and max token (#3229 )
Co-authored-by: xujing43 <xujing43@baidu.com >
* add some evil cases (#3240 )
* add repitation early stop cases
* add repitation early stop cases
* add bad cases
* add bad cases
* add evil cases
* qwen3_moe (#3084 )
* [Feature] support seed parameter (#3161 )
* support seed
* fix
* add SamplingMetadata seed test
* The next_tokens values are inconsistent!
* add air and rejection seed test
* fix
* add SamplingParams seed test
* fix seed=0
* Default to defualt
* fix
* fix args_utils
* fix review
* fix review
* fix
* fix
* add xpu,gcu,iluvatar support seed
* fix
* 【Fix Bug】 修复 fa3 支持集中式bug (#3235 )
* fix fa3 集中式bug
* 增加qknorm参数
* fix qk norm
* fix
* update
* fix linear for prequant loader
* fix
* fix
* rm some print
* fix
* fix moe init weight&scale
* fix moe init weight&scale
---------
Co-authored-by: bukejiyu <395822456@qq.com >
Co-authored-by: yuanxiaolan <yuanxiaolan01@baidu.com >
Co-authored-by: Zero Rains <linjunlu@zerorains.top >
Co-authored-by: xjkmfa <108254620+xjkmfa@users.noreply.github.com >
Co-authored-by: xujing43 <xujing43@baidu.com >
Co-authored-by: Divano <dddivano@outlook.com >
Co-authored-by: bukejiyu <52310069+bukejiyu@users.noreply.github.com >
Co-authored-by: lizexu123 <39205361+lizexu123@users.noreply.github.com >
Co-authored-by: yangjianfengo1 <125249383+yangjianfengo1@users.noreply.github.com >
Co-authored-by: qingqing01 <dangqingqing@baidu.com > 
						
						
					 
					
						2025-08-08 17:30:37 +08:00 
						 
				 
			
				
					
						
							
							
								Zero Rains 
							
						 
					 
					
						
						
							
						
						ce1f353c70 
					 
					
						
						
							
							Move create_parameters to __init__ in FuseMOE for CultassBackend and TritonBackend ( #3148 )  
						
						... 
						
						
						
						* w4a8 bug
* fix w4a8 bug
* remove code
* modify the triton backend
* fix ep
* fix the bug with tensor_wise_fp8 in triton backend
* fix the RL
* fix bug by merge
* fix the bug in w4a8
* fix the tensor_wise_fp8 bug
* fix RL 
						
						
					 
					
						2025-08-08 15:55:47 +08:00 
						 
				 
			
				
					
						
							
							
								freeliuzc 
							
						 
					 
					
						
						
							
						
						71267840f7 
					 
					
						
						
							
							【Fix】fix mtp bug ( #3139 )  
						
						
						
						
					 
					
						2025-08-08 13:30:12 +08:00 
						 
				 
			
				
					
						
							
							
								bukejiyu 
							
						 
					 
					
						
						
							
						
						b76b17fc1b 
					 
					
						
						
							
							qwen3 0.3B fix ( #3255 )  
						
						
	
		
			
	 
	
	
		
	
	
		
			
				
	Deploy GitHub Pages / deploy (push) Has been cancelled 
				
			 
		
		
	 
 
	 
						
						
					 
					
						2025-08-08 11:35:40 +08:00 
						 
				 
			
				
					
						
							
							
								yzwu 
							
						 
					 
					
						
						
							
						
						fbdd6b0663 
					 
					
						
						
							
							[Iluvatar GPU] Optimze attention and moe performance ( #3234 )  
						
						
						
						
					 
					
						2025-08-08 10:51:24 +08:00 
						 
				 
			
				
					
						
							
							
								bukejiyu 
							
						 
					 
					
						
						
							
						
						37569cca86 
					 
					
						
						
							
							[feat]add fast_weights_iterator ( #3258 )  
						
						... 
						
						
	
		
			
	 
	
	
		
	
	
		
			
				
	Deploy GitHub Pages / deploy (push) Has been cancelled 
				
			 
		
		
	 
 
	 
						
						* add fast_weights_iterator
* update
* update 
						
						
					 
					
						2025-08-07 22:36:46 +08:00 
						 
				 
			
				
					
						
							
							
								bukejiyu 
							
						 
					 
					
						
						
							
						
						9408e667a5 
					 
					
						
						
							
							[bugfix]fix blockwisefp8 and all_reduce ( #3243 )  
						
						... 
						
						
						
						* fix
* update
* fix linear for prequant loader 
						
						
					 
					
						2025-08-06 23:54:33 +08:00 
						 
				 
			
				
					
						
							
							
								yangjianfengo1 
							
						 
					 
					
						
						
							
						
						3a15e0c53e 
					 
					
						
						
							
							【Fix Bug】 修复 fa3 支持集中式bug ( #3235 )  
						
						... 
						
						
	
		
			
	 
	
	
		
	
	
		
			
				
	Deploy GitHub Pages / deploy (push) Has been cancelled 
				
			 
		
		
	 
 
	 
						
						* fix fa3 集中式bug
* 增加qknorm参数 
						
						
					 
					
						2025-08-06 16:24:27 +08:00 
						 
				 
			
				
					
						
							
							
								lizexu123 
							
						 
					 
					
						
						
							
						
						afff4d37ea 
					 
					
						
						
							
							[Feature] support seed parameter ( #3161 )  
						
						... 
						
						
						
						* support seed
* fix
* add SamplingMetadata seed test
* The next_tokens values are inconsistent!
* add air and rejection seed test
* fix
* add SamplingParams seed test
* fix seed=0
* Default to defualt
* fix
* fix args_utils
* fix review
* fix review
* fix
* fix
* add xpu,gcu,iluvatar support seed
* fix 
						
						
					 
					
						2025-08-06 15:20:47 +08:00 
						 
				 
			
				
					
						
							
							
								bukejiyu 
							
						 
					 
					
						
						
							
						
						20839abccf 
					 
					
						
						
							
							qwen3_moe ( #3084 )  
						
						
						
						
					 
					
						2025-08-06 14:45:27 +08:00 
						 
				 
			
				
					
						
							
							
								Yuan Xiaolan 
							
						 
					 
					
						
						
							
						
						7ce00e597c 
					 
					
						
						
							
							support qk norm ( #3145 )  
						
						
						
						
					 
					
						2025-08-05 16:46:14 +08:00 
						 
				 
			
				
					
						
							
							
								RAM 
							
						 
					 
					
						
						
							
						
						4a10e29804 
					 
					
						
						
							
							fix mla attention backend ( #3176 )  
						
						
						
						
					 
					
						2025-08-05 16:43:15 +08:00 
						 
				 
			
				
					
						
							
							
								Yuan Xiaolan 
							
						 
					 
					
						
						
							
						
						af543b7f0f 
					 
					
						
						
							
							revise get_moe_scores ( #3164 )  
						
						
						
						
					 
					
						2025-08-05 16:43:07 +08:00 
						 
				 
			
				
					
						
							
							
								RichardWooSJTU 
							
						 
					 
					
						
						
							
						
						1e9a8e8cef 
					 
					
						
						
							
							fix lm head bias ( #3185 )  
						
						... 
						
						
						
						Co-authored-by: yuanxiaolan <yuanxiaolan01@baidu.com > 
						
						
					 
					
						2025-08-05 15:40:24 +08:00 
						 
				 
			
				
					
						
							
							
								RichardWooSJTU 
							
						 
					 
					
						
						
							
						
						f5c64a074c 
					 
					
						
						
							
							[EP] Refactor DeepEP Engine Organization for Mixed Mode & Buffer Management Optimization  ( #3182 )  
						
						... 
						
						
						
						* Add support for mixed-ep across multi nodes
* code refine
---------
Co-authored-by: yuanxiaolan <yuanxiaolan01@baidu.com > 
						
						
					 
					
						2025-08-05 15:40:11 +08:00 
						 
				 
			
				
					
						
							
							
								lizhenyun01 
							
						 
					 
					
						
						
							
						
						fe540f6caa 
					 
					
						
						
							
							[plugin] Custom model_runner/model support ( #3186 )  
						
						... 
						
						
						
						* support custom model&&model_runner
* fix merge
* add test && update doc
* fix codestyle
* fix unittest
* load model in rl 
						
						
					 
					
						2025-08-04 18:52:39 -07:00 
						 
				 
			
				
					
						
							
							
								Yuan Xiaolan 
							
						 
					 
					
						
						
							
						
						1f8289e106 
					 
					
						
						
							
							fix expertwise_scale ( #3181 )  
						
						
						
						
					 
					
						2025-08-04 20:06:15 +08:00 
						 
				 
			
				
					
						
							
							
								gaoziyuan 
							
						 
					 
					
						
						
							
						
						4021d66ea5 
					 
					
						
						
							
							【Feature】add fd plugins && rm model_classes ( #3123 )  
						
						... 
						
						
						
						* add fd plugins && rm model_classed
* fix reviews
* add docs
* fix
* fix unitest ci 
						
						
					 
					
						2025-08-03 19:53:20 -07:00 
						 
				 
			
				
					
						
							
							
								bukejiyu 
							
						 
					 
					
						
						
							
						
						1582814905 
					 
					
						
						
							
							fix load_pre_sharded_checkpoint ( #3152 )  
						
						... 
						
						
						
						Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com > 
						
						
					 
					
						2025-08-04 10:44:20 +08:00 
						 
				 
			
				
					
						
							
							
								ApplEOFDiscord 
							
						 
					 
					
						
						
							
						
						b71cbb466d 
					 
					
						
						
							
							[Feature] remove dependency on enable_mm and refine multimodal's code ( #3014 )  
						
						... 
						
						
						
						* remove dependency on enable_mm
* fix codestyle check error
* fix codestyle check error
* update docs
* resolve conflicts on model config
* fix unit test error
* fix code style check error
---------
Co-authored-by: shige <1021937542@qq.com >
Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com > 
						
						
					 
					
						2025-08-01 20:01:18 +08:00 
						 
				 
			
				
					
						
							
							
								yangjianfengo1 
							
						 
					 
					
						
						
							
						
						64d7a3194d 
					 
					
						
						
							
							集中式支持fa3 ( #3112 )  
						
						
						
						
					 
					
						2025-08-01 18:03:36 +08:00 
						 
				 
			
				
					
						
							
							
								Ryan 
							
						 
					 
					
						
						
							
						
						94264bbf60 
					 
					
						
						
							
							[Code Simplification] Refactor Post-processing in VL Model Forward Method ( #2937 )  
						
						... 
						
						
						
						* rm sth useless
* refactor model forward
* mv bool index to kernel 
						
						
					 
					
						2025-08-01 17:28:07 +08:00 
						 
				 
			
				
					
						
							
							
								chen 
							
						 
					 
					
						
						
							
						
						a2f5cc54f8 
					 
					
						
						
							
							moe preprocess op support 160 experts and fused_moe triton kernel name add K ( #3121 )  
						
						
						
						
					 
					
						2025-08-01 10:46:20 +08:00 
						 
				 
			
				
					
						
							
							
								Yuan Xiaolan 
							
						 
					 
					
						
						
							
						
						5f56d289a7 
					 
					
						
						
							
							fix is_permuted ( #3098 )  
						
						... 
						
						
						
						Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com > 
						
						
					 
					
						2025-07-31 19:58:05 +08:00