| lzy | 1e06b9fa6d | make append_attn supports mask_offset (#3138) * make append_attn supports mask_offset
* add unittest | 2025-08-14 03:40:55 -07:00 |  | 
			
				
| gaoziyuan | a799d14df1 | [Bugfix] Fix model accuracy in some ops (#3231) * fix noaux_tc op
* fix
* update
* fix qk norm
* fix linear for prequant loader
* test
* fix
* fix
* rm some print
* fix noaux_tc op
* test
* Fix the confused enable_early_stop when only set early_stop_config (#3214)
* fix the confused early_stop_config when only set early_stop_config
* pre-commit
* write a general method
* Add ci case for min token and max token (#3229)
Co-authored-by: xujing43 <xujing43@baidu.com>
* add some evil cases (#3240)
* add repetition early stop cases
* add repetition early stop cases
* add bad cases
* add bad cases
* add evil cases
* qwen3_moe (#3084)
* [Feature] support seed parameter (#3161)
* support seed
* fix
* add SamplingMetadata seed test
* The next_tokens values are inconsistent!
* add air and rejection seed test
* fix
* add SamplingParams seed test
* fix seed=0
* Default to default
* fix
* fix args_utils
* fix review
* fix review
* fix
* fix
* add xpu,gcu,iluvatar support seed
* fix
* [Fix Bug] Fix the fa3 centralized-mode support bug (#3235)
* fix fa3 centralized-mode bug
* add qknorm parameter
* fix qk norm
* fix
* update
* fix linear for prequant loader
* fix
* fix
* rm some print
* fix
* fix moe init weight&scale
* fix moe init weight&scale
---------
Co-authored-by: bukejiyu <395822456@qq.com>
Co-authored-by: yuanxiaolan <yuanxiaolan01@baidu.com>
Co-authored-by: Zero Rains <linjunlu@zerorains.top>
Co-authored-by: xjkmfa <108254620+xjkmfa@users.noreply.github.com>
Co-authored-by: xujing43 <xujing43@baidu.com>
Co-authored-by: Divano <dddivano@outlook.com>
Co-authored-by: bukejiyu <52310069+bukejiyu@users.noreply.github.com>
Co-authored-by: lizexu123 <39205361+lizexu123@users.noreply.github.com>
Co-authored-by: yangjianfengo1 <125249383+yangjianfengo1@users.noreply.github.com>
Co-authored-by: qingqing01 <dangqingqing@baidu.com> | 2025-08-08 17:30:37 +08:00 |  | 
			
				
| Yuan Xiaolan | 7ce00e597c | support qk norm (#3145) | 2025-08-05 16:46:14 +08:00 |  |
			
				
| Jiang-Jia-Jun | 05c670e593 | [Sync] Update to latest code (#2679) * [Sync] Update to latest code
* Add new code files
* Add new code files
* update code
* Try to fix build.sh
* Try to fix build.sh
* Update code
* Update requirements.txt
* Update code
---------
Co-authored-by: Jiang-Jia-Jun <jiangjiajun@baidu.com> | 2025-07-03 15:43:53 +08:00 |  | 
			
				
| Jiang-Jia-Jun | 92c2cfa2e7 | Sync v2.0 version of code to github repo | 2025-06-29 23:29:37 +00:00 |  |
			
				
| jiangjiajun | 684703fd72 | [LLM] First commit the llm deployment code | 2025-06-09 19:20:15 +08:00 |  |