| 
							
							
								 littledgg | 59071268b6 | [Executor] Move forward_meta.py to fastdeploy/model_executor (#2774) * Use PEP 563 in attention.py and fix conflict
* merge commit
* Change what was left out last time | 2025-07-10 20:36:51 +08:00 |  | 
			
				
					| 
							
							
								 lifulll | 1f28bdf994 | dcu adapter ernie45t (#2756) Co-authored-by: lifu <lifu@sugon.com>
Co-authored-by: yongqiangma <xing.wo@163.com> | 2025-07-09 18:56:27 +08:00 |  | 
			
				
					| 
							
							
								 yulangz | be21ef5047 | [XPU] Supports BF16 for ERNIE-4.5-21B-A3B and ERNIE-4.5-0.3B (#2765) * fix no quant xpu moe
* change dir of xpu moe weight only | 2025-07-09 15:57:51 +08:00 |  | 
			
				
					| 
							
							
								 EnflameGCU | d0f4d6ba3a | [GCU] Support gcu platform (#2702) baseline: e7fa57ebaeCo-authored-by: yongqiangma <xing.wo@163.com> | 2025-07-08 13:00:52 +08:00 |  | 
			
				
					| 
							
							
								 Jiang-Jia-Jun | 05c670e593 | [Sync] Update to latest code (#2679) * [Sync] Update to latest code
* Add new code files
* Add new code files
* update code
* Try to fix build.sh
* Try to fix build.sh
* Update code
* Update requirements.txt
* Update code
---------
Co-authored-by: Jiang-Jia-Jun <jiangjiajun@baidu.com> | 2025-07-03 15:43:53 +08:00 |  | 
			
				
					| 
							
							
								 Jiang-Jia-Jun | 92c2cfa2e7 | Sync v2.0 version of code to github repo | 2025-06-29 23:29:37 +00:00 |  | 
			
				
					| 
							
							
								 jiangjiajun | 684703fd72 | [LLM] First commit the llm deployment code | 2025-06-09 19:20:15 +08:00 |  |