xiaoxiaohehe001
|
a42fc3f40b
|
[Feature] Support 45tVL EP FP8 Infer. (#2909)
* support_mm_ep_fp8
* support_mm_ep
|
2025-07-18 17:57:15 +08:00 |
|
gaoziyuan
|
6efad14b95
|
support vl ori_vacab_size (#2900)
|
2025-07-18 16:26:14 +08:00 |
|
Yuanle Liu
|
63d6e7ce06
|
fix and refine vl (#2866)
* refine vl config
* delete attn_sep
* fix vl accuracy
|
2025-07-16 05:59:28 -07:00 |
|
Yuanle Liu
|
dda4a9f848
|
rl update (#2861)
|
2025-07-16 00:33:10 -07:00 |
|
Yuanle Liu
|
61b3997b85
|
refactor rl get_name_mappings_to_training (#2847)
Deploy GitHub Pages / deploy (push) Has been cancelled
* refactor rl get_name_mappings_to_training
* fix tp>1
* change variable name(ffn1->up_gate_proj/ffn2->down_proj)
* change variable name(linear_weight->weight/linear_bias->bias)
* add rl names mapping for vl
* fix ernie 0.3B error
* fix develop code
* fix
|
2025-07-15 07:31:42 -07:00 |
|
YuanRisheng
|
4c7b8bc458
|
Simplify the Config code (#2770)
* simplify the code
* fix vl
* delete config
* fix
* perfect code
* fix ci
* fix xpu
* fix xpu
* fix server
* resolve conflict
* fix mtp
* resolve conflict
* fix xpu
* fix xpu
* fix vl
* fix log
* fix qwen moe
* fix qwen moe
* fix qwen moe
|
2025-07-14 19:50:05 +08:00 |
|
bukejiyu
|
bad53c6b6e
|
[vl]remove duplicated load logic (#2744)
Deploy GitHub Pages / deploy (push) Has been cancelled
|
2025-07-13 07:36:26 +08:00 |
|
littledgg
|
59071268b6
|
[Executor] Move forward_meta.py to fastdeploy/model_executor (#2774)
* Use PEP 563 in attention.py and fix conflict
* merge commit
* Change what was left out last time
|
2025-07-10 20:36:51 +08:00 |
|
lizexu123
|
8c660a0dfb
|
[BugFix] fix RMSNorm rms_norm_esp (#2797)
* fix rms
* add vl
* fix
* add vl
* fix
* fix
|
2025-07-10 20:02:24 +08:00 |
|
Ryan
|
b0f525955c
|
[SOT] Remove breakgraph in post processing && fix datatype (#2780)
|
2025-07-10 11:26:00 +08:00 |
|
lifulll
|
1f28bdf994
|
dcu adapter ernie45t (#2756)
Co-authored-by: lifu <lifu@sugon.com>
Co-authored-by: yongqiangma <xing.wo@163.com>
|
2025-07-09 18:56:27 +08:00 |
|
Ryan
|
c4718fd693
|
Enable SOT D2St in Multimodal Model (#2735)
|
2025-07-09 12:26:18 +08:00 |
|
Ryan
|
f72c4de539
|
[SOT] Make custom_op dy&st unified (#2733)
Deploy GitHub Pages / deploy (push) Has been cancelled
* make_custom_op dy&st unified
* add instance judgement
|
2025-07-08 19:21:44 +08:00 |
|
Ryan
|
fefbd65cf8
|
[SOT] Remove BreakGraph with paddle.maximum (#2731)
* rm if with clip
* clip -> maximum
* int64 -> int32
|
2025-07-08 11:44:25 +08:00 |
|
Jiang-Jia-Jun
|
05c670e593
|
[Sync] Update to latest code (#2679)
* [Sync] Update to latest code
* Add new code files
* Add new code files
* update code
* Try to fix build.sh
* Try to fix build.sh
* Update code
* Update requirements.txt
* Update code
---------
Co-authored-by: Jiang-Jia-Jun <jiangjiajun@baidu.com>
|
2025-07-03 15:43:53 +08:00 |
|
Jiang-Jia-Jun
|
92c2cfa2e7
|
Sync v2.0 version of code to github repo
|
2025-06-29 23:29:37 +00:00 |
|