YuanRisheng
|
03b3d6175d
|
fix mtp (#4105)
|
2025-09-15 20:26:07 +08:00 |
|
YuanRisheng
|
808b548761
|
support tmp (#3675)
|
2025-08-28 19:42:32 +08:00 |
|
chen
|
ce9c0917c5
|
[Precision] Support lm_head layer running in float32 (#3597)
CE Compile Job / ce_job_pre_check (push) Has been cancelled
CE Compile Job / print_ce_job_pre_check_outputs (push) Has been cancelled
CE Compile Job / FD-Clone-Linux (push) Has been cancelled
CE Compile Job / Show Code Archive Output (push) Has been cancelled
CE Compile Job / BUILD_SM8090 (push) Has been cancelled
CE Compile Job / BUILD_SM8689 (push) Has been cancelled
CE Compile Job / CE_UPLOAD (push) Has been cancelled
Deploy GitHub Pages / deploy (push) Has been cancelled
* support lm_head fp32 bf16 fp16
* support lm_head fp32 bf16 fp16
* add doc and check code
* lm_head_fp32 specify lm_head as fp32
* code check
* check doc
|
2025-08-27 11:34:53 +08:00 |
|
freeliuzc
|
52eda7fdb3
|
[Feature][MTP]support new speculative decoding method named hybrid mtp with ngram (#3610)
|
2025-08-26 14:29:22 +08:00 |
|
gaoziyuan
|
4021d66ea5
|
【Feature】add fd plugins && rm model_classes (#3123)
* add fd plugins && rm model_classed
* fix reviews
* add docs
* fix
* fix unitest ci
|
2025-08-03 19:53:20 -07:00 |
|
Zero Rains
|
25698d56d1
|
polish code with new pre-commit rule (#2923)
|
2025-07-19 23:19:27 +08:00 |
|
Yuanle Liu
|
61b3997b85
|
refactor rl get_name_mappings_to_training (#2847)
Deploy GitHub Pages / deploy (push) Has been cancelled
* refactor rl get_name_mappings_to_training
* fix tp>1
* change variable name(ffn1->up_gate_proj/ffn2->down_proj)
* change variable name(linear_weight->weight/linear_bias->bias)
* add rl names mapping for vl
* fix ernie 0.3B error
* fix develop code
* fix
|
2025-07-15 07:31:42 -07:00 |
|
YuanRisheng
|
4c7b8bc458
|
Simplify the Config code (#2770)
* simplify the code
* fix vl
* delete config
* fix
* perfect code
* fix ci
* fix xpu
* fix xpu
* fix server
* resolve conflict
* fix mtp
* resolve conflict
* fix xpu
* fix xpu
* fix vl
* fix log
* fix qwen moe
* fix qwen moe
* fix qwen moe
|
2025-07-14 19:50:05 +08:00 |
|
littledgg
|
59071268b6
|
[Executor] Move forward_meta.py to fastdeploy/model_executor (#2774)
* Use PEP 563 in attention.py and fix conflict
* merge commit
* Change what was left out last time
|
2025-07-10 20:36:51 +08:00 |
|
lizexu123
|
8c660a0dfb
|
[BugFix] fix RMSNorm rms_norm_esp (#2797)
* fix rms
* add vl
* fix
* add vl
* fix
* fix
|
2025-07-10 20:02:24 +08:00 |
|
GoldPancake
|
e7fa57ebae
|
Extract eh_proj Layer from ParallelLMHead for MTP to Avoid Weight Transposition Issue (#2707)
Deploy GitHub Pages / deploy (push) Has been cancelled
* fix mtp eh_proj layer
* fix mtp update_cfg function
* fix stringdoc
* simplify class name
|
2025-07-04 14:15:04 +08:00 |
|
Jiang-Jia-Jun
|
05c670e593
|
[Sync] Update to latest code (#2679)
* [Sync] Update to latest code
* Add new code files
* Add new code files
* update code
* Try to fix build.sh
* Try to fix build.sh
* Update code
* Update requirements.txt
* Update code
---------
Co-authored-by: Jiang-Jia-Jun <jiangjiajun@baidu.com>
|
2025-07-03 15:43:53 +08:00 |
|
Jiang-Jia-Jun
|
92c2cfa2e7
|
Sync v2.0 version of code to github repo
|
2025-06-29 23:29:37 +00:00 |
|