周周周
|
aa76085d1f
|
[Attention] remove cum_offsets from atten, and use cu_seqlens_q (#2870)
|
2025-07-16 20:10:57 +08:00 |
|
Yuanle Liu
|
61b3997b85
|
refactor rl get_name_mappings_to_training (#2847)
* refactor rl get_name_mappings_to_training
* fix tp>1
* change variable names (ffn1 -> up_gate_proj, ffn2 -> down_proj)
* change variable names (linear_weight -> weight, linear_bias -> bias)
* add rl names mapping for vl
* fix ernie 0.3B error
* fix develop code
* fix
|
2025-07-15 07:31:42 -07:00 |
|
freeliuzc
|
7cdd8d290d
|
[MTP] optimize mtp infer speed (#2840)
|
2025-07-14 19:50:22 +08:00 |
|
zhink
|
b89180f1cd
|
[Feature] support custom all-reduce (#2758)
* add vllm adapted
|
2025-07-09 16:00:27 +08:00 |
|
RichardWooSJTU
|
fee544e808
|
fix ep prefill (#2762)
|
2025-07-09 14:03:05 +08:00 |
|
ming1753
|
1eb8ea7328
|
[Bug fix] fix compile bug when sm < 89 (#2738)
|
2025-07-08 11:24:52 +08:00 |
|
ming1753
|
ef6649a577
|
[Optimize] Optimize tensorwise fp8 performance (#2729)
|
2025-07-07 20:06:28 +08:00 |
|
Jiang-Jia-Jun
|
05c670e593
|
[Sync] Update to latest code (#2679)
* Add new code files
* update code
* Try to fix build.sh
* Update code
* Update requirements.txt
* Update code
---------
Co-authored-by: Jiang-Jia-Jun <jiangjiajun@baidu.com>
|
2025-07-03 15:43:53 +08:00 |
|
Jiang-Jia-Jun
|
92c2cfa2e7
|
Sync v2.0 version of code to github repo
|
2025-06-29 23:29:37 +00:00 |
|