[Sync] Update to latest code (#2679)

* [Sync] Update to latest code

* Add new code files

* Add new code files

* update code

* Try to fix build.sh

* Try to fix build.sh

* Update code

* Update requirements.txt

* Update code

---------

Co-authored-by: Jiang-Jia-Jun <jiangjiajun@baidu.com>
This commit is contained in:
Jiang-Jia-Jun
2025-07-03 15:43:53 +08:00
committed by GitHub
parent d222248d00
commit 05c670e593
95 changed files with 9916 additions and 1312 deletions

View File

@@ -111,6 +111,8 @@ class Attention(nn.Layer):
k: paddle.Tensor = None,
v: paddle.Tensor = None,
qkv: paddle.Tensor = None,
compressed_kv: paddle.Tensor = None,
k_pe: paddle.Tensor = None,
forward_meta: ForwardMeta = None,
) -> paddle.Tensor:
"""
@@ -120,12 +122,16 @@ class Attention(nn.Layer):
k: the key tensor
v: the value tensor
forward_meta: the forward meta data
compressed_kv: optional compressed key-value cache (for MLA)
k_pe: optional key positional encoding (for MLA)
"""
return forward_meta.attn_backend.forward(
q,
k,
v,
qkv,
compressed_kv,
k_pe,
self,
forward_meta,
)