FastDeploy/fastdeploy/model_executor/layers at cfa0982aae2c86e5114febb23c4c712bc24d06e2 - FastDeploy - 子说镜像小站

apps/FastDeploy

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-12-24 13:28:13 +08:00

Files

History

liyonghua0910 cfa0982aae [fix] fix ep group all-reduce

2025-09-17 18:05:34 +08:00

..

…

[Feature] refactor metax_gpu attention and moe and remove some useless code (#3688 )

2025-09-12 14:40:25 +08:00

[fix] fix ep group all-reduce

2025-09-17 18:05:34 +08:00

[BugFix]Fix load kv cache quant scale (#4077 )

2025-09-12 17:44:03 +08:00

…

__init__.py

…

activation.py

…

embeddings.py

…

linear.py

…

lm_head.py

…

mtp_linear.py

…

normalization.py

…

rotary_embedding.py

[Feature] refactor metax_gpu attention and moe and remove some useless code (#3688 )

2025-09-12 14:40:25 +08:00

utils.py

…