FastDeploy

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-10-08 01:50:27 +08:00

Files

SuperNova 805f29a06c [Feature] refactor metax_gpu attention and moe and remove some useless code (#3688 )

Co-authored-by: yongqiangma <xing.wo@163.com>

2025-09-12 14:40:25 +08:00

__init__.py

2025-08-13 11:11:54 +08:00

flash_attention_interface.py

2025-08-13 11:11:54 +08:00

flash_attn_backend.py

2025-09-12 14:40:25 +08:00