This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2025-10-06 00:57:33 +08:00
Code
Issues
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
7b596d08776eae9be9d8f40e8e8b6871d49e1593
FastDeploy
/
custom_ops
/
iluvatar_ops
History
yzwu
fbdd6b0663
[Iluvatar GPU] Optimze attention and moe performance (
#3234
)
2025-08-08 10:51:24 +08:00
..
runtime
[Iluvatar GPU] Optimze attention and moe performance (
#3234
)
2025-08-08 10:51:24 +08:00
fused_moe_helper.h
Adapt for iluvatar gpu (
#2684
)
2025-07-07 16:53:14 +08:00
fused_moe_imp_op.h
Adapt for iluvatar gpu (
#2684
)
2025-07-07 16:53:14 +08:00
fused_moe_op.h
Adapt for iluvatar gpu (
#2684
)
2025-07-07 16:53:14 +08:00
moe_dispatch.cu
Adapt for iluvatar gpu (
#2684
)
2025-07-07 16:53:14 +08:00
moe_reduce.cu
refactor rl get_name_mappings_to_training (
#2847
)
2025-07-15 07:31:42 -07:00
paged_attn.cu
[Iluvatar GPU] Optimze attention and moe performance (
#3234
)
2025-08-08 10:51:24 +08:00
w8a16_group_gemm.cu
[Iluvatar GPU] Optimze attention and moe performance (
#3234
)
2025-08-08 10:51:24 +08:00