This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2025-10-05 08:37:06 +08:00
Code
Issues
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
b4fef2cf2967356c7a4cfaa2ba7411b87a96f6e6
FastDeploy
/
custom_ops
/
gpu_ops
/
sample_kernels
History
yzwu
fbdd6b0663
[Iluvatar GPU] Optimze attention and moe performance (
#3234
)
2025-08-08 10:51:24 +08:00
..
air_top_p_sampling.cu
Adapt for iluvatar gpu (
#2684
)
2025-07-07 16:53:14 +08:00
min_p_sampling_from_probs.cu
[Feature] support min_p_sampling (
#2872
)
2025-07-20 23:17:59 -07:00
rejection_top_p_sampling.cu
[Iluvatar GPU] Optimze attention and moe performance (
#3234
)
2025-08-08 10:51:24 +08:00
sampling.cuh
[Iluvatar GPU] Optimze attention and moe performance (
#3234
)
2025-08-08 10:51:24 +08:00
top_k_renorm_probs.cu
[Feature] support top_k_top_p sampling (
#2753
)
2025-07-09 20:58:58 -07:00
utils.cuh
[Iluvatar GPU] Optimze attention and moe performance (
#3234
)
2025-08-08 10:51:24 +08:00