This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2025-10-05 16:48:03 +08:00
Code
Issues
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
7b596d08776eae9be9d8f40e8e8b6871d49e1593
FastDeploy
/
custom_ops
History
Sunny-bot1
2e7831185f
Some checks failed
Deploy GitHub Pages / deploy (push)
Has been cancelled
Details
[Optimize]Add norm_weights feature for topk_gating_softmax (
#3372
)
2025-08-14 15:05:23 +08:00
..
cpu_ops
Sync v2.0 version of code to github repo
2025-06-29 23:29:37 +00:00
gpu_ops
[Optimize]Add norm_weights feature for topk_gating_softmax (
#3372
)
2025-08-14 15:05:23 +08:00
iluvatar_ops
[Iluvatar GPU] Optimze attention and moe performance (
#3234
)
2025-08-08 10:51:24 +08:00
utils
[New Feature] Support W4Afp8 MoE GroupGemm (
#3171
)
2025-08-06 10:34:05 +08:00
xpu_ops
[XPU] Support kvblock centralized management (
#3017
)
2025-07-29 10:40:55 +08:00
0001-DeepGEMM-95e81b3.patch
[feat] support fa3 backend for pd disaggregated (
#2695
)
2025-07-03 22:33:27 +08:00
MANIFEST.in
[LLM] First commit the llm deployment code
2025-06-09 19:20:15 +08:00
setup_ops_base.py
polish code with new pre-commit rule (
#2923
)
2025-07-19 23:19:27 +08:00
setup_ops_cpu.py
polish code with new pre-commit rule (
#2923
)
2025-07-19 23:19:27 +08:00
setup_ops.py
[MetaxGPU] Support FastDeploy on metax gpu (
#3241
)
2025-08-13 11:11:54 +08:00