apps / FastDeploy
Mirror of https://github.com/PaddlePaddle/FastDeploy.git, synced 2025-12-24 13:28:13 +08:00
a52aea073ce9ec1eda445dfadd1525e302865211
FastDeploy / custom_ops
周周周 a36d60aa18 [FIX BUG] fix bug in TP in permute_x_fp8_kernel (#5350)
* commit * commit * commit * commit * commit * commit
2025-12-03 05:17:37 -08:00
| Name | Last commit | Date |
| --- | --- | --- |
| cpu_ops | c++ code format (#4527) | 2025-10-22 17:59:50 +08:00 |
| gpu_ops | [FIX BUG] fix bug in TP in permute_x_fp8_kernel (#5350) | 2025-12-03 05:17:37 -08:00 |
| iluvatar_ops | c++ code format (#4527) | 2025-10-22 17:59:50 +08:00 |
| metax_ops | [Metax] optimize cutlass moe and flash attention backend (#5128) | 2025-11-20 16:12:35 +08:00 |
| third_party | … | |
| utils | [Quantization] Support w4afp8 MoE dynamic quantization (#5282) | 2025-12-02 18:56:16 +08:00 |
| xpu_ops | [XPU]add enable_logprob (#5279) | 2025-12-02 15:32:28 +08:00 |
| 0001-DeepGEMM-95e81b3.patch | [OP]Remove extra H2D in DeepGemm (#5262) | 2025-11-28 14:23:44 +08:00 |
| MANIFEST.in | … | |
| setup_ops_cpu.py | … | |
| setup_ops.py | [Feature] Support noaux for eplb (#5143) | 2025-11-21 14:10:32 +08:00 |