apps/FastDeploy
Mirror of https://github.com/PaddlePaddle/FastDeploy.git, synced 2025-12-24 13:28:13 +08:00
65425bf8583cfa4a7c2a94d5132c3fa35792d09b
FastDeploy/docs/zh
Latest commit: 17b414c2df by 周周周, "MoE Default use triton's blockwise fp8 in TP Case (#3678)", 2025-08-29 11:07:30 +08:00
Name | Last commit | Date
best_practices | MoE Default use triton's blockwise fp8 in TP Case (#3678) | 2025-08-29 11:07:30 +08:00
features | [Feature] bad words support v1 scheduler and specifiy token ids (#3608) | 2025-08-25 20:14:51 -07:00
get_started | …
online_serving | [Feature] bad words support v1 scheduler and specifiy token ids (#3608) | 2025-08-25 20:14:51 -07:00
quantization | …
usage | MoE Default use triton's blockwise fp8 in TP Case (#3678) | 2025-08-29 11:07:30 +08:00
benchmark.md | …
index.md | …
offline_inference.md | rename ernie_xxx to ernie4_5_xxx (#3621) | 2025-08-26 19:29:27 +08:00
parameters.md | [Precision] Support lm_head layer running in float32 (#3597) | 2025-08-27 11:34:53 +08:00
supported_models.md | …