apps/FastDeploy
Mirror of https://github.com/PaddlePaddle/FastDeploy.git, synced 2025-12-24 13:28:13 +08:00
442543cd6bd6de539d0849f13071a451ff7585d6
FastDeploy/fastdeploy/model_executor/models
Latest commit: YuanRisheng, 03b3d6175d, fix mtp (#4105), 2025-09-15 20:26:07 +08:00
Name | Last commit | Date
ernie4_5_vl | [v1 loader]qwen Offline fp8 (#4036) | 2025-09-15 13:44:11 +08:00
qwen2_5_vl | …
__init__.py | …
deepseek_v3.py | 【Inference Optimize】Update MergedReplicatedLinear for DSK qkv_a_proj_with_mqa. (#3673) | 2025-09-04 21:16:05 -07:00
ernie4_5_moe.py | [v1 loader]qwen Offline fp8 (#4036) | 2025-09-15 13:44:11 +08:00
ernie4_5_mtp.py | fix mtp (#4105) | 2025-09-15 20:26:07 +08:00
glm4_moe.py | [Feature] GLM-45-AIR Support Mix Quantization(Dense wfp8afp8 and wint8 triton_moe_backend) (#4051) | 2025-09-11 20:08:09 +08:00
model_base.py | …
qwen2.py | rename fused_get_rope.cu (#3752) | 2025-09-03 10:54:34 +08:00
qwen3.py | rename fused_get_rope.cu (#3752) | 2025-09-03 10:54:34 +08:00
qwen3moe.py | rename fused_get_rope.cu (#3752) | 2025-09-03 10:54:34 +08:00
tp_utils.py | …
utils.py | …