apps/FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git, synced 2025-12-24 13:28:13 +08:00
4408dc7f67645f786e15acef0251412abb0a3e04
FastDeploy/fastdeploy/model_executor/models
Latest commit: Ayakouji, 987609c894, 2025-09-12 19:13:08 +08:00
[BugFix] Fix image_feature 0-Size causing insert failed (#4042)
* update * fix image_feature
ernie4_5_vl    [BugFix] Fix image_feature 0-Size causing insert failed (#4042)    2025-09-12 19:13:08 +08:00
qwen2_5_vl    …
__init__.py    …
deepseek_v3.py    【Inference Optimize】Update MergedReplicatedLinear for DSK qkv_a_proj_with_mqa. (#3673)    2025-09-04 21:16:05 -07:00
ernie4_5_moe.py    [V1 Loader] Ernie kv cache quant support v1 loader (#3899)    2025-09-09 05:25:08 -07:00
ernie4_5_mtp.py    …
glm4_moe.py    [Feature] GLM-45-AIR Support Mix Quantization(Dense wfp8afp8 and wint8 triton_moe_backend) (#4051)    2025-09-11 20:08:09 +08:00
model_base.py    …
qwen2.py    rename fused_get_rope.cu (#3752)    2025-09-03 10:54:34 +08:00
qwen3.py    rename fused_get_rope.cu (#3752)    2025-09-03 10:54:34 +08:00
qwen3moe.py    rename fused_get_rope.cu (#3752)    2025-09-03 10:54:34 +08:00
tp_utils.py    …
utils.py    …