apps/FastDeploy
Mirror of https://github.com/PaddlePaddle/FastDeploy.git (synced 2025-10-08 01:50:27 +08:00)
51f68ae593b8264d3b93e395ab8010db0937db84
FastDeploy/fastdeploy/model_executor/models
Latest commit: AIbin, beec24fd89, 2025-08-19 10:42:42 +08:00
【Inference Optimize】DeepSeek-v3 model inference performance optimization (#3455)
* DSK_OPT_01
* update FA3
Name | Last commit | Date
ernie4_5_vl | qwen3_moe (#3084) | 2025-08-06 14:45:27 +08:00
__init__.py | … |
deepseek_v3.py | 【Inference Optimize】DeepSeek-v3 model inference performance optimization (#3455) | 2025-08-19 10:42:42 +08:00
ernie4_5_moe.py | [V1 Loader] Support Ernie text(moe and dense) (#3110) | 2025-08-14 20:25:28 +08:00
ernie4_5_mtp.py | … |
model_base.py | [plugin] Custom model_runner/model support (#3186) | 2025-08-04 18:52:39 -07:00
qwen2.py | … |
qwen3.py | qwen3 0.3B fix (#3255) | 2025-08-08 11:35:40 +08:00
qwen3moe.py | [V1 Loader] Support Ernie text(moe and dense) (#3110) | 2025-08-14 20:25:28 +08:00
tp_utils.py | … |
utils.py | [V1 Loader] Support DeepSeekV3(bf16) (#3294) | 2025-08-11 13:39:28 +08:00