This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2025-10-04 16:22:57 +08:00
Code
Issues
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
865e856a94e952f75b2f2be11d14cec74dd53eb9
FastDeploy
/
fastdeploy
/
model_executor
History
AIbin
a197dcd729
【Inference Optimize】Support ERNIE-4_5-300B-A47B-2BITS-Paddle model TP2/TP4 Inference (
#2666
)
...
* Support TP2&TP4 Wint * Support TP2&TP4 Wint2 Inference
2025-07-01 18:29:11 +08:00
..
graph_optimization
Sync v2.0 version of code to github repo
2025-06-29 23:29:37 +00:00
guided_decoding
Sync v2.0 version of code to github repo
2025-06-29 23:29:37 +00:00
layers
【Inference Optimize】Support ERNIE-4_5-300B-A47B-2BITS-Paddle model TP2/TP4 Inference (
#2666
)
2025-07-01 18:29:11 +08:00
models
Sync v2.0 version of code to github repo
2025-06-29 23:29:37 +00:00
ops
Sync v2.0 version of code to github repo
2025-06-29 23:29:37 +00:00
__init__.py
[LLM] First commit the llm deployment code
2025-06-09 19:20:15 +08:00
model_loader.py
Sync v2.0 version of code to github repo
2025-06-29 23:29:37 +00:00
pre_and_post_process.py
Sync v2.0 version of code to github repo
2025-06-29 23:29:37 +00:00