This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2025-12-24 13:28:13 +08:00
Code
Issues
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
9d6a42b334c8fef9d446fa5848d1e437c830a1c5
FastDeploy
/
fastdeploy
/
model_executor
History
AIbin
fd91da7b41
【Inference Optimize】Support wint2 triton kernel about triton_utils_v2 (
#2842
)
...
* update supported_models doc
2025-07-15 14:35:40 +08:00
..
graph_optimization
…
guided_decoding
…
layers
【Inference Optimize】Support wint2 triton kernel about triton_utils_v2 (
#2842
)
2025-07-15 14:35:40 +08:00
models
Simplify the Config code (
#2770
)
2025-07-14 19:50:05 +08:00
ops
【Inference Optimize】Support wint2 triton kernel about triton_utils_v2 (
#2842
)
2025-07-15 14:35:40 +08:00
__init__.py
…
forward_meta.py
[Executor] Move forward_meta.py to fastdeploy/model_executor (
#2774
)
2025-07-10 20:36:51 +08:00
load_weight_utils.py
Simplify the Config code (
#2770
)
2025-07-14 19:50:05 +08:00
model_loader.py
[vl]remove duplicated load logic (
#2744
)
2025-07-13 07:36:26 +08:00
pre_and_post_process.py
[MTP] optimize mtp infer speed (
#2840
)
2025-07-14 19:50:22 +08:00