This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2025-12-24 13:28:13 +08:00
Code
Issues
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
b87e2c6184b1d918b60a528aecfd54aa877e2403
FastDeploy
/
fastdeploy
/
model_executor
/
graph_optimization
History
Ryan
b87e2c6184
[CUDAGraph]Add support for custom all-reduce operators under SOT mode (
#4386
)
2025-10-16 19:31:19 +08:00
..
__init__.py
[LLM] First commit the llm deployment code
2025-06-09 19:20:15 +08:00
cudagraph_piecewise_backend.py
[CUDAGraph]Add support for custom all-reduce operators under SOT mode (
#4386
)
2025-10-16 19:31:19 +08:00
decorator.py
[BugFix] Fix
image_feature
0-Size causing insert failed (
#4042
)
2025-09-12 19:13:08 +08:00
dynamic_dims_marker.py
[SOT] Mark dynamic dims by type annotations (
#2771
)
2025-07-22 00:23:52 -07:00
graph_optimization_backend.py
[Executor]CUDAGraph support Speculate Decode (
#3769
)
2025-10-09 21:18:29 +08:00
utils.py
[SOT] Add sot warmup (NVIDIA GPU Only) (
#2929
)
2025-07-22 21:36:14 +08:00