apps/FastDeploy
Mirror of https://github.com/PaddlePaddle/FastDeploy.git, synced 2025-12-24 13:28:13 +08:00
561a7ebc0b1fa402f068be029bbb06eae3f52dd3
FastDeploy/custom_ops/gpu_ops/cpp_extensions.cc
AIbin
a7392a0ff9
【Inference Optimize】DeepSeek-V3-model MLA Optimize (#3886)
...
* support MLA chunk_size auto search & cuda_graph
2025-09-11 10:46:09 +08:00
56 KiB