apps/FastDeploy
Mirror of https://github.com/PaddlePaddle/FastDeploy.git, synced 2025-10-05 16:48:03 +08:00
Path: FastDeploy/fastdeploy/spec_decode
Commit: a7392a0ff944a1f40a26023c53c80b10f263421f
Latest commit: a7392a0ff9 by AIbin, 2025-09-11 10:46:09 +08:00
【Inference Optimize】DeepSeek-V3-model MLA Optimize (#3886)
* support MLA chunk_size auto search & cuda_graph
File | Last commit | Date
.. | |
__init__.py | Sync v2.0 version of code to github repo | 2025-06-29 23:29:37 +00:00
base.py | fix typos (#3684) | 2025-09-01 17:50:17 +08:00
mtp.py | 【Inference Optimize】DeepSeek-V3-model MLA Optimize (#3886) | 2025-09-11 10:46:09 +08:00
ngram.py | polish code with new pre-commit rule (#2923) | 2025-07-19 23:19:27 +08:00