This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2025-10-30 11:26:39 +08:00
Code
Issues
Actions
6
Packages
Projects
Releases
Wiki
Activity
Files
a7392a0ff944a1f40a26023c53c80b10f263421f
FastDeploy
/
tests
/
layers
History
AIbin
a7392a0ff9
【Inference Optimize】DeepSeek-V3-model MLA Optimize (
#3886
)
...
* support MLA chunk_size auto search & cuda_graph
2025-09-11 10:46:09 +08:00
..
test_append_attention_with_output.py
【Inference Optimize】DeepSeek-V3-model MLA Optimize (
#3886
)
2025-09-11 10:46:09 +08:00
test_append_attention.py
【Inference Optimize】DeepSeek-V3-model MLA Optimize (
#3886
)
2025-09-11 10:46:09 +08:00
test_min_sampling.py
Add stable ci (
#3460
)
2025-08-20 08:57:17 +08:00
test_moba_attention.py
Revert "【FIX】Change the name of sparse attn from moba to plas (
#3845
)" (
#4001
)
2025-09-09 11:08:23 +08:00
test_repetition_early_stopper.py
Add stable ci (
#3460
)
2025-08-20 08:57:17 +08:00
test_sampler.py
Add stable ci (
#3460
)
2025-08-20 08:57:17 +08:00