FastDeploy/custom_ops/gpu_ops/cpp_extensions.cc at 561a7ebc0b1fa402f068be029bbb06eae3f52dd3

mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-12-24 13:28:13 +08:00

AIbin a7392a0ff9 【Inference Optimize】DeepSeek-V3-model MLA Optimize (#3886)

* support MLA chunk_size auto search & cuda_graph

2025-09-11 10:46:09 +08:00
