mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced 2025-10-21 15:49:31 +08:00

* support chunk_prefill both normal and speculative_decoding(mtp) * optimize pd-disaggregation config * fix bug
* support chunk_prefill both normal and speculative_decoding(mtp) * optimize pd-disaggregation config * fix bug