* Enhance run_ci_xpu.sh with caching and prefill options
* Update model path and configuration in run_ci_xpu.sh
* Add '北朝' keyword to assertion in run_45vl.py
* Enhance process termination logic in run_ci_xpu.sh
* Set timeout for CI_XPU job to 60 minutes
* Remove extra newline in stop_processes function
* Update paddlepaddle-xpu installation command
Comment out the previous paddlepaddle-xpu installation command and replace it with a specific version installation due to EP parallel error.
* Update PaddlePaddle installation command
* Remove max_tokens from model response configuration
Removed max_tokens parameter from the model response call.
* add mtp case
* Refactor test_mtp.py for clarity and efficiency
Removed duplicate import of json and simplified spec_config formatting.
---------
Co-authored-by: root <root@gajl-bbc-onlinec-com-1511972.gajl.baidu.com>
* add xpu ci
* add case
* add case
* fix ci bug
* Update Docker image tag to 'latest' in CI workflow
* Fix set -e usage in run_xpu_ci_pytest.sh
* add pd case
* add case
* Configure pip to use Tsinghua mirror for dependencies
Set the global pip index URL to Tsinghua mirror.
* fix ci bug
* fix bug
* fix bug
---------
Co-authored-by: suijiaxin <suijiaxin@Suis-MacBook-Pro.local>
Co-authored-by: root <root@gajl-bbc-onlinec-com-1511964.gajl.baidu.com>
Co-authored-by: root <root@gajl-bbc-onlinec-com-1511972.gajl.baidu.com>