mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced 2025-10-06 09:07:10 +08:00

* yolov5 use external stream * yolov5lite/v6/v7/v7e2etrt: optimize output tensor and cuda stream * avoid reallocating output tensors * add input output tensors to FastDeployModel * add cuda.cmake * rename to reused_input/output_tensors * eliminate cmake cuda arch error * use swap to release input and output tensors Co-authored-by: Jason <jiangjiajun@baidu.com>