* [FlyCV] Bump up FlyCV -> official release 1.0.0
* add valid_xpu for detection
* add paddledetection model support for xpu
* support all detection model in c++ and python
* fix code
* add python stable_diffusion support
Co-authored-by: DefTruth <qiustudent_r@163.com>
Co-authored-by: DefTruth <31974251+DefTruth@users.noreply.github.com>
* [Backend] fix lite backend save model error
* [Backend] fixed typos
* [FlyCV] optimize the integration of FlyCV
* [cmake] close some tests options
* [cmake] close some test option
* [FlyCV] remove un-need warnings
* [FlyCV] remove un-need GetMat method
* [FlyCV] optimize FlyCV codes
* [cmake] remove un-need cmake function in examples/CMakelists
* [cmake] support gflags for Android
* yolov5 use external stream
* yolov5lite/v6/v7/v7e2etrt: optimize output tensor and cuda stream
* avoid reallocating output tensors
* add input output tensors to FastDeployModel
* add cuda.cmake
* rename to reused_input/output_tensors
* eliminate cmake cuda arch error
* use swap to release input and output tensors
Co-authored-by: Jason <jiangjiajun@baidu.com>
* add yolo cuda preprocessing
* cmake build cuda src
* yolov5 support cuda preprocessing
* yolov5 cuda preprocessing configurable
* yolov5 update get mat data api
* yolov5 check cuda preprocess args
* refactor cuda function name
* yolo cuda preprocess padding value configurable
* yolov5 release cuda memory
* cuda preprocess pybind api update
* move use_cuda_preprocessing option to yolov5 model
* yolov5lite cuda preprocessing
* yolov6 cuda preprocessing
* yolov7 cuda preprocessing
* yolov7_e2e cuda preprocessing
* remove cuda preprocessing in runtime option
* refine log and cmake variable name
* fix model runtime ptr type
Co-authored-by: Jason <jiangjiajun@baidu.com>