* TRT cast int64 to int32
* windows cmake build cuda src
* fix windows cmake error when build cuda src
* add a notice in windows gpu build doc
* cmake add cuda std=11
* TRT cast output from int32 to int64
* nits
* trt get original input output dtype
* add quantize model support for trt and paddle
* fix bugs
* fix
* update paddle2onnx version
* update version
* add quantize test
Co-authored-by: Jason <jiangjiajun@baidu.com>
* Add custom operator for onnxruntime ans fix paddle backend
* Polish cmake files and runtime apis
* Remove copy libraries
* fix some issue
* fix bug
* fix bug
* Support remove multiclass_nms to enable paddledetection run tensorrt
* Support remove multiclass_nms to enable paddledetection run tensorrt
* Support remove multiclass_nms to enable paddledetection run tensorrt
* Support remove multiclass_nms to enable paddledetection run tensorrt
* add common operator multiclassnms
* fix compile problem
Co-authored-by: root <root@bjyz-sys-gpu-kongming3.bjyz.baidu.com>
* update .gitignore
* Added checking for cmake include dir
* fixed missing trt_backend option bug when init from trt
* remove un-need data layout and add pre-check for dtype
* changed RGB2BRG to BGR2RGB in ppcls model
* Update CMakeLists.txt