Commit Graph

6 Commits

Author SHA1 Message Date
Jason
e93bf6e35c [Other] Add FDTensor function Pad (#532)
* Add InferShape func for all the vision processors

* fix infer shape of limit short

* Fix infer shape bug of stride_pad

* revert modify of processor

* add function pad
2022-11-08 21:45:31 +08:00
Jason
3589c0fa94 [Model] Refactor PaddleClas module (#505)
* Refactor the PaddleClas module

* fix bug

* remove debug code

* clean unused code

* support pybind

* Update fd_tensor.h

* Update fd_tensor.cc

* temporary revert python api

* fix ci error

* fix code style problem
2022-11-07 19:33:47 +08:00
Jack Zhou
70f664161f [Functions] Add fd tensor concat (#507)
* Add fd tensor concat

* fix comment
2022-11-07 10:02:42 +08:00
Wang Xinyu
caa369f64a [Backend] TRT cast GPU input from int64 to int32, output from int32 to int64, and Windows support building CUDA files (#426)
* TRT cast int64 to int32

* windows cmake build cuda src

* fix windows cmake error when build cuda src

* add a notice in windows gpu build doc

* cmake add cuda std=11

* TRT cast output from int32 to int64

* nits

* trt get original input output dtype
2022-10-28 13:38:06 +08:00
Jack Zhou
9c150f0bfb Upgrade eigen func (#253)
* Add FDTensor copy and move assignment and constructor

* Upgrade the transpose to receive the output tensor same as input tensor

* Add note

* Add realloc for FDTensor

* Support output equals to input for softmax

* Remove FDTensor::Alloc
2022-09-20 10:58:07 +08:00
Jason
68523be411 Modify file structure to separate python and cpp code (#223)
Modify code structure
2022-09-14 15:44:13 +08:00