InfiniTensor/include/core
wendy12022 9032cbb973
Add: reshape/flatten/identity OP and cuda kernel (#34)
* ADD:reshape/flatten/identity operators and cuda kernel.

fix: use cudaMemcpyAsync

clang format.

ADD flatten/identity operator.

add test for reshape.

ADD: reshape operator and cuda kernel.

* Fix: seperate CUDA tests & remove old header

Co-authored-by: Liyan Zheng <liyan-zheng@outlook.com>
2022-09-21 14:04:30 +08:00
..
blob.h Add CUDA runtime (#6) 2022-08-22 15:01:03 +08:00
common.h Add: ConvTransposed (#33) 2022-09-19 15:05:39 +08:00
constants.h Add activation operators and kernels 2022-09-16 13:58:57 +08:00
data_type.h Extended DataType class and Runtime interaction (#9) 2022-08-23 16:55:59 +08:00
graph.h Fix NNet tests after migration (#27) 2022-09-13 15:17:22 +08:00
hash.h Tensor hash and inferShape (#4) 2022-08-15 15:08:56 +08:00
kernel.h Fix: PerfRecord in shared pointers (#31) 2022-09-18 20:27:18 +08:00
mutator.h Update: OpAttrs -> OpPerfKey 2022-08-09 14:58:45 +08:00
object.h Add: perf engine 2022-08-07 21:12:17 +08:00
operator.h Add: reshape/flatten/identity OP and cuda kernel (#34) 2022-09-21 14:04:30 +08:00
perf_engine.h Add: ConvTransposed (#33) 2022-09-19 15:05:39 +08:00
ref.h Add: ConvTransposed (#33) 2022-09-19 15:05:39 +08:00
runtime.h Extended DataType class and Runtime interaction (#9) 2022-08-23 16:55:59 +08:00
tensor.h Tensor serialization (#25) 2022-09-13 11:27:41 +08:00
tensor_base.h Simplify tensor transfer between CPU and CUDA (#10) 2022-08-25 11:29:16 +08:00