InfiniTensor/include/core
Hardy b0c2a08252
Support bang c kernel wanghailu 0927 (#43)
* fix a little bug which found by new verison CMake

* add code for support BangC language kernel , just like Cuda kernel, not
library

* add bangc kernel

* support BangC kernel

* add code for support BangC kernel

* support bangc kernel

* fix some code from reviewer

* fix code of template fumction

* add code for support bangc kernel

* fix bangc format

Co-authored-by: wanghailu <wanghailu@qiyuanlab.com>
Co-authored-by: Haojie Wang <haojie0429@gmail.com>
2022-09-30 11:01:52 +08:00
..
blob.h Support bang c kernel wanghailu 0927 (#43) 2022-09-30 11:01:52 +08:00
common.h Add: ConvTransposed (#33) 2022-09-19 15:05:39 +08:00
constants.h Add activation operators and kernels 2022-09-16 13:58:57 +08:00
data_type.h Json perfrecord (#32) 2022-09-22 15:34:34 +08:00
graph.h Fix NNet tests after migration (#27) 2022-09-13 15:17:22 +08:00
hash.h Tensor hash and inferShape (#4) 2022-08-15 15:08:56 +08:00
kernel.h Json perfrecord (#32) 2022-09-22 15:34:34 +08:00
mutator.h Update: OpAttrs -> OpPerfKey 2022-08-09 14:58:45 +08:00
object.h Add: perf engine 2022-08-07 21:12:17 +08:00
operator.h ADD:concat/split operator and cuda kernels (#29) 2022-09-29 11:01:30 +08:00
perf_engine.h Json perfrecord (#32) 2022-09-22 15:34:34 +08:00
ref.h Add: ConvTransposed (#33) 2022-09-19 15:05:39 +08:00
runtime.h ADD: Gather operator and cuda kernel. (#41) 2022-09-29 14:44:20 +08:00
tensor.h ADD:pad/slice operator and cuda kernel. (#39) 2022-09-29 10:29:24 +08:00
tensor_base.h Simplify tensor transfer between CPU and CUDA (#10) 2022-08-25 11:29:16 +08:00