InfiniTensor/test
deathwings602 11d5aa1ccc
Add TVM codegen for MemboundOp (#35)
* Add: interface for membound TVM kernel and test

* add getAnsorCode

* add evaluation, but link failed

* add evaluation of kernel, but link failed

* Fix: link libcuda and nvrtc

* add print

* Add: const for source of copy

* compile and evaluate the kernel

* add compute

* fix gen_ansor_op.py

* fix membound_TVM

* format and fix CMakeLists.txt

* fix memory leak

Co-authored-by: Liyan Zheng <liyan-zheng@outlook.com>
Co-authored-by: huangshuhong <huangsh19@mails.tsinghua.edu.cn>
2022-09-22 18:06:45 +08:00
core       Tensor serialization (#25)                              2022-09-13 11:27:41 +08:00
kernels    Add bangc runtime and element-wise kernels              2022-09-22 16:57:39 +08:00
nnet       Add TVM codegen for MemboundOp (#35)                    2022-09-22 18:06:45 +08:00
operators  Add: reshape/flatten/identity OP and cuda kernel (#34)  2022-09-21 14:04:30 +08:00
script     cuDNN conv tuning (#16)                                 2022-08-29 21:37:07 +08:00