forked from jiuyuan/InfiniTensor
![]() * Add: interface for membound TVM kernel and test * add getAnsorCode * add evaluation, but link failed * add evaluation of kernel, but link failed * Fix: link libcuda and nvrtc * add print * Add: const for source of copy * compile and evaluate the kernel * add compute * fix gen_ansor_op.py * fix membound_TVM * format and fix CMakeLists.txt * fix memory leak Co-authored-by: Liyan Zheng <liyan-zheng@outlook.com> Co-authored-by: huangshuhong <huangsh19@mails.tsinghua.edu.cn> |
||
---|---|---|
.. | ||
cuda_common.h | ||
cuda_element_wise.h | ||
cuda_kernel_wihtout_config.h | ||
cuda_runtime.h | ||
cuda_unary.h | ||
cuda_utility.h | ||
gbmm_g2bmm.cuh | ||
gbmm_g2bmm.h |