InfiniTensor/include/cuda
zhengly123 d39328afce
Fix: PerfRecord in shared pointers (#31)
* Fix: PerfData in a shared pointer

* Add: abstraction for kernels without configuration

Co-authored-by: Liyan Zheng <liyan-zheng@outlook.com>
2022-09-18 20:27:18 +08:00
..
cuda_common.h Add CUDA runtime (#6) 2022-08-22 15:01:03 +08:00
cuda_element_wise.h ADD add/mul/sub/div/pow operators and CPU/CUDA kernels (#26) 2022-09-09 13:43:59 +08:00
cuda_kernel_wihtout_config.h Fix: PerfRecord in shared pointers (#31) 2022-09-18 20:27:18 +08:00
cuda_runtime.h cuDNN conv tuning (#16) 2022-08-29 21:37:07 +08:00
cuda_unary.h Add activation operators and kernels 2022-09-16 13:58:57 +08:00
cuda_utility.h Simplify tensor transfer between CPU and CUDA (#10) 2022-08-25 11:29:16 +08:00