InfiniTensor/include
constroy Li 48847958d0
impl sqrt on CUDA (#109)
* impl sqrt on CUDA
fix parser of Gather and ReduceMean

* fix test_gather

* fix test_cuda_gather

* impl sqrt cpu and add sqrt to test_cuda_unary

* cuda_unary supports arbitrary shapes

* fix SplitOp with dim=-1
2023-08-18 12:17:47 +08:00
bang Dev for 202303ddl (#66) 2023-04-18 15:10:33 +08:00
core impl sqrt on CUDA (#109) 2023-08-18 12:17:47 +08:00
cuda impl sqrt on CUDA (#109) 2023-08-18 12:17:47 +08:00
ffi Add TVM codegen for MemboundOp (#35) 2022-09-22 18:06:45 +08:00
intelcpu Cpu backend2 (#77) 2023-04-17 12:15:23 +08:00
nnet Dev for 202303ddl (#66) 2023-04-18 15:10:33 +08:00
operators support mixed dtype (#102) 2023-08-16 21:49:43 +08:00
utils support mixed dtype (#102) 2023-08-16 21:49:43 +08:00
test.h Add python interface for CUDA operator evaluation (#42) 2022-09-27 10:41:12 +08:00