InfiniTensor/include
PanZezhong1725 36ae7b7fb6
Add GatherElements op and cuda kernel (#149)
* Add GatherElements op and cuda kernel

* fix format

* remove print

* remove unused var

* fix spacing

* fix format

---------

Co-authored-by: panzezhong@qiyuanlab.com <panzezhong@zezhongpan>
Co-authored-by: Haojie Wang <haojie0429@gmail.com>
2023-10-12 09:18:12 +08:00
..
bang fix bang runtime bug after merging distributed branch (#137) 2023-09-19 14:10:39 +08:00
core Add GatherElements op and cuda kernel (#149) 2023-10-12 09:18:12 +08:00
cuda Add GatherElements op and cuda kernel (#149) 2023-10-12 09:18:12 +08:00
ffi Add TVM codegen for MemboundOp (#35) 2022-09-22 18:06:45 +08:00
intelcpu Cpu backend2 (#77) 2023-04-17 12:15:23 +08:00
nnet Dev for 202303ddl (#66) 2023-04-18 15:10:33 +08:00
operators Add GatherElements op and cuda kernel (#149) 2023-10-12 09:18:12 +08:00
utils tensor parallel for transformer (#125) 2023-09-14 14:19:45 +08:00
test.h Add python interface for CUDA operator evaluation (#42) 2022-09-27 10:41:12 +08:00