InfiniTensor

Hardy 1184fa131f Xpu (#82)	2023-10-16 10:57:08 +08:00

* support kunlun xpu and add an operator named Add
* add sub, mul, div, pow, maximum, minimum
* add code
* add xpu code
* add code
* add matmul
* add transpose
* add unary operator
* add unary operator
* add some operator
* add code
* support run resnet18 on xpu
* add code
* add max pool2d
* fix xpu code, let it can run.
* Add XPU operators (#120)
* add floordiv for xpu
* add batchnorm for xpu
* add more cast types for xpu
* add conv_trans for xpu
* add pad for xpu
* add logical ops for xpu
* fix format for xpu src and include
* fix format for xpu test
* fix format for xpu src

---------

Co-authored-by: Bolun <bolunz@u.nus.edu>

* Xpu abs (#121)
* add: unary kernel for xpu
* formatting
* format
* format
* format
* fix: pointer jump
* fix optype comments
* fix bug introduced while resolving conflict
* change cmake option for kunlunxin xpu from 'xpu' to 'kunlun'; fix bug after merging distributed infrastructure
* Add doc support for xpu (#141)
* fix
* fix
* fix pooling test
* format
* format
* fix
* fix
* set cmake version requirement
* fix cmakelists
* rename xpu to kunlun
* fix
* fix format
* fix format
* fix format
* fix change name to kunlun
* format
* fix format
* clang format
* fix format

---------

Co-authored-by: root <root@localhost.localdomain>
Co-authored-by: wanghailu <wanghailu@qiyuanlab.com>
Co-authored-by: wanghailu <wanghailu0717@163.com>
Co-authored-by: Bolun Zhang <48948016+Chamberlain0w0@users.noreply.github.com>
Co-authored-by: Bolun <bolunz@u.nus.edu>
Co-authored-by: zhangyue207 <138768300+zhangyue207@users.noreply.github.com>
Co-authored-by: Haojie Wang <haojie0429@gmail.com>
Co-authored-by: baominghelly <41820386+baominghelly@users.noreply.github.com>
Co-authored-by: Bolun <chamberlain0w0@gmail.com>

..
bang	fix bang runtime bug after merging distributed branch (#137)	2023-09-19 14:10:39 +08:00
core	Xpu (#82)	2023-10-16 10:57:08 +08:00
cuda	Add GatherElements op and cuda kernel (#149)	2023-10-12 09:18:12 +08:00
ffi	Add TVM codegen for MemboundOp (#35)	2022-09-22 18:06:45 +08:00
intelcpu	Cpu backend2 (#77)	2023-04-17 12:15:23 +08:00
kunlun	Xpu (#82)	2023-10-16 10:57:08 +08:00
nnet	Dev for 202303ddl (#66)	2023-04-18 15:10:33 +08:00
operators	add transpose, concat and split for native cpu (#158)	2023-10-12 10:14:28 +08:00
utils	tensor parallel for transformer (#125)	2023-09-14 14:19:45 +08:00
test.h	Add python interface for CUDA operator evaluation (#42)	2022-09-27 10:41:12 +08:00