forked from jiuyuan/InfiniTensor
00e6cc2587
* add reduce_mean and gather * fix format * add kunlun allreduce and cmakefile * add kunlun allreduce and cmakefile * deltete cmake opt * fix format * fix makefile * add DIST option in Makefile * add xpu allgather * delete xpu_wait() * add xpu allgather * delete specific compiler * fix format * fix gather * add broadcast * fix format * fix * fix xpu, add where operation, fix element-wise operation * fix softmax * fix softmax * log internal input and output * fix kunlun gather bugs * update CMakeList.txt and Makefile * fix some kunlun kernels * fix Makefile * fix Makefile * set cmake version 3.12 * format * fix where, gather and support gpt2 * "fix format" * fix format * copy onnx.py from master * use KUNLUN_HOME instead of absolute path * fix torchvision models * support torchvison model-zoo * fix format * format fix, CMakeList fix * fix review * fix vecToString return value * fix format * delete empty file --------- Co-authored-by: wanghailu <wanghailu0717@163.com> Co-authored-by: wanghailu <wanghailu@qiyuanlab.com> Co-authored-by: Haojie Wang <haojie0429@gmail.com> |
||
---|---|---|
.. | ||
test_kunlun_add.cc | ||
test_kunlun_allgather.cc | ||
test_kunlun_allreduce.cc | ||
test_kunlun_batch_norm.cc | ||
test_kunlun_broadcast.cc | ||
test_kunlun_concat.cc | ||
test_kunlun_conv.cc | ||
test_kunlun_conv_trans.cc | ||
test_kunlun_element_wise.cc | ||
test_kunlun_gather.cc | ||
test_kunlun_matmul.cc | ||
test_kunlun_pad.cc | ||
test_kunlun_pooling.cc | ||
test_kunlun_slice.cc | ||
test_kunlun_softmax.cc | ||
test_kunlun_split.cc | ||
test_kunlun_transpose.cc | ||
test_kunlun_unary.cc | ||
test_kunlun_where.cc |