InfiniTensor/include
xgqdut2016 a3929c25f8
Add send and recv operators based on NCCL (#182)
* baseline sendrecv, bug

* success sendrecv

* get rank from comm

* set output shape

* successful:set output shape equal to input shape

* shape as attribute

* success:shape as attribute

* success send recv, output 0

* add onnx test

* split send and recv

* success split send and recv

* test-onnx bug

* success test-onnx

* modified onnx.py

* solve review
2023-12-14 16:38:03 +08:00
..
bang Fix workspace & bang conv (#183) 2023-11-24 15:16:25 +08:00
core Add send and recv operators based on NCCL (#182) 2023-12-14 16:38:03 +08:00
cuda Add layer normalization (#181) 2023-11-24 15:15:14 +08:00
ffi Add TVM codegen for MemboundOp (#35) 2022-09-22 18:06:45 +08:00
intelcpu Cpu backend2 (#77) 2023-04-17 12:15:23 +08:00
kunlun Xpu (#82) 2023-10-16 10:57:08 +08:00
nnet test: 支持编译 einnet 单元测试,但不是所有测试都能通过 (#174) 2023-11-03 13:21:49 +08:00
operators Add send and recv operators based on NCCL (#182) 2023-12-14 16:38:03 +08:00
utils Add layer normalization (#181) 2023-11-24 15:15:14 +08:00
test.h Add python interface for CUDA operator evaluation (#42) 2022-09-27 10:41:12 +08:00