forked from jiuyuan/InfiniTensor
3e6ef305f1
* feat: support to sqrt op * feat: support to erf op * feat: support to expand op * feat: support to where op * fix: gather op index can be int64_t(hard coding) * fix: some wrong use * style: fix the format style * test: add test for change op * fix: rebase to master * fix: fix matmul b compute wrong * add expand and where kernel * Add int64 support for cuda gather kernel * add test_where.cc * add "expand.(cu/cc,test,cuda),modified where.cu" * Separate initialization of datatypes to avoid compile error * modify where.(cu/cc/h,test), expand and clip * Format fix * Format fix --------- Co-authored-by: xgqdut2016 <kenan_gewei@163.com> Co-authored-by: panzezhong <panzezhong@qiyuanlab.com> Co-authored-by: Haojie Wang <haojie0429@gmail.com> |
||
---|---|---|
.. | ||
G2BMM.h | ||
GBMM.h | ||
activation_backward.h | ||
batch_norm.h | ||
concat.h | ||
conv.h | ||
det.h | ||
dropout.h | ||
element_wise.h | ||
expand.h | ||
extend.h | ||
gather.h | ||
matmul.h | ||
membound.h | ||
pad.h | ||
pooling.h | ||
reduce_mean.h | ||
reshape.h | ||
resize.h | ||
slice.h | ||
softmax.h | ||
split.h | ||
transpose.h | ||
unary.h | ||
where.h |