Commit Graph

90 Commits

Author SHA1 Message Date
wanghailu 2b8bca17e2 format 2023-01-05 14:23:46 +08:00
wanghailu 156a40806d add pad operation 2023-01-05 05:49:53 +00:00
wanghailu c19d6e6bb0 add det operation 2023-01-04 09:24:52 +00:00
wanghailu 68f4630dac add cumsum operation 2023-01-03 08:45:54 +00:00
wanghailu 5ae96ce060 add floormod operation 2023-01-03 07:20:47 +00:00
wanghailu dbb606f158 add floordiv operation and floordivtrunc operation 2023-01-03 07:07:22 +00:00
wanghailu 0079d1271b add cast operation 2022-12-28 08:57:52 +00:00
wanghailu 5329e66d0f add muln operation 2022-12-27 08:22:50 +00:00
wanghailu 45ea5c83f6 add addn operation 2022-12-27 07:03:23 +00:00
wanghailu 9177629a77 add transform operation 2022-12-27 02:13:10 +00:00
wanghailu f98f91de8b add sqrt and rsqrt operation 2022-12-26 06:29:12 +00:00
wanghailu 335dfabf80 add reciprocal operation 2022-12-26 06:15:07 +00:00
wanghailu 376c992aca add power operation 2022-12-26 05:45:40 +00:00
wanghailu 39d2a3571b add negTensor operation 2022-12-26 04:40:24 +00:00
wanghailu 0707fb6aff add mseloss operation 2022-12-26 03:06:34 +00:00
wanghailu 4ad648fa36 add maximum and minimum operation 2022-12-21 07:42:54 +00:00
wanghailu 2749b49ff7 add l2loss operation 2022-12-21 02:21:19 +00:00
wanghailu 34ba231cd4 add log1p operation 2022-12-21 01:41:28 +00:00
wanghailu 8bd1d64c53 add log operation 2022-12-20 03:09:40 +00:00
wanghailu 084063a68f add operation fill 2022-12-19 02:59:33 +00:00
wanghailu 82f510672d add exp operation 2022-12-19 02:03:12 +00:00
wanghailu b27b95a5e2 add erf operation 2022-12-19 01:51:25 +00:00
wanghailu 9346232129 add divnonan operation and test 2022-12-15 08:47:22 +00:00
wanghailu a56fb98eee add operation cnnl div, test and test for divdemo bangc kernel 2022-12-15 08:26:49 +00:00
wanghailu 949e00b732 add operation clip 2022-12-15 06:04:23 +00:00
wanghailu 58b89dd601 add ceil operation and floor operation 2022-12-14 02:50:06 +00:00
wanghailu 46a1bb2773 add copy operation on mlu 2022-12-14 02:32:32 +00:00
wanghailu 820d855ec8 add trigon function operation on mlu: sin,cos,tan,asin,sinh,asinh 2022-12-12 06:24:24 +00:00
wanghailu 392427cca6 add transpsoe code and test 2022-12-12 05:17:53 +00:00
wanghailu 8cfe04e5b7 fix 2022-12-08 07:41:15 +00:00
wanghailu a8bd1a910c add convbpfilter 2022-12-07 06:52:45 +00:00
wanghailu b68dcf8b9a add test 2022-12-06 07:09:51 +00:00
wanghailu 111ff10df0 add test for activation_backward 2022-12-06 04:38:26 +00:00
wanghailu db9069f1b7 add activation backward operation 2022-12-05 08:45:39 +00:00
wanghailu 468ed541af commit for format 2022-12-02 11:38:02 +08:00
wanghailu 267bfa3a4b add activation operatiopn relu, tanh, sigmoid on mlu 2022-12-02 03:24:09 +00:00
wendy12022 c5966f8d81
Add: resize operator and cuda kernel,support nearest/linear coef. (#51)
ADD: resize operator and cuda kernel,support nearest/linear coef.

fix some

fix tests

add more tests for linear mode.

add linear coef mode.

add scales

add tests

fix tests.

add notLarger notSmaller

fix

add test

ADD:resize operator and cuda kernel
2022-11-14 09:30:22 +08:00
zhengly123 63d8aff985
Fix: cuCtxCreate before other initialization (#49)
Fix: create cuCtx at the very beginning

Co-authored-by: Liyan Zheng <liyan-zheng@outlook.com>
2022-10-19 15:41:48 +08:00
Zixuan Ma 00b2f18c17
Fix: unsigned compare in test (#50)
fix: unsigned compare in test.

Test project /home/mazx/git/InfiniTensor/build
      Start  1: test_graph
 1/18 Test  #1: test_graph .......................   Passed    0.03 sec
      Start  2: test_hash
 2/18 Test  #2: test_hash ........................   Passed    0.01 sec
      Start  3: test_tensor_save
 3/18 Test  #3: test_tensor_save .................   Passed    0.02 sec
      Start  4: test_verify
 4/18 Test  #4: test_verify ......................   Passed    0.01 sec
      Start  5: test_batch_norm
 5/18 Test  #5: test_batch_norm ..................   Passed    0.01 sec
      Start  6: test_concat
 6/18 Test  #6: test_concat ......................   Passed    0.01 sec
      Start  7: test_conv
 7/18 Test  #7: test_conv ........................   Passed    0.24 sec
      Start  8: test_conv_transposed_2d
 8/18 Test  #8: test_conv_transposed_2d ..........   Passed    0.01 sec
      Start  9: test_element_wise
 9/18 Test  #9: test_element_wise ................   Passed    0.01 sec
      Start 10: test_extend
10/18 Test #10: test_extend ......................   Passed    0.01 sec
      Start 11: test_gather
11/18 Test #11: test_gather ......................   Passed    0.01 sec
      Start 12: test_matmul
12/18 Test #12: test_matmul ......................   Passed    0.01 sec
      Start 13: test_pad
13/18 Test #13: test_pad .........................   Passed    0.01 sec
      Start 14: test_pooling
14/18 Test #14: test_pooling .....................   Passed    0.01 sec
      Start 15: test_reduce_mean
15/18 Test #15: test_reduce_mean .................   Passed    0.01 sec
      Start 16: test_reshape
16/18 Test #16: test_reshape .....................   Passed    0.01 sec
      Start 17: test_slice
17/18 Test #17: test_slice .......................   Passed    0.01 sec
      Start 18: test_split
18/18 Test #18: test_split .......................   Passed    0.02 sec

100% tests passed, 0 tests failed out of 18
2022-10-19 15:03:03 +08:00
zhengly123 4e0040c8a0
Add: connection among tensors and operators (#45)
* Add: refs_to_wrefs and wrefs_to_refs

* Add: op and tensor connection

* Add: inception-v3 block test

* Refactor: addOperatorAndConnect

Co-authored-by: Liyan Zheng <liyan-zheng@outlook.com>
2022-10-18 22:02:51 +08:00
wendy12022 d1c913010f
ADD:reduce_mean operator and cuda kernel. (#47)
add new line at file ending.
2022-10-15 16:53:58 +08:00
wendy12022 a4d6426589
ADD: batch norm operator and cuda kernel. (#44)
fix numInputs of batchNorm, add new line in file ending.

ADD: batch norm operator and cuda kernel.

add training

remove comments.

fix compile error.

add batch norm operator and cuda kernel.
2022-10-15 16:29:28 +08:00
zhengly123 1152adc94a
Add: python API for timing ConvTranspose (#46)
* Add: python interfaced for timing operators

* Fix: CUDA Runtime run

Co-authored-by: Liyan Zheng <liyan-zheng@outlook.com>
2022-10-07 16:03:11 +08:00
Hardy b0c2a08252
Support bang c kernel wanghailu 0927 (#43)
* fix a little bug which found by new verison CMake

* add code for support BangC language kernel , just like Cuda kernel, not
library

* add bangc kernel

* support BangC kernel

* add code for support BangC kernel

* support bangc kernel

* fix some code from reviewer

* fix code of template fumction

* add code for support bangc kernel

* fix bangc format

Co-authored-by: wanghailu <wanghailu@qiyuanlab.com>
Co-authored-by: Haojie Wang <haojie0429@gmail.com>
2022-09-30 11:01:52 +08:00
wendy12022 26cee55e81
ADD:extend operator and cuda kernel. (#40)
Co-authored-by: Haojie Wang <haojie0429@gmail.com>
2022-09-29 14:52:50 +08:00
wendy12022 fe14c91f54
ADD: Gather operator and cuda kernel. (#41)
fix a memory leak.

add tests.

ADD gather cuda kernel.

ADD gather operator

Co-authored-by: Haojie Wang <haojie0429@gmail.com>
2022-09-29 14:44:20 +08:00
wendy12022 3c6e208f42
ADD:concat/split operator and cuda kernels (#29)
* ADD:concat/split operator and cuda kernels

refector

minor change comment

ADD:concat/split operator and cuda kernels

merge split_kernel and concat_kernel to split_concat_kernel.

Revert "fix"

This reverts commit 459926be09a838658ec55f1e0a72b3cf17037d5c.

fix

ADD:concat/split operator and cuda kernels

change whole tensor name to composed tensor

fix some

remove unused header.

rebase

add CudaKernel

add test for split.

ADD split operator and cuda kernel.

modify test.

ADD:concat operator and cuda kernel.

ADD:concat/split operator and cuda kernels

fix some

remove unused header.

rebase

add CudaKernel

ADD:concat/split operator and cuda kernels

add test for split.

ADD split operator and cuda kernel.

modify test.

ADD:concat operator and cuda kernel.

* remove extra comment; typo fix.

Co-authored-by: Haojie Wang <haojie0429@gmail.com>
2022-09-29 11:01:30 +08:00
wendy12022 5560d0f2fb
ADD:pad/slice operator and cuda kernel. (#39)
fix compile error

refector

clang format

split test.

fix compile error.

ADD slice cuda kernel.

ADD slice operator.

ADD:pad operator and cuda kernel.
2022-09-29 10:29:24 +08:00
zhengly123 1aefc1b27e
Add python interface for CUDA operator evaluation (#42)
* Refactor: seperate data generator

* Add: python bindings for opTimer

* Fix: test_perfengine

Co-authored-by: Liyan Zheng <liyan-zheng@outlook.com>
2022-09-27 10:41:12 +08:00
deathwings602 11d5aa1ccc
Add TVM codegen for MemboundOp (#35)
* Add:  interface for membound TVM kernel and test

* add getAnsorCode

* add evaluation, but link failed

* add evaluation of kernel, but link failed

* Fix: link libcuda and nvrtc

* add print

* Add: const for source of copy

* compile and evaluate the kernel

* add compute

* fix gen_ansor_op.py

* fix membound_TVM

* format and fix CMakeLists.txt

* fix memory leak

Co-authored-by: Liyan Zheng <liyan-zheng@outlook.com>
Co-authored-by: huangshuhong <huangsh19@mails.tsinghua.edu.cn>
2022-09-22 18:06:45 +08:00