wanghailu
2b8bca17e2
format
2023-01-05 14:23:46 +08:00
wanghailu
156a40806d
add pad operation
2023-01-05 05:49:53 +00:00
wanghailu
c19d6e6bb0
add det operation
2023-01-04 09:24:52 +00:00
wanghailu
68f4630dac
add cumsum operation
2023-01-03 08:45:54 +00:00
wanghailu
5ae96ce060
add floormod operation
2023-01-03 07:20:47 +00:00
wanghailu
dbb606f158
add floordiv operation and floordivtrunc operation
2023-01-03 07:07:22 +00:00
wanghailu
0079d1271b
add cast operation
2022-12-28 08:57:52 +00:00
wanghailu
5329e66d0f
add muln operation
2022-12-27 08:22:50 +00:00
wanghailu
45ea5c83f6
add addn operation
2022-12-27 07:03:23 +00:00
wanghailu
9177629a77
add transform operation
2022-12-27 02:13:10 +00:00
wanghailu
f98f91de8b
add sqrt and rsqrt operation
2022-12-26 06:29:12 +00:00
wanghailu
335dfabf80
add reciprocal operation
2022-12-26 06:15:07 +00:00
wanghailu
376c992aca
add power operation
2022-12-26 05:45:40 +00:00
wanghailu
39d2a3571b
add negTensor operation
2022-12-26 04:40:24 +00:00
wanghailu
0707fb6aff
add mseloss operation
2022-12-26 03:06:34 +00:00
wanghailu
4ad648fa36
add maximum and minimum operation
2022-12-21 07:42:54 +00:00
wanghailu
2749b49ff7
add l2loss operation
2022-12-21 02:21:19 +00:00
wanghailu
34ba231cd4
add log1p operation
2022-12-21 01:41:28 +00:00
wanghailu
8bd1d64c53
add log operation
2022-12-20 03:09:40 +00:00
wanghailu
084063a68f
add operation fill
2022-12-19 02:59:33 +00:00
wanghailu
82f510672d
add exp operation
2022-12-19 02:03:12 +00:00
wanghailu
b27b95a5e2
add erf operation
2022-12-19 01:51:25 +00:00
wanghailu
9346232129
add divnonan operation and test
2022-12-15 08:47:22 +00:00
wanghailu
a56fb98eee
add operation cnnl div, test and test for divdemo bangc kernel
2022-12-15 08:26:49 +00:00
wanghailu
949e00b732
add operation clip
2022-12-15 06:04:23 +00:00
wanghailu
58b89dd601
add ceil operation and floor operation
2022-12-14 02:50:06 +00:00
wanghailu
46a1bb2773
add copy operation on mlu
2022-12-14 02:32:32 +00:00
wanghailu
820d855ec8
add trigonometric function operations on mlu: sin,cos,tan,asin,sinh,asinh
2022-12-12 06:24:24 +00:00
wanghailu
392427cca6
add transpose code and test
2022-12-12 05:17:53 +00:00
wanghailu
8cfe04e5b7
fix
2022-12-08 07:41:15 +00:00
wanghailu
a8bd1a910c
add convbpfilter
2022-12-07 06:52:45 +00:00
wanghailu
b68dcf8b9a
add test
2022-12-06 07:09:51 +00:00
wanghailu
111ff10df0
add test for activation_backward
2022-12-06 04:38:26 +00:00
wanghailu
db9069f1b7
add activation backward operation
2022-12-05 08:45:39 +00:00
wanghailu
468ed541af
commit for format
2022-12-02 11:38:02 +08:00
wanghailu
267bfa3a4b
add activation operation relu, tanh, sigmoid on mlu
2022-12-02 03:24:09 +00:00
wendy12022
c5966f8d81
Add: resize operator and cuda kernel,support nearest/linear coef. ( #51 )
ADD: resize operator and cuda kernel,support nearest/linear coef.
fix some
fix tests
add more tests for linear mode.
add linear coef mode.
add scales
add tests
fix tests.
add notLarger notSmaller
fix
add test
ADD:resize operator and cuda kernel
2022-11-14 09:30:22 +08:00
zhengly123
63d8aff985
Fix: cuCtxCreate before other initialization ( #49 )
Fix: create cuCtx at the very beginning
Co-authored-by: Liyan Zheng <liyan-zheng@outlook.com>
2022-10-19 15:41:48 +08:00
Zixuan Ma
00b2f18c17
Fix: unsigned compare in test ( #50 )
fix: unsigned compare in test.
Test project /home/mazx/git/InfiniTensor/build
Start 1: test_graph
1/18 Test #1 : test_graph ....................... Passed 0.03 sec
Start 2: test_hash
2/18 Test #2 : test_hash ........................ Passed 0.01 sec
Start 3: test_tensor_save
3/18 Test #3 : test_tensor_save ................. Passed 0.02 sec
Start 4: test_verify
4/18 Test #4 : test_verify ...................... Passed 0.01 sec
Start 5: test_batch_norm
5/18 Test #5 : test_batch_norm .................. Passed 0.01 sec
Start 6: test_concat
6/18 Test #6 : test_concat ...................... Passed 0.01 sec
Start 7: test_conv
7/18 Test #7 : test_conv ........................ Passed 0.24 sec
Start 8: test_conv_transposed_2d
8/18 Test #8 : test_conv_transposed_2d .......... Passed 0.01 sec
Start 9: test_element_wise
9/18 Test #9 : test_element_wise ................ Passed 0.01 sec
Start 10: test_extend
10/18 Test #10 : test_extend ...................... Passed 0.01 sec
Start 11: test_gather
11/18 Test #11 : test_gather ...................... Passed 0.01 sec
Start 12: test_matmul
12/18 Test #12 : test_matmul ...................... Passed 0.01 sec
Start 13: test_pad
13/18 Test #13 : test_pad ......................... Passed 0.01 sec
Start 14: test_pooling
14/18 Test #14 : test_pooling ..................... Passed 0.01 sec
Start 15: test_reduce_mean
15/18 Test #15 : test_reduce_mean ................. Passed 0.01 sec
Start 16: test_reshape
16/18 Test #16 : test_reshape ..................... Passed 0.01 sec
Start 17: test_slice
17/18 Test #17 : test_slice ....................... Passed 0.01 sec
Start 18: test_split
18/18 Test #18 : test_split ....................... Passed 0.02 sec
100% tests passed, 0 tests failed out of 18
2022-10-19 15:03:03 +08:00
zhengly123
4e0040c8a0
Add: connection among tensors and operators ( #45 )
* Add: refs_to_wrefs and wrefs_to_refs
* Add: op and tensor connection
* Add: inception-v3 block test
* Refactor: addOperatorAndConnect
Co-authored-by: Liyan Zheng <liyan-zheng@outlook.com>
2022-10-18 22:02:51 +08:00
wendy12022
d1c913010f
ADD:reduce_mean operator and cuda kernel. ( #47 )
add new line at file ending.
2022-10-15 16:53:58 +08:00
wendy12022
a4d6426589
ADD: batch norm operator and cuda kernel. ( #44 )
fix numInputs of batchNorm, add new line in file ending.
ADD: batch norm operator and cuda kernel.
add training
remove comments.
fix compile error.
add batch norm operator and cuda kernel.
2022-10-15 16:29:28 +08:00
zhengly123
1152adc94a
Add: python API for timing ConvTranspose ( #46 )
* Add: python interface for timing operators
* Fix: CUDA Runtime run
Co-authored-by: Liyan Zheng <liyan-zheng@outlook.com>
2022-10-07 16:03:11 +08:00
Hardy
b0c2a08252
Support bang c kernel wanghailu 0927 ( #43 )
* fix a little bug which was found by new version CMake
* add code for support BangC language kernel, just like Cuda kernel, not library
* add bangc kernel
* support BangC kernel
* add code for support BangC kernel
* support bangc kernel
* fix some code from reviewer
* fix code of template function
* add code for support bangc kernel
* fix bangc format
Co-authored-by: wanghailu <wanghailu@qiyuanlab.com>
Co-authored-by: Haojie Wang <haojie0429@gmail.com>
2022-09-30 11:01:52 +08:00
wendy12022
26cee55e81
ADD:extend operator and cuda kernel. ( #40 )
Co-authored-by: Haojie Wang <haojie0429@gmail.com>
2022-09-29 14:52:50 +08:00
wendy12022
fe14c91f54
ADD: Gather operator and cuda kernel. ( #41 )
fix a memory leak.
add tests.
ADD gather cuda kernel.
ADD gather operator
Co-authored-by: Haojie Wang <haojie0429@gmail.com>
2022-09-29 14:44:20 +08:00
wendy12022
3c6e208f42
ADD:concat/split operator and cuda kernels ( #29 )
* ADD:concat/split operator and cuda kernels
refactor
minor change comment
ADD:concat/split operator and cuda kernels
merge split_kernel and concat_kernel to split_concat_kernel.
Revert "fix"
This reverts commit 459926be09a838658ec55f1e0a72b3cf17037d5c.
fix
ADD:concat/split operator and cuda kernels
change whole tensor name to composed tensor
fix some
remove unused header.
rebase
add CudaKernel
add test for split.
ADD split operator and cuda kernel.
modify test.
ADD:concat operator and cuda kernel.
ADD:concat/split operator and cuda kernels
fix some
remove unused header.
rebase
add CudaKernel
ADD:concat/split operator and cuda kernels
add test for split.
ADD split operator and cuda kernel.
modify test.
ADD:concat operator and cuda kernel.
* remove extra comment; typo fix.
Co-authored-by: Haojie Wang <haojie0429@gmail.com>
2022-09-29 11:01:30 +08:00
wendy12022
5560d0f2fb
ADD:pad/slice operator and cuda kernel. ( #39 )
fix compile error
refactor
clang format
split test.
fix compile error.
ADD slice cuda kernel.
ADD slice operator.
ADD:pad operator and cuda kernel.
2022-09-29 10:29:24 +08:00
zhengly123
1aefc1b27e
Add python interface for CUDA operator evaluation ( #42 )
* Refactor: separate data generator
* Add: python bindings for opTimer
* Fix: test_perfengine
Co-authored-by: Liyan Zheng <liyan-zheng@outlook.com>
2022-09-27 10:41:12 +08:00
deathwings602
11d5aa1ccc
Add TVM codegen for MemboundOp ( #35 )
* Add: interface for membound TVM kernel and test
* add getAnsorCode
* add evaluation, but link failed
* add evaluation of kernel, but link failed
* Fix: link libcuda and nvrtc
* add print
* Add: const for source of copy
* compile and evaluate the kernel
* add compute
* fix gen_ansor_op.py
* fix membound_TVM
* format and fix CMakeLists.txt
* fix memory leak
Co-authored-by: Liyan Zheng <liyan-zheng@outlook.com>
Co-authored-by: huangshuhong <huangsh19@mails.tsinghua.edu.cn>
2022-09-22 18:06:45 +08:00