InfiniTensor/include
Anmuliar 0409eafb5f
Operators g2bmm&gbmm transplantation (#24)
* Function tune and corresponding testcase.

*Add: Tune function in /src/kernel/cuda/conv.cc and corresponding testcase in test_conv.

*Fix: A little bug of perfRecord using in /src/core/runtime.cc.

* Tune part debug

*Add: recover the code, fixed the commit error.

*Add: some anotations in tune function

* clang formmat test

* Fix: mem leak in CUDA Runtime and Conv

* Fix: sync in conv and default sync in timeit

* Change the way to tune operator conv.

Timeit function cudNNUnfused -> Timeit function cudnnConvolutionForward.

* Change: merge the common part of cudnnunfused&tune into cudnndescriptoraccess

* clang test

* clang-format

* clang-format bash.

* Added operator G2BMM and corresponding testcase.

*Added files related to operator G2BMM creating&calling.

*Added custom_ops.cuh&custom_op.h.

* Add operator GBMML

* new version

* Fix: G2BMM and GBMM kernel bugs

* Added testcase of operator GBMML

* clang format

* Added cmake option REQUIRE_GCC9

* Delete redundent file

* Renamed class GBMML into GBMM

* clang format

* Reviewed.

* Added cudahostcompier option.

* Add: explicit CMAKE_CUDA_HOST_COMPILER

* Rename gbmm kernel

* Fix: nvcc warning in GBMM and G2BMM

Co-authored-by: wcz112 <wcz19@mails.tsinghua.edu.cn>
Co-authored-by: Liyan Zheng <liyan-zheng@outlook.com>
2022-09-08 21:31:35 +08:00
..
core Operators g2bmm&gbmm transplantation (#24) 2022-09-08 21:31:35 +08:00
cuda cuDNN conv tuning (#16) 2022-08-29 21:37:07 +08:00
nnet Chore: format dbg 2022-08-09 20:16:39 +08:00
operators Operators g2bmm&gbmm transplantation (#24) 2022-09-08 21:31:35 +08:00
utils Verify wanghailu 0902 (#22) 2022-09-05 15:45:52 +08:00
custom_ops.cuh Operators g2bmm&gbmm transplantation (#24) 2022-09-08 21:31:35 +08:00
custom_ops.h Operators g2bmm&gbmm transplantation (#24) 2022-09-08 21:31:35 +08:00
test.h Extended DataType class and Runtime interaction (#9) 2022-08-23 16:55:59 +08:00