Go to file

Anmuliar 0409eafb5f Operators g2bmm&gbmm transplantation (#24 ) * Function tune and corresponding testcase. Add: Tune function in /src/kernel/cuda/conv.cc and corresponding testcase in test_conv. Fix: A little bug of perfRecord using in /src/core/runtime.cc. * Tune part debug Add: recover the code, fixed the commit error. Add: some anotations in tune function * clang formmat test * Fix: mem leak in CUDA Runtime and Conv * Fix: sync in conv and default sync in timeit * Change the way to tune operator conv. Timeit function cudNNUnfused -> Timeit function cudnnConvolutionForward. * Change: merge the common part of cudnnunfused&tune into cudnndescriptoraccess * clang test * clang-format * clang-format bash. * Added operator G2BMM and corresponding testcase. Added files related to operator G2BMM creating&calling. Added custom_ops.cuh&custom_op.h. * Add operator GBMML * new version * Fix: G2BMM and GBMM kernel bugs * Added testcase of operator GBMML * clang format * Added cmake option REQUIRE_GCC9 * Delete redundent file * Renamed class GBMML into GBMM * clang format * Reviewed. * Added cudahostcompier option. * Add: explicit CMAKE_CUDA_HOST_COMPILER * Rename gbmm kernel * Fix: nvcc warning in GBMM and G2BMM Co-authored-by: wcz112 <wcz19@mails.tsinghua.edu.cn> Co-authored-by: Liyan Zheng <liyan-zheng@outlook.com>		2022-09-08 21:31:35 +08:00
.github/workflows	Add: clang format check github action	2022-08-09 17:58:12 +08:00
3rd-party	add code for backtrace (#21 )	2022-09-01 20:30:12 +08:00
include	Operators g2bmm&gbmm transplantation (#24 )	2022-09-08 21:31:35 +08:00
src	Operators g2bmm&gbmm transplantation (#24 )	2022-09-08 21:31:35 +08:00
test	Operators g2bmm&gbmm transplantation (#24 )	2022-09-08 21:31:35 +08:00
.clang-format	Add: graph, tensor, and operator	2022-07-31 21:44:03 +08:00
.cmake-format.json	Add: graph, tensor, and operator	2022-07-31 21:44:03 +08:00
.gitignore	Add: graph, tensor, and operator	2022-07-31 21:44:03 +08:00
.gitmodules	add code for backtrace (#21 )	2022-09-01 20:30:12 +08:00
CMakeLists.txt	Operators g2bmm&gbmm transplantation (#24 )	2022-09-08 21:31:35 +08:00
LICENSE	Initial commit	2022-07-27 22:40:23 +08:00
README.md	Add CUDA runtime (#6 )	2022-08-22 15:01:03 +08:00

README.md

InfiniTensor

Compilation on Lotus

# Enter the root of InfiniTensor
source test/script/env_lotus.sh 
mkdir build && cd build
cmake .. && make -j 12