Go to file
Anmuliar bd63f738dc
cuDNN conv tuning (#16)
* Function tune and corresponding testcase.

*Add: Tune function in /src/kernel/cuda/conv.cc and corresponding testcase in test_conv.

*Fix: A little bug of perfRecord using in /src/core/runtime.cc.

* Tune part debug

*Add: recover the code, fixed the commit error.

*Add: some anotations in tune function

* clang formmat test

* Fix: mem leak in CUDA Runtime and Conv

* Fix: sync in conv and default sync in timeit

* Change the way to tune operator conv.

Timeit function cudNNUnfused -> Timeit function cudnnConvolutionForward.

* Change: merge the common part of cudnnunfused&tune into cudnndescriptoraccess

* clang test

* clang-format

* clang-format bash.

* Chore: remove print and blank lines

Co-authored-by: wcz112 <wcz19@mails.tsinghua.edu.cn>
Co-authored-by: Liyan Zheng <liyan-zheng@outlook.com>
2022-08-29 21:37:07 +08:00
.github/workflows Add: clang format check github action 2022-08-09 17:58:12 +08:00
3rd-party Add: graph, tensor, and operator 2022-07-31 21:44:03 +08:00
include cuDNN conv tuning (#16) 2022-08-29 21:37:07 +08:00
src cuDNN conv tuning (#16) 2022-08-29 21:37:07 +08:00
test cuDNN conv tuning (#16) 2022-08-29 21:37:07 +08:00
.clang-format Add: graph, tensor, and operator 2022-07-31 21:44:03 +08:00
.cmake-format.json Add: graph, tensor, and operator 2022-07-31 21:44:03 +08:00
.gitignore Add: graph, tensor, and operator 2022-07-31 21:44:03 +08:00
.gitmodules Add: graph, tensor, and operator 2022-07-31 21:44:03 +08:00
CMakeLists.txt Revert "Operator serialization (#14)" (#15) 2022-08-29 16:02:48 +08:00
LICENSE Initial commit 2022-07-27 22:40:23 +08:00
README.md Add CUDA runtime (#6) 2022-08-22 15:01:03 +08:00

README.md

InfiniTensor

Compilation on Lotus

# Enter the root of InfiniTensor
source test/script/env_lotus.sh 
mkdir build && cd build
cmake .. && make -j 12