forked from jiuyuan/InfiniTensor
0dc5347089
* - add LazyAllocator class
  - calculate memory consumption at present
* - basic function of lazy_allocator, remaining test
* - modify LazyAllocator
* - modify InfiniTensor to fit LazyAllocator
* - add setDataBlob
  - modify alignment
  - fix GraphObj::dataMalloc
* - modified alignment value (64 bytes -> 8 bytes)
  - fix LazyAllocator::getPtr()
  - some debug codes and comments
  - do alignment by changing size instead of tailAddr
* - fix some problem
* - translate Chinese comments to English
* - format codes
* - fix test
* - code format
* - modify codes as YdrMaser and bitzyz suggested
* - code format
* - modify codes as constroy suggested
* - codes format
* - modify alignment on cuda
* - code format
* - add test_lazy_allocator
  - fix tests that did not add the input tensor to graph.tensors
  - fix tests that initialized a tensor's data before calling graph->dataMallocate()
* - code format
* - remove gpu runtime in test_lazy_allocator
* - fix test_lazy_allocator: remove cuda include
* - add test
* - code format
* - add ifdef for test of allocator
* - code format
* - fix test: remove unused ifdef
* - fix bang test
* - code format
* Merge branch 'master' into dcj/memory_allocator
* fix: fix cuda conv_fp16 run fail
* fix bang_runtime.cc and cuda_runtime.cc
* - update mkl code
* - fix codes for mkl
* - code format
* - remove unused commented codes
  - add an empty line at the end of the blob.cc

Co-authored-by: zhangyunze <z13785159769@163.com>
||
---|---|---|
blob.h | ||
common.h | ||
constants.h | ||
data_type.h | ||
dummy_mutator.h | ||
graph.h | ||
graph_handler.h | ||
graph_match.h | ||
hash.h | ||
kernel.h | ||
lazy_allocator.h | ||
mutator.h | ||
object.h | ||
op_type.h | ||
operator.h | ||
perf_engine.h | ||
ref.h | ||
runtime.h | ||
search_engine.h | ||
tensor.h | ||
tensor_base.h |