InfiniTensor/include/core
zhangyunze 3e6ef305f1
Framework supports graph construction for bert/gpt2 models (#94)
* feat: support sqrt op

* feat: support erf op

* feat: support expand op

* feat: support where op

* fix: gather op index can be int64_t (hard-coded)

* fix: some incorrect usages

* style: fix the format style

* test: add test for change op

* fix: rebase to master

* fix: fix wrong matmul b computation

* add expand and where kernel

* Add int64 support for cuda gather kernel

* add test_where.cc

* add "expand.(cu/cc,test,cuda),modified where.cu"

* Separate initialization of datatypes to avoid compile error

* modify where.(cu/cc/h,test), expand and clip

* Format fix

* Format fix

---------

Co-authored-by: xgqdut2016 <kenan_gewei@163.com>
Co-authored-by: panzezhong <panzezhong@qiyuanlab.com>
Co-authored-by: Haojie Wang <haojie0429@gmail.com>
2023-08-29 16:06:52 +08:00
blob.h Support bang c kernel wanghailu 0927 (#43) 2022-09-30 11:01:52 +08:00
common.h Add ConvTransposedNHWC (#67) 2023-03-01 14:15:02 +08:00
constants.h Dev for 202303ddl (#66) 2023-04-18 15:10:33 +08:00
data_type.h Framework supports graph construction for bert/gpt2 models (#94) 2023-08-29 16:06:52 +08:00
dummy_mutator.h Add search engine (#64) 2023-02-12 18:27:52 +08:00
graph.h memory_allocator (#103) 2023-08-13 13:39:35 +08:00
graph_handler.h Framework supports graph construction for bert/gpt2 models (#94) 2023-08-29 16:06:52 +08:00
graph_match.h ADD: sub graph replacement. (#56) 2023-04-17 13:09:07 +08:00
hash.h Dev for 202303ddl (#66) 2023-04-18 15:10:33 +08:00
kernel.h refactor(core): add new `OpType` definition (#99) 2023-08-07 11:17:05 +08:00
lazy_allocator.h memory_allocator (#103) 2023-08-13 13:39:35 +08:00
mutator.h ADD: add mkl runtime for intel cpu, and add mkl kernel for matmul/conv/convtransposed. (#61) 2023-03-27 21:28:49 +08:00
object.h Dev for 202303ddl (#66) 2023-04-18 15:10:33 +08:00
op_type.h refactor(core): add new `OpType` definition (#99) 2023-08-07 11:17:05 +08:00
operator.h refactor(core): add new `OpType` definition (#99) 2023-08-07 11:17:05 +08:00
perf_engine.h Dev for 202303ddl (#66) 2023-04-18 15:10:33 +08:00
ref.h Dev for 202303ddl (#66) 2023-04-18 15:10:33 +08:00
runtime.h refactor(core): add new `OpType` definition (#99) 2023-08-07 11:17:05 +08:00
search_engine.h Add search engine (#64) 2023-02-12 18:27:52 +08:00
tensor.h cudnn activations support ND-Tensor (#116) 2023-08-22 14:21:59 +08:00
tensor_base.h ADD: sub graph replacement. (#56) 2023-04-17 13:09:07 +08:00