Default Branch

900d8e58e3 · Rope and silu (#214) · Updated 2024-02-04 11:05:27 +08:00

Branches

825b0170c0 · fix broken link in docs · Updated 2024-02-19 17:13:04 +08:00

0
1

936797b960 · support rmsnorm · Updated 2024-02-08 14:58:47 +08:00

0
4

8cc6af0a83 · modify code to pass the cuda_all_reduce test · Updated 2024-02-06 10:53:32 +08:00

0
2

1b9ef0f0ef · Merge branch 'master' into xpu_xccl · Updated 2024-02-05 09:35:57 +08:00

0
52

e7d34badfb · fix format · Updated 2024-01-26 16:11:30 +08:00

14
15

e33131ce5c · fix comment · Updated 2024-01-17 10:57:44 +08:00

14
10

3d98e53b12 · add rope and silu support · Updated 2024-01-11 15:44:07 +08:00

24
41

3b5dd7d28c · Merge branch 'master' into update_pybind11 · Updated 2024-01-05 09:20:33 +08:00

16
2

6a855085d2 · modified kernel · Updated 2024-01-04 16:52:33 +08:00

24
39

dc6befb549 · fix: fix re-dataMalloc for weight tensor and use of naive allocator · Updated 2023-12-29 17:27:36 +08:00

24
39

6458093da4 · fix graph topo & add cublaslt support & others · Updated 2023-12-20 16:33:49 +08:00

37
8

a68ac10107 · Enrich dev doc · Updated 2023-12-05 17:14:28 +08:00

27
3

67974aee8a · Fix https://github.com/InfiniTensor/InfiniTensor/pull/160 (#185) · Updated 2023-11-27 14:18:12 +08:00

27
0
Included

54f4265296 · modified logic · Updated 2023-11-17 17:43:52 +08:00

59
12

965df4e294 · [feature] add fused attention_kvcache operator support (#179) · Updated 2023-11-14 23:44:22 +08:00

33
0
Included

0a5d273130 · Add: print derivation steps for conv2gemm · Updated 2023-11-10 23:16:44 +08:00

34
1

295450e5f4 · Add: show conv2gemm derivation · Updated 2023-11-10 22:49:07 +08:00

88
96

9272d709da · add a simple mempory pool for allocator · Updated 2023-10-19 12:36:01 +08:00

44
1

ee6dd3deac · update test · Updated 2023-10-12 14:07:23 +08:00

48
5

7c484d72b4 · Merge branch 'master' into change_path · Updated 2023-10-12 09:16:12 +08:00

48
2