InfiniTensor/examples
constroy Li feccd4f318
fix tensor parallel for llama (#159)
* fix Slice

* change default rounds of timeit to 10 to reduce time

* fix slice with large ends

* Reshape support Int64

* support position_ids as input

* skip last MatMul in Llama

* skip infer_shapes to parse large model

* update launch.py

* fix split_concat_kernel

* print more message in launch.py

* Reshape supports both Int32 and Int64

* try infer_shapes and warn about failure

* fix format

---------

Co-authored-by: whjthu <haojie0429@gmail.com>
2023-10-30 15:04:16 +08:00
..
NNmodel@51d3105277 impl distributed launch with NCCL (#106) 2023-09-05 09:47:35 +08:00
distributed fix tensor parallel for llama (#159) 2023-10-30 15:04:16 +08:00
python add python examples (#143) 2023-09-28 10:40:45 +08:00