forked from jiuyuan/InfiniTensor
feccd4f318
* fix Slice * change default rounds of timeit to 10 to reduce time * fix slice with large ends * Reshape support Int64 * support position_ids as input * skip last MatMul in Llama * skip infer_shapes to parse large model * update launch.py * fix split_concat_kernel * print more message in launch.py * Reshape supports both Int32 and Int64 * try infer_shapes and warn about failure * fix format --------- Co-authored-by: whjthu <haojie0429@gmail.com> |
||
---|---|---|
.. | ||
launch.py | ||
launch_kvcache.py | ||
parallel.py | ||
parallel_opt.py | ||
placement.py |