forked from jiuyuan/InfiniTensor
d1a90ba3e2
* [feature] support kvcache with static graph * use workspace to optimize kvcache attention --------- Co-authored-by: Haojie Wang <haojie0429@gmail.com> |
||
---|---|---|
.. | ||
NNmodel@b896cec2db | ||
distributed | ||
python |