Update README.md

hoshi-hiyouga 2023-10-15 20:23:22 +08:00 committed by GitHub
parent a6a04be2e6
commit 25d326e135
1 changed file with 4 additions and 0 deletions

@@ -12,6 +12,10 @@
\[ English | [中文](README_zh.md) \]
Launch an all-in-one Web UI via `python src/train_web.py`.
https://github.com/hiyouga/LLaMA-Factory/assets/16256802/6ba60acc-e2e2-4bec-b846-2d88920d5ba1
## Changelog
[23/09/27] We added support for **$S^2$-Attn**, proposed by [LongLoRA](https://github.com/dvlab-research/LongLoRA), for the LLaMA models. Use the `--shift_attn` argument to enable shift short attention.
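
To illustrate what "shift short attention" means, here is a minimal sketch of the grouping idea as LongLoRA describes it: tokens are split into fixed-size groups, and in half of the attention heads the sequence is rolled by half a group before grouping, so those heads connect neighbouring groups. The helper `group_ids` and its parameters are hypothetical names chosen for this illustration; this is not the code used in this repository.

```python
def group_ids(seq_len: int, group_size: int, shifted: bool) -> list[int]:
    """Return, for each token index, the id of the attention group it falls in.

    Hypothetical helper for illustration only; not part of LLaMA-Factory.
    """
    if seq_len % group_size != 0:
        raise ValueError("seq_len must be a multiple of group_size")
    # Shifted heads roll the sequence by half a group before grouping,
    # so token i lands at position (i - shift) mod seq_len.
    shift = group_size // 2 if shifted else 0
    return [((i - shift) % seq_len) // group_size for i in range(seq_len)]


if __name__ == "__main__":
    print(group_ids(8, 4, shifted=False))  # [0, 0, 0, 0, 1, 1, 1, 1]
    print(group_ids(8, 4, shifted=True))   # [1, 1, 0, 0, 0, 0, 1, 1]
```

In the shifted pattern, tokens 2 to 5 share a group, which bridges the two unshifted groups. The wrap-around group pairs the first and last tokens, a case a real implementation has to handle through masking.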