update readme

hiyouga 2023-10-15 20:28:14 +08:00
parent 25d326e135
commit f5d0da4d2a
2 changed files with 9 additions and 1 deletions

@@ -12,7 +12,9 @@
\[ English | [中文](README_zh.md) \]
Launch an all-in-one Web UI via `python src/train_web.py`.
## Example: Fine-tuning a large language model within 10 minutes
Launch an **all-in-one Web UI** via `python src/train_web.py`.
https://github.com/hiyouga/LLaMA-Factory/assets/16256802/6ba60acc-e2e2-4bec-b846-2d88920d5ba1

@@ -12,6 +12,12 @@
\[ [English](README.md) | 中文 \]
## Example: Fine-tuning a large model within 10 minutes
Launch the **all-in-one training and inference interface** via `python src/train_web.py`.
https://github.com/hiyouga/LLaMA-Factory/assets/16256802/6ba60acc-e2e2-4bec-b846-2d88920d5ba1
## Changelog
[23/09/27] We added support for **$S^2$-Attn**, proposed by [LongLoRA](https://github.com/dvlab-research/LongLoRA), for the LLaMA models. Use the `--shift_attn` argument to enable this feature.