update readme
This commit is contained in:
parent
892fd39373
commit
ff0aa793b6
12
README.md
12
README.md
|
@ -65,12 +65,12 @@
|
||||||
## Supported Training Approaches
|
## Supported Training Approaches
|
||||||
|
|
||||||
| Approach | Full-parameter | Partial-parameter | LoRA | QLoRA |
|
| Approach | Full-parameter | Partial-parameter | LoRA | QLoRA |
|
||||||
| ---------------------- | -------------- | ----------------- | ---- | ----- |
|
| ---------------------- | ------------------ | ------------------ | ------------------ | ------------------ |
|
||||||
| Pre-Training | ✅ | ✅ | ✅ | ✅ |
|
| Pre-Training | :white_check_mark: | :white_check_mark: | :white_check_mark: | :white_check_mark: |
|
||||||
| Supervised Fine-Tuning | ✅ | ✅ | ✅ | ✅ |
|
| Supervised Fine-Tuning | :white_check_mark: | :white_check_mark: | :white_check_mark: | :white_check_mark: |
|
||||||
| Reward Modeling | | | ✅ | ✅ |
|
| Reward Modeling | | | :white_check_mark: | :white_check_mark: |
|
||||||
| PPO Training | | | ✅ | ✅ |
|
| PPO Training | | | :white_check_mark: | :white_check_mark: |
|
||||||
| DPO Training | ✅ | | ✅ | ✅ |
|
| DPO Training | :white_check_mark: | | :white_check_mark: | :white_check_mark: |
|
||||||
|
|
||||||
- Use `--quantization_bit 4/8` argument to enable QLoRA.
|
- Use `--quantization_bit 4/8` argument to enable QLoRA.
|
||||||
|
|
||||||
|
|
12
README_zh.md
12
README_zh.md
|
@ -65,12 +65,12 @@
|
||||||
## 训练方法
|
## 训练方法
|
||||||
|
|
||||||
| 方法 | 全参数训练 | 部分参数训练 | LoRA | QLoRA |
|
| 方法 | 全参数训练 | 部分参数训练 | LoRA | QLoRA |
|
||||||
| ---------- | ---------- | ----------- | ---- | ----- |
|
| ---------------------- | ------------------ | ------------------ | ------------------ | ------------------ |
|
||||||
| 预训练 | ✅ | ✅ | ✅ | ✅ |
|
| 预训练 | :white_check_mark: | :white_check_mark: | :white_check_mark: | :white_check_mark: |
|
||||||
| 指令监督微调 | ✅ | ✅ | ✅ | ✅ |
|
| 指令监督微调 | :white_check_mark: | :white_check_mark: | :white_check_mark: | :white_check_mark: |
|
||||||
| 奖励模型训练 | | | ✅ | ✅ |
|
| 奖励模型训练 | | | :white_check_mark: | :white_check_mark: |
|
||||||
| PPO 训练 | | | ✅ | ✅ |
|
| PPO 训练 | | | :white_check_mark: | :white_check_mark: |
|
||||||
| DPO 训练 | ✅ | | ✅ | ✅ |
|
| DPO 训练 | :white_check_mark: | | :white_check_mark: | :white_check_mark: |
|
||||||
|
|
||||||
- 使用 `--quantization_bit 4/8` 参数来启用 QLoRA 训练。
|
- 使用 `--quantization_bit 4/8` 参数来启用 QLoRA 训练。
|
||||||
|
|
||||||
|
|
Loading…
Reference in New Issue