update readme

hiyouga 2023-08-17 11:00:22 +08:00
parent 892fd39373
commit ff0aa793b6
2 changed files with 14 additions and 14 deletions

@@ -65,12 +65,12 @@
 ## Supported Training Approaches
-| Approach               | Full-parameter | Partial-parameter | LoRA | QLoRA |
-| ---------------------- | -------------- | ----------------- | ---- | ----- |
-| Pre-Training           | ✅             | ✅                | ✅   | ✅    |
-| Supervised Fine-Tuning | ✅             | ✅                | ✅   | ✅    |
-| Reward Modeling        |                |                   | ✅   | ✅    |
-| PPO Training           |                |                   | ✅   | ✅    |
-| DPO Training           | ✅             |                   | ✅   | ✅    |
+| Approach               | Full-parameter     | Partial-parameter  | LoRA               | QLoRA              |
+| ---------------------- | ------------------ | ------------------ | ------------------ | ------------------ |
+| Pre-Training           | :white_check_mark: | :white_check_mark: | :white_check_mark: | :white_check_mark: |
+| Supervised Fine-Tuning | :white_check_mark: | :white_check_mark: | :white_check_mark: | :white_check_mark: |
+| Reward Modeling        |                    |                    | :white_check_mark: | :white_check_mark: |
+| PPO Training           |                    |                    | :white_check_mark: | :white_check_mark: |
+| DPO Training           | :white_check_mark: |                    | :white_check_mark: | :white_check_mark: |
 - Use `--quantization_bit 4/8` argument to enable QLoRA.

@@ -65,12 +65,12 @@
 ## Training Approaches
-| Approach | Full-parameter training | Partial-parameter training | LoRA | QLoRA |
-| ---------- | ---------- | ----------- | ---- | ----- |
-| Pre-training | ✅ | ✅ | ✅ | ✅ |
-| Supervised instruction fine-tuning | ✅ | ✅ | ✅ | ✅ |
-| Reward model training | | | ✅ | ✅ |
-| PPO training | | | ✅ | ✅ |
-| DPO training | ✅ | | ✅ | ✅ |
+| Approach | Full-parameter training | Partial-parameter training | LoRA | QLoRA |
+| ---------------------- | ------------------ | ------------------ | ------------------ | ------------------ |
+| Pre-training | :white_check_mark: | :white_check_mark: | :white_check_mark: | :white_check_mark: |
+| Supervised instruction fine-tuning | :white_check_mark: | :white_check_mark: | :white_check_mark: | :white_check_mark: |
+| Reward model training | | | :white_check_mark: | :white_check_mark: |
+| PPO training | | | :white_check_mark: | :white_check_mark: |
+| DPO training | :white_check_mark: | | :white_check_mark: | :white_check_mark: |
 - Use the `--quantization_bit 4/8` argument to enable QLoRA training.
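
The support table in this diff can also be read as a lookup structure. The following is a minimal sketch, not code from the repository: the names `SUPPORT`, `is_supported`, and `quantization_args` are hypothetical helpers introduced only for illustration. The only detail taken from the README is the `--quantization_bit` flag and its two values, 4 and 8.

```python
# Support matrix transcribed from the README table shown in this commit.
# SUPPORT, is_supported and quantization_args are hypothetical names,
# not part of the project's actual code.
SUPPORT = {
    "Pre-Training": {"Full-parameter", "Partial-parameter", "LoRA", "QLoRA"},
    "Supervised Fine-Tuning": {"Full-parameter", "Partial-parameter", "LoRA", "QLoRA"},
    "Reward Modeling": {"LoRA", "QLoRA"},
    "PPO Training": {"LoRA", "QLoRA"},
    "DPO Training": {"Full-parameter", "LoRA", "QLoRA"},
}


def is_supported(approach: str, method: str) -> bool:
    """Return True if the table marks `method` as available for `approach`."""
    return method in SUPPORT.get(approach, set())


def quantization_args(bits: int) -> list[str]:
    """Build the command-line fragment that the README says enables QLoRA.

    Only 4 and 8 are accepted, matching the `4/8` shown in the README.
    """
    if bits not in (4, 8):
        raise ValueError(f"quantization_bit must be 4 or 8, got {bits}")
    return ["--quantization_bit", str(bits)]


if __name__ == "__main__":
    print(is_supported("DPO Training", "Full-parameter"))
    print(is_supported("PPO Training", "Full-parameter"))
    print(quantization_args(4))
```

A check like `is_supported` could run before launching a job so that an unsupported combination, such as full-parameter PPO training, fails early rather than partway through setup.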