From ff0aa793b6750830b3865c439ef64ed129ec9406 Mon Sep 17 00:00:00 2001 From: hiyouga Date: Thu, 17 Aug 2023 11:00:22 +0800 Subject: [PATCH] update readme --- README.md | 14 +++++++------- README_zh.md | 14 +++++++------- 2 files changed, 14 insertions(+), 14 deletions(-) diff --git a/README.md b/README.md index 1124ade2..be43a481 100644 --- a/README.md +++ b/README.md @@ -64,13 +64,13 @@ ## Supported Training Approaches -| Approach | Full-parameter | Partial-parameter | LoRA | QLoRA | -| ---------------------- | -------------- | ----------------- | ---- | ----- | -| Pre-Training | ✅ | ✅ | ✅ | ✅ | -| Supervised Fine-Tuning | ✅ | ✅ | ✅ | ✅ | -| Reward Modeling | | | ✅ | ✅ | -| PPO Training | | | ✅ | ✅ | -| DPO Training | ✅ | | ✅ | ✅ | +| Approach | Full-parameter | Partial-parameter | LoRA | QLoRA | +| ---------------------- | ------------------ | ------------------ | ------------------ | ------------------ | +| Pre-Training | :white_check_mark: | :white_check_mark: | :white_check_mark: | :white_check_mark: | +| Supervised Fine-Tuning | :white_check_mark: | :white_check_mark: | :white_check_mark: | :white_check_mark: | +| Reward Modeling | | | :white_check_mark: | :white_check_mark: | +| PPO Training | | | :white_check_mark: | :white_check_mark: | +| DPO Training | :white_check_mark: | | :white_check_mark: | :white_check_mark: | - Use `--quantization_bit 4/8` argument to enable QLoRA. diff --git a/README_zh.md b/README_zh.md index 5664c4ec..2a84e697 100644 --- a/README_zh.md +++ b/README_zh.md @@ -64,13 +64,13 @@ ## 训练方法 -| 方法 | 全参数训练 | 部分参数训练 | LoRA | QLoRA | -| ---------- | ---------- | ----------- | ---- | ----- | -| 预训练 | ✅ | ✅ | ✅ | ✅ | -| 指令监督微调 | ✅ | ✅ | ✅ | ✅ | -| 奖励模型训练 | | | ✅ | ✅ | -| PPO 训练 | | | ✅ | ✅ | -| DPO 训练 | ✅ | | ✅ | ✅ | +| 方法 | 全参数训练 | 部分参数训练 | LoRA | QLoRA | +| ---------------------- | ------------------ | ------------------ | ------------------ | ------------------ | +| 预训练 | :white_check_mark: | :white_check_mark: | :white_check_mark: | :white_check_mark: | +| 指令监督微调 | :white_check_mark: | :white_check_mark: | :white_check_mark: | :white_check_mark: | +| 奖励模型训练 | | | :white_check_mark: | :white_check_mark: | +| PPO 训练 | | | :white_check_mark: | :white_check_mark: | +| DPO 训练 | :white_check_mark: | | :white_check_mark: | :white_check_mark: | - 使用 `--quantization_bit 4/8` 参数来启用 QLoRA 训练。