LLaMA-Factory-Mirror/examples/README.md

We provide diverse examples about fine-tuning LLMs.

```
examples/
├── lora_single_gpu/
│   ├── pretrain.sh: Do continuous pre-training using LoRA
│   ├── sft.sh: Do supervised fine-tuning using LoRA
│   ├── reward.sh: Do reward modeling using LoRA
│   ├── ppo.sh: Do PPO training using LoRA
│   ├── dpo.sh: Do DPO training using LoRA
│   ├── orpo.sh: Do ORPO training using LoRA
│   ├── prepare.sh: Save tokenized dataset
│   └── predict.sh: Do batch predict and compute BLEU and ROUGE scores after LoRA tuning
├── qlora_single_gpu/
│   ├── bitsandbytes.sh: Fine-tune 4/8-bit BNB models using QLoRA
│   ├── gptq.sh: Fine-tune 4/8-bit GPTQ models using QLoRA
│   ├── awq.sh: Fine-tune 4-bit AWQ models using QLoRA
│   └── aqlm.sh: Fine-tune 2-bit AQLM models using QLoRA
├── lora_multi_gpu/
│   ├── single_node.sh: Fine-tune model with Accelerate on single node using LoRA
│   ├── multi_node.sh: Fine-tune model with Accelerate on multiple nodes using LoRA
│   └── ds_zero3.sh: Fine-tune model with DeepSpeed ZeRO-3 using LoRA (weight sharding)
├── full_multi_gpu/
│   ├── single_node.sh: Full fine-tune model with DeepSpeed on single node
│   ├── multi_node.sh: Full fine-tune model with DeepSpeed on multiple nodes
│   └── predict.sh: Do parallel batch predict and compute BLEU and ROUGE scores after full tuning
├── merge_lora/
│   ├── merge.sh: Merge LoRA weights into the pre-trained models
│   └── quantize.sh: Quantize the fine-tuned model with AutoGPTQ
├── inference/
│   ├── cli_demo.sh: Chat with fine-tuned model in the CLI with LoRA adapters
│   ├── api_demo.sh: Chat with fine-tuned model in an OpenAI-style API with LoRA adapters
│   ├── web_demo.sh: Chat with fine-tuned model in the Web browser with LoRA adapters
│   └── evaluate.sh: Evaluate model on the MMLU/CMMLU/C-Eval benchmarks with LoRA adapters
└── extras/
    ├── galore/
    │   └── sft.sh: Fine-tune model with GaLore
    ├── badam/
    │   └── sft.sh: Fine-tune model with BAdam
    ├── loraplus/
    │   └── sft.sh: Fine-tune model using LoRA+
    ├── mod/
    │   └── sft.sh: Fine-tune model using Mixture-of-Depths
    ├── llama_pro/
    │   ├── expand.sh: Expand layers in the model
    │   └── sft.sh: Fine-tune the expanded model
    └── fsdp_qlora/
        └── sft.sh: Fine-tune quantized model with FSDP+QLoRA
```
update readme 2024-04-02 20:37:37 +08:00			`We provide diverse examples about fine-tuning LLMs.`

			```
			`examples/`
			`├── lora_single_gpu/`
support badam for all stages 2024-04-16 17:44:48 +08:00			`│ ├── pretrain.sh: Do continuous pre-training using LoRA`
update examples 2024-04-15 22:14:34 +08:00			`│ ├── sft.sh: Do supervised fine-tuning using LoRA`
			`│ ├── reward.sh: Do reward modeling using LoRA`
			`│ ├── ppo.sh: Do PPO training using LoRA`
			`│ ├── dpo.sh: Do DPO training using LoRA`
			`│ ├── orpo.sh: Do ORPO training using LoRA`
update readme 2024-04-02 20:37:37 +08:00			`│ ├── prepare.sh: Save tokenized dataset`
update examples 2024-04-15 22:14:34 +08:00			`│ └── predict.sh: Do batch predict and compute BLEU and ROUGE scores after LoRA tuning`
update readme 2024-04-02 20:37:37 +08:00			`├── qlora_single_gpu/`
update examples 2024-04-15 22:14:34 +08:00			`│ ├── bitsandbytes.sh: Fine-tune 4/8-bit BNB models using QLoRA`
			`│ ├── gptq.sh: Fine-tune 4/8-bit GPTQ models using QLoRA`
			`│ ├── awq.sh: Fine-tune 4-bit AWQ models using QLoRA`
			`│ └── aqlm.sh: Fine-tune 2-bit AQLM models using QLoRA`
update readme 2024-04-02 20:37:37 +08:00			`├── lora_multi_gpu/`
update examples 2024-04-15 22:14:34 +08:00			`│ ├── single_node.sh: Fine-tune model with Accelerate on single node using LoRA`
update readme and examples 2024-04-22 00:37:32 +08:00			`│ ├── multi_node.sh: Fine-tune model with Accelerate on multiple nodes using LoRA`
update examples 2024-04-23 18:29:46 +08:00			`│ └── ds_zero3.sh: Fine-tune model with DeepSpeed ZeRO-3 using LoRA (weight sharding)`
update readme 2024-04-02 20:37:37 +08:00			`├── full_multi_gpu/`
update examples 2024-04-15 22:14:34 +08:00			`│ ├── single_node.sh: Full fine-tune model with DeepSpeed on single node`
			`│ ├── multi_node.sh: Full fine-tune model with DeepSpeed on multiple nodes`
update examples 2024-04-23 18:29:46 +08:00			`│ └── predict.sh: Do parallel batch predict and compute BLEU and ROUGE scores after full tuning`
update readme 2024-04-02 20:37:37 +08:00			`├── merge_lora/`
update examples 2024-04-02 20:51:21 +08:00			`│ ├── merge.sh: Merge LoRA weights into the pre-trained models`
update examples 2024-04-15 22:14:34 +08:00			`│ └── quantize.sh: Quantize the fine-tuned model with AutoGPTQ`
update readme 2024-04-02 20:37:37 +08:00			`├── inference/`
add export_device in webui #3333 2024-04-25 19:02:32 +08:00			`│ ├── cli_demo.sh: Chat with fine-tuned model in the CLI with LoRA adapters`
			`│ ├── api_demo.sh: Chat with fine-tuned model in an OpenAI-style API with LoRA adapters`
			`│ ├── web_demo.sh: Chat with fine-tuned model in the Web browser with LoRA adapters`
update examples 2024-04-15 22:14:34 +08:00			`│ └── evaluate.sh: Evaluate model on the MMLU/CMMLU/C-Eval benchmarks with LoRA adapters`
update readme 2024-04-02 20:37:37 +08:00			`└── extras/`
			`├── galore/`
update examples 2024-04-02 20:51:21 +08:00			`│ └── sft.sh: Fine-tune model with GaLore`
support badam for all stages 2024-04-16 17:44:48 +08:00			`├── badam/`
			`│ └── sft.sh: Fine-tune model with BAdam`
update readme 2024-04-02 20:37:37 +08:00			`├── loraplus/`
update examples 2024-04-15 22:14:34 +08:00			`│ └── sft.sh: Fine-tune model using LoRA+`
fix mod stuff 2024-04-21 18:11:10 +08:00			`├── mod/`
			`│ └── sft.sh: Fine-tune model using Mixture-of-Depths`
update readme 2024-04-02 20:37:37 +08:00			`├── llama_pro/`
update examples 2024-04-02 20:51:21 +08:00			`│ ├── expand.sh: Expand layers in the model`
update examples 2024-04-15 22:14:34 +08:00			`│ └── sft.sh: Fine-tune the expanded model`
update readme 2024-04-02 20:37:37 +08:00			`└── fsdp_qlora/`
update examples 2024-04-15 22:14:34 +08:00			`└── sft.sh: Fine-tune quantized model with FSDP+QLoRA`
update readme 2024-04-02 20:37:37 +08:00			```