LLaMA-Factory-310P3/examples/README.md

44 lines
1.9 KiB
Markdown
Raw Normal View History

2024-04-02 20:37:37 +08:00
We provide diverse examples about fine-tuning LLMs.
```
examples/
├── lora_single_gpu/
2024-04-02 21:09:25 +08:00
│ ├── pretrain.sh: Do pre-training
2024-04-02 20:51:21 +08:00
│ ├── sft.sh: Do supervised fine-tuning
│ ├── reward.sh: Do reward modeling
│ ├── ppo.sh: Do PPO training
│ ├── dpo.sh: Do DPO training
│ ├── orpo.sh: Do ORPO training
2024-04-02 20:37:37 +08:00
│ ├── prepare.sh: Save tokenized dataset
2024-04-02 20:51:21 +08:00
│ └── predict.sh: Do batch predict
2024-04-02 20:37:37 +08:00
├── qlora_single_gpu/
2024-04-02 20:51:21 +08:00
│ ├── bitsandbytes.sh: Fine-tune 4/8-bit BNB models
│ ├── gptq.sh: Fine-tune 4/8-bit GPTQ models
│ ├── awq.sh: Fine-tune 4-bit AWQ models
│ └── aqlm.sh: Fine-tune 2-bit AQLM models
2024-04-02 20:37:37 +08:00
├── lora_multi_gpu/
2024-04-02 20:51:21 +08:00
│ ├── single_node.sh: Fine-tune model with Accelerate on single node
│ └── multi_node.sh: Fine-tune model with Accelerate on multiple nodes
2024-04-02 20:37:37 +08:00
├── full_multi_gpu/
2024-04-02 20:51:21 +08:00
│ ├── single_node.sh: Fine-tune model with DeepSpeed on single node
│ └── multi_node.sh: Fine-tune model with DeepSpeed on multiple nodes
2024-04-02 20:37:37 +08:00
├── merge_lora/
2024-04-02 20:51:21 +08:00
│ ├── merge.sh: Merge LoRA weights into the pre-trained models
│ └── quantize.sh: Quantize fine-tuned model with AutoGPTQ
2024-04-02 20:37:37 +08:00
├── inference/
2024-04-02 20:51:21 +08:00
│ ├── cli_demo.sh: Launch a command line interface
│ ├── api_demo.sh: Launch an OpenAI-style API
│ ├── web_demo.sh: Launch a web interface
│ └── evaluate.sh: Evaluate model on the MMLU benchmark
2024-04-02 20:37:37 +08:00
└── extras/
├── galore/
2024-04-02 20:51:21 +08:00
│ └── sft.sh: Fine-tune model with GaLore
2024-04-02 20:37:37 +08:00
├── loraplus/
2024-04-02 20:51:21 +08:00
│ └── sft.sh: Fine-tune model with LoRA+
2024-04-02 20:37:37 +08:00
├── llama_pro/
2024-04-02 20:51:21 +08:00
│ ├── expand.sh: Expand layers in the model
│ └── sft.sh: Fine-tune expanded model
2024-04-02 20:37:37 +08:00
└── fsdp_qlora/
2024-04-02 20:51:21 +08:00
└── sft.sh: Fine-tune quantized model with FSDP
2024-04-02 20:37:37 +08:00
```