LLaMA-Factory-310P3/examples/lora_single_gpu
hiyouga d1587c80de update examples 2024-03-06 13:14:57 +08:00
README.md add examples 2024-03-05 03:16:35 +08:00
dpo.sh update examples 2024-03-06 13:14:57 +08:00
ppo.sh update examples 2024-03-06 13:14:57 +08:00
predict.sh update examples 2024-03-06 13:14:57 +08:00
pretrain.sh update examples 2024-03-06 13:14:57 +08:00
reward.sh update examples 2024-03-06 13:14:57 +08:00
sft.sh update examples 2024-03-06 13:14:57 +08:00

README.md

Usage (each line below is one workflow; run its scripts in order, left to right):

  • pretrain.sh
  • sft.sh -> reward.sh -> ppo.sh
  • sft.sh -> dpo.sh -> predict.sh
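
The three workflows above could be driven by a small wrapper like the sketch below. It is illustrative only: `run_chain` is a hypothetical helper, not part of this repository, and it assumes the scripts are invoked from inside `examples/lora_single_gpu` with no arguments. It stops at the first script that fails so that a later stage does not start from a missing or partial checkpoint.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of the repository): run the given scripts
# in order and stop at the first one that exits with a non-zero status.
# Assumption: the scripts are in the current directory and take no arguments.
run_chain() {
  local script
  for script in "$@"; do
    echo "running ${script}"
    bash "${script}" || { echo "failed: ${script}" >&2; return 1; }
  done
}

# One call per usage line (uncomment the workflow you want):
# run_chain pretrain.sh
# run_chain sft.sh reward.sh ppo.sh
# run_chain sft.sh dpo.sh predict.sh
```

Whether a later stage picks up the output of an earlier one depends on the paths set inside each script, so check those before chaining them.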