Commit Graph

1085 Commits

Author SHA1 Message Date
hiyouga 17bf8a2c3a support ORPO 2024-03-31 18:29:50 +08:00
hiyouga 27776c3474 tiny fix 2024-03-31 00:10:29 +08:00
hoshi-hiyouga de3564ff70 Merge pull request #3057 from marko1616/bugfix/lora-model-merge: Fix Llama model save for full param train 2024-03-31 00:07:20 +08:00
marko1616 d9a5134617 fix blank line contains whitespace 2024-03-30 23:46:55 +08:00
marko1616 eb178eaff3 Fix Llama model save for full param train 2024-03-30 23:45:04 +08:00
hiyouga 7a086ed333 support save args in webui #2807 #3046; some ideas are borrowed from @marko1616 2024-03-30 23:09:12 +08:00
hoshi-hiyouga 257f643a74 Merge pull request #3053 from lealaxy/main: Fix pile dataset download config 2024-03-30 20:41:43 +08:00
hiyouga 831c5321ac upgrade gradio to 4.21.0 2024-03-30 20:37:08 +08:00
li.yunhao 9c2ef9cdf4 fix pile dataset hf hub url 2024-03-30 16:06:10 +08:00
hiyouga a0333bb0ce Update wechat.jpg 2024-03-29 16:55:53 +08:00
hiyouga ca793028c6 release v0.6.1 2024-03-29 11:36:08 +08:00
hiyouga c1fe6ce782 update readme 2024-03-28 22:02:32 +08:00
hiyouga 1e43319f9c add project 2024-03-28 20:24:27 +08:00
hiyouga 8d603f8820 fix #2982 2024-03-28 20:22:31 +08:00
hiyouga 6c94305e47 update readme 2024-03-28 18:35:11 +08:00
hiyouga b19c14870d fix #3010 2024-03-28 18:31:17 +08:00
hiyouga 8c77b10912 update trainers 2024-03-28 18:16:27 +08:00
hoshi-hiyouga 3bcd41b639 fix ds optimizer 2024-03-26 23:39:56 +08:00
hiyouga b29d5560f1 fix #2981 2024-03-26 17:53:04 +08:00
hiyouga 3164b4f11b fix bug 2024-03-26 17:30:12 +08:00
hiyouga 511f675402 fix #2961 2024-03-26 17:26:14 +08:00
hiyouga 7ea1a1f5b3 Update wechat.jpg 2024-03-26 16:24:42 +08:00
hiyouga ba70aca8fb release v0.6.0 (real) 2024-03-25 23:37:48 +08:00
hiyouga 98a42cbdaa tiny fix 2024-03-25 23:28:52 +08:00
hiyouga 7b3d8188f5 update readme 2024-03-25 23:06:13 +08:00
hoshi-hiyouga f633ac6646 Merge pull request #2967 from Tsumugii24/main: Update README_zh.md 2024-03-25 23:02:22 +08:00
Tsumugii24 1704599503 Update README.md 2024-03-25 22:54:38 +08:00
Tsumugii24 7aa77a3451 Update README_zh.md 2024-03-25 22:54:26 +08:00
hiyouga 1484f76a95 add arg check 2024-03-25 22:42:58 +08:00
hiyouga 6f2b563f12 release v0.6.0 2024-03-25 22:38:56 +08:00
Tsumugii24 bb4ca1691a Update README_zh.md 2024-03-25 22:31:03 +08:00
hoshi-hiyouga f33a3dfadc Merge pull request #2963 from rkinas/patch-1: Update requirements.txt 2024-03-25 21:49:34 +08:00
Remek Kinas b02899bf89 Update requirements.txt 2024-03-25 14:30:58 +01:00
hiyouga 558a538724 tiny fix 2024-03-25 21:18:08 +08:00
hoshi-hiyouga 49f9dbb4b1 Merge pull request #2945 from marko1616/bugfix/lora-model-merge: Fix a crash caused by generation config validation when merging LoRA weights into some models with transformers > 4.36.2 2024-03-25 13:36:08 +08:00
marko1616 c8f0d99704 pass ruff check 2024-03-24 16:12:10 +08:00
marko1616 6f080fdba3 fix Llama lora merge crash 2024-03-24 03:06:11 +08:00
marko1616 51349ea1cc fix Llama lora merge crash 2024-03-24 02:55:23 +08:00
marko1616 c1e2c4ea45 fix Llama lora merge crash 2024-03-24 02:44:35 +08:00
hiyouga 140ad4ad56 fix #2936 2024-03-24 00:43:21 +08:00
hiyouga 7afbc85dae fix #2928 2024-03-24 00:34:54 +08:00
hiyouga a1c8c98c5f fix #2941 2024-03-24 00:28:44 +08:00
hiyouga 564d57aa23 Update wechat.jpg 2024-03-22 14:00:37 +08:00
hoshi-hiyouga ce261fdd64 Merge pull request #2919 from 0xez/main: Update README.md, fix the release date of the paper 2024-03-22 12:12:24 +08:00
0xez be0360303d Update README_zh.md, fix the release date of the paper 2024-03-22 10:41:17 +08:00
0xez 675ba41562 Update README.md, fix the release date of the paper 2024-03-21 22:14:48 +08:00
hiyouga 96702620c4 move file 2024-03-21 17:05:17 +08:00
hiyouga 5eaa50fa01 add citation 2024-03-21 17:04:10 +08:00
hiyouga 0581bfdbc7 paper release 2024-03-21 13:49:17 +08:00
hiyouga bfe7a91289 update readme 2024-03-21 00:48:42 +08:00