Commit Graph

1164 Commits

Author SHA1 Message Date
hiyouga 7f6e412604 fix requires for windows 2024-04-03 21:56:43 +08:00
hiyouga 148bda353f fix resize vocab at inference #3022 2024-04-03 18:14:24 +08:00
hiyouga ce77d98872 fix #3116 2024-04-03 14:47:59 +08:00
hiyouga f0a9245c7e Update wechat.jpg 2024-04-03 14:42:21 +08:00
hiyouga 49a2dfaf90 update vllm example 2024-04-02 22:45:20 +08:00
hiyouga 66b0fe4e96 update readme 2024-04-02 22:17:48 +08:00
hiyouga fc7f1cc365 update examples 2024-04-02 21:09:25 +08:00
hiyouga 7765f337c7 add zh readme 2024-04-02 20:58:45 +08:00
hiyouga f22eaeb5bc update examples 2024-04-02 20:51:21 +08:00
hiyouga 31ffbde24d update examples 2024-04-02 20:41:49 +08:00
hiyouga 11a6c1bad6 update readme 2024-04-02 20:37:37 +08:00
hiyouga 949e5fe638 update readme 2024-04-02 20:22:11 +08:00
hiyouga 92dab8a90b simplify readme 2024-04-02 20:07:43 +08:00
hiyouga b267aeb53f add moe aux loss control #3085 2024-04-02 14:26:31 +08:00
hiyouga 9ddbe2866a fix #3022 2024-04-02 13:58:39 +08:00
hiyouga a86ae17241 Update SECURITY.md 2024-04-01 23:30:03 +08:00
hiyouga dd73a0c248 set dev version 2024-04-01 23:24:08 +08:00
hiyouga 4a6ca621c0 fix #3083 2024-04-01 22:53:52 +08:00
hiyouga 54b7d34908 add qwen1.5 moe 2024-04-01 21:49:40 +08:00
hiyouga aee634cd20 fix #3077 2024-04-01 21:35:18 +08:00
hiyouga eb259cc573 support infer 4bit model on GPUs #3023 2024-04-01 17:34:04 +08:00
hiyouga d0842f6828 update webui 2024-04-01 16:23:28 +08:00
hiyouga 816d714146 fix ORPO loss 2024-04-01 14:42:41 +08:00
hiyouga 5b9b40403d fix IPO and ORPO loss 2024-04-01 14:37:53 +08:00
hiyouga 5907216a1c fix plots 2024-03-31 19:43:48 +08:00
hiyouga 68aaa4904b use log1p in orpo loss
https://github.com/huggingface/trl/pull/1491
2024-03-31 19:27:08 +08:00
hiyouga 099db6acc0 update readme 2024-03-31 18:46:34 +08:00
hoshi-hiyouga a81d88b780
Merge pull request #3066 from hiyouga/orpo
support ORPO
2024-03-31 18:42:48 +08:00
hiyouga 5195add324 support orpo in webui 2024-03-31 18:34:59 +08:00
hiyouga 17bf8a2c3a support ORPO 2024-03-31 18:29:50 +08:00
hiyouga 27776c3474 tiny fix 2024-03-31 00:10:29 +08:00
hoshi-hiyouga de3564ff70
Merge pull request #3057 from marko1616/bugfix/lora-model-merge
Fix Llama model save for full param train
2024-03-31 00:07:20 +08:00
marko1616 d9a5134617 fix blank line contains whitespace 2024-03-30 23:46:55 +08:00
marko1616 eb178eaff3 Fix Llama model save for full param train 2024-03-30 23:45:04 +08:00
hiyouga 7a086ed333 support save args in webui #2807 #3046
some ideas are borrowed from @marko1616
2024-03-30 23:09:12 +08:00
hoshi-hiyouga 257f643a74
Merge pull request #3053 from lealaxy/main
Fix pile dataset download config
2024-03-30 20:41:43 +08:00
hiyouga 831c5321ac upgrade gradio to 4.21.0 2024-03-30 20:37:08 +08:00
li.yunhao 9c2ef9cdf4 fix pile datset hf hub url 2024-03-30 16:06:10 +08:00
hiyouga a0333bb0ce Update wechat.jpg 2024-03-29 16:55:53 +08:00
hiyouga ca793028c6 release v0.6.1 2024-03-29 11:36:08 +08:00
hiyouga c1fe6ce782 update readme 2024-03-28 22:02:32 +08:00
hiyouga 1e43319f9c add project 2024-03-28 20:24:27 +08:00
hiyouga 8d603f8820 fix #2982 2024-03-28 20:22:31 +08:00
hiyouga 6c94305e47 update readme 2024-03-28 18:35:11 +08:00
hiyouga b19c14870d fix #3010 2024-03-28 18:31:17 +08:00
hiyouga 8c77b10912 update trainers 2024-03-28 18:16:27 +08:00
hoshi-hiyouga 3bcd41b639 fix ds optimizer 2024-03-26 23:39:56 +08:00
hiyouga b29d5560f1 fix #2981 2024-03-26 17:53:04 +08:00
hiyouga 3164b4f11b fix bug 2024-03-26 17:30:12 +08:00
hiyouga 511f675402 fix #2961 2024-03-26 17:26:14 +08:00