Commit Graph

73 Commits

Author SHA1 Message Date
hiyouga d8aa1404be support FlashAttention2 2023-09-10 20:43:56 +08:00
hiyouga bca1a247bc support lora target auto find 2023-09-09 15:38:37 +08:00
hiyouga d8d82ca281 fix chatglm2 tokenizer 2023-09-09 13:50:29 +08:00
hiyouga 85b1f6632a fix baichuan templates 2023-09-07 18:54:14 +08:00
hiyouga 0531886e1f update baichuan2 template 2023-09-06 21:43:06 +08:00
hiyouga 60603a94c6 add Baichuan2 models 2023-09-06 18:40:11 +08:00
hiyouga a9d1fb72f7 refactor dataset_attr, add eos in pt, fix #757 2023-09-01 19:00:45 +08:00
codemayq 604f85487b add ad gen dataset 2023-08-27 20:35:32 +08:00
hiyouga 4318347d3f update template 2023-08-22 19:46:09 +08:00
hiyouga 9020524418 fix PPO trainer #551 , update readme 2023-08-18 11:43:10 +08:00
hiyouga e4eec9ddfd update readme 2023-08-18 01:51:55 +08:00
hiyouga 58f13e22da update training resuming 2023-08-18 01:41:17 +08:00
hiyouga ff0aa793b6 update readme 2023-08-17 11:00:22 +08:00
hiyouga ec94274ca1 web UI integrating RLHF 2023-08-14 10:48:47 +08:00
hiyouga 8a79ded55d update readme 2023-08-12 21:29:06 +08:00
hiyouga 2618e0b5a7 update readme 2023-08-12 21:23:05 +08:00
hiyouga 1836c020c5 update readme 2023-08-12 21:00:11 +08:00
hiyouga a48cb0d474 Release v0.1.6 2023-08-11 23:25:57 +08:00
hiyouga 3ec4351cfd support DPO training (2305.18290) 2023-08-11 03:02:53 +08:00
hiyouga 20cf27976f update readme 2023-08-07 15:02:02 +08:00
codemayq 293bd95712 add detailed model configs 2023-08-07 09:30:23 +08:00
hiyouga 87f8f830e2 support Qwen-7B, fix InternLM-7B inference 2023-08-03 15:53:32 +08:00
hiyouga c689857bbb release v0.1.5 2023-08-02 16:10:31 +08:00
hiyouga ccde51c5ea update readme 2023-08-01 18:48:27 +08:00
hiyouga ac88ce5233 fix RM save model 2023-08-01 11:56:17 +08:00
hiyouga 973a638665 release v0.1.4 2023-08-01 10:08:47 +08:00
hiyouga 62dca5bb82 update readme 2023-07-31 23:42:32 +08:00
hiyouga 0411a4b3e1 support streaming data, fix #284 #274 #268 2023-07-31 23:33:00 +08:00
hiyouga 5ee87138e4 update readme 2023-07-28 17:36:00 +08:00
hiyouga f5c2ccdde4 update dataset 2023-07-26 17:05:12 +08:00
hiyouga 00efa8a07f fix #242 2023-07-25 17:04:02 +08:00
hiyouga 182b425043 update dataset 2023-07-23 20:01:43 +08:00
hiyouga 035c966d5c update readme, fix web ui postprocess 2023-07-22 14:29:22 +08:00
mrhan1993 9f0b57b370 根据GLM Efficient Tuning添加中文README,web添加了server_port 2023-07-21 16:57:58 +08:00
hiyouga c3fcb67486 Update README.md 2023-07-20 17:23:16 +08:00
hiyouga 7159bc54ed add datasets 2023-07-19 20:59:15 +08:00
hiyouga 7a3ade8c69 support LLaMA-2 2023-07-19 16:42:14 +08:00
hiyouga b447fa85aa add web demo 2023-07-18 17:21:16 +08:00
hiyouga f8193e8009 release v0.1.0 2023-07-18 00:18:25 +08:00
hiyouga 1e2b7e0c4b Update README.md 2023-07-15 17:20:39 +08:00
hiyouga f751376613 modity code structure 2023-07-15 16:54:28 +08:00
hiyouga 08439d29b2 fix Baichuan-13B 2023-07-13 23:08:45 +08:00
zxbsmk 4955dc9eed Support for WebNovel dataset 2023-07-12 17:29:47 +08:00
hiyouga 1af031c02b add baichuan template 2023-07-11 18:57:50 +08:00
hiyouga f936a7af0b support Baichuan-13B 2023-07-11 16:16:14 +08:00
hiyouga 8447206bbc Update README.md 2023-07-10 23:09:11 +08:00
hiyouga 4182c7aa8b Update README.md 2023-07-09 14:57:13 +08:00
hiyouga 233f20864b Update README.md 2023-07-07 12:06:28 +08:00
hiyouga a2f507c562 support InternLM 2023-07-07 11:02:28 +08:00
hiyouga 89c623e4bf update readme 2023-07-05 23:03:58 +08:00