Commit Graph

518 Commits

Author SHA1 Message Date
hiyouga 528d91192a Update wechat.jpg 2023-11-12 22:34:19 +08:00
hiyouga 4bd8e3906d fix flashattn warning 2023-11-10 18:34:54 +08:00
hiyouga a0c31c68c4 add todo 2023-11-10 14:38:18 +08:00
hiyouga 3697a3dc9a refactor constants 2023-11-10 14:16:10 +08:00
hiyouga 415bca900e tiny fix 2023-11-09 17:20:49 +08:00
hoshi-hiyouga 462730cbd7
Merge pull request #1454 from yyq/main
Update finetuning_args.py
2023-11-09 17:12:18 +08:00
Yanqing 3684dffa14
Update finetuning_args.py
更新 chatglm/falcon/bloom 的 lora_target 的名称
2023-11-09 17:04:40 +08:00
hiyouga 0e86527d7f fix #1452 2023-11-09 16:41:32 +08:00
hiyouga b3572659f5 update readme 2023-11-09 16:00:24 +08:00
hiyouga 1db59832fd release v0.2.1 2023-11-09 15:54:16 +08:00
hiyouga 386f590209 add template, modify datasets 2023-11-09 15:53:23 +08:00
hoshi-hiyouga 7ca32d8e69
Merge pull request #1436 from lvzii/main
fix tokenizer config changed after pretrain
2023-11-09 14:30:50 +08:00
hiyouga 3df90b988b support parquet format #1446 2023-11-09 14:17:40 +08:00
hiyouga 33422e1fef fix #1438 #1439 2023-11-09 13:45:10 +08:00
lvzi 043c316ac8
fix tokenizer config changed after pretrain
Changing tokenizer's attribute at preprocessing stage will result in saving a wrong tokenizer.
for example, baichuan2
2023-11-08 15:50:46 +08:00
hiyouga 01260d9754 fix ppo train and dpo eval 2023-11-07 22:48:51 +08:00
hiyouga 11c1e1e157 fix #1422 2023-11-07 19:42:01 +08:00
hiyouga c52336d144 fix reward model loading 2023-11-07 17:20:51 +08:00
hiyouga d92f112951 fix args 2023-11-07 16:36:06 +08:00
hiyouga 17c64a0579 update info 2023-11-07 16:28:21 +08:00
hiyouga 479d0af2dc delete file 2023-11-07 16:20:12 +08:00
hiyouga 7ebd63a609 fix #1418 2023-11-07 16:17:22 +08:00
hiyouga b2a60905f3 upgrade peft, fix #1088 #1411 2023-11-07 16:13:36 +08:00
hiyouga 66a91e1fe3 update requirements 2023-11-06 19:01:21 +08:00
hiyouga de95b69282 use seed in evaluate.py 2023-11-06 18:17:51 +08:00
hiyouga e1e04cb1f1 update readme (list in alphabetical order) 2023-11-06 17:18:12 +08:00
hiyouga a7eeb8e17c update templates 2023-11-06 12:25:47 +08:00
hiyouga 2e77a5718a fix #1383 2023-11-06 11:42:23 +08:00
hiyouga d08f5e8a14 fix deepseek template 2023-11-05 13:08:46 +08:00
hiyouga 2a8a258195 support deepseek coder #1378 2023-11-05 12:51:03 +08:00
hiyouga 63ff909310 fix #1365 2023-11-05 12:21:07 +08:00
hiyouga 5227e18c44 Update wechat.jpg 2023-11-05 10:25:59 +08:00
hiyouga 05d9fc7eff tiny fix 2023-11-03 01:26:06 +08:00
hiyouga eb9d9e104a fix #1290 2023-11-03 00:44:53 +08:00
hiyouga b355f6cac9 fix bug in data loader, support dpo eval 2023-11-03 00:34:26 +08:00
hiyouga 2b5e33c338 update data readme 2023-11-03 00:15:23 +08:00
hiyouga cc8ffa10d8 update data readme (zh) 2023-11-02 23:42:49 +08:00
hiyouga a837172413 support sharegpt format, add datasets 2023-11-02 23:10:04 +08:00
hiyouga c1edb0cf1b support pagination in webui preview 2023-11-02 21:21:45 +08:00
hiyouga 34d8b2e56c fix webui 2023-11-02 18:03:14 +08:00
hiyouga 9cde5e8af6 support warning in webui 2023-11-02 17:57:04 +08:00
hiyouga f8703aac08 fix #1349 2023-11-02 17:02:44 +08:00
hiyouga dff128c7e3 fix #1356 2023-11-02 16:51:52 +08:00
hiyouga 083787dbfe fix #1325 2023-11-01 23:38:49 +08:00
hiyouga 8b912690e3 fix chat 2023-11-01 23:07:58 +08:00
hiyouga 84af10cec9 update gradio, support multiple resp in api 2023-11-01 23:02:16 +08:00
hiyouga d8cf8cfdeb fix SFT trainer 2023-10-31 21:52:52 +08:00
hiyouga f4e4a04529 fix #1316 2023-10-31 11:32:08 +08:00
hiyouga 9093cb1a2e Update wechat.jpg 2023-10-30 14:01:08 +08:00
hiyouga 640a520108 update projects 2023-10-29 22:53:47 +08:00