Commit Graph

284 Commits

Author SHA1 Message Date
hiyouga d640c5545f Update callbacks.py 2023-07-15 17:39:16 +08:00
hiyouga 1e2b7e0c4b Update README.md 2023-07-15 17:20:39 +08:00
hiyouga 22d9a9c2af fix callback 2023-07-15 17:18:16 +08:00
hiyouga f751376613 modity code structure 2023-07-15 16:54:28 +08:00
hiyouga 2a0f1f8398 Update wechat.jpg 2023-07-14 17:29:43 +08:00
hiyouga c30db9f1f0 fix eval and pred loss 2023-07-14 13:11:57 +08:00
hiyouga a04115ec27 fix pretrain 2023-07-13 23:41:54 +08:00
hiyouga 08439d29b2 fix Baichuan-13B 2023-07-13 23:08:45 +08:00
hoshi-hiyouga 8cd76ef3c3 Merge pull request #156 from ZhengJun-AI/main: Support for WebNovel dataset 2023-07-12 20:11:19 +08:00
zxbsmk 4955dc9eed Support for WebNovel dataset 2023-07-12 17:29:47 +08:00
hoshi-hiyouga 894f13e41f Merge pull request #145 from elicassion/patch-1: Fix typo in common.py 2023-07-12 13:50:39 +08:00
Jinghuan Shang dc1e8b7181 Fix typo in common.py: lastest -> latest 2023-07-11 18:03:53 -04:00
hiyouga b2f7cb4465 fix sft encode 2023-07-11 19:50:33 +08:00
hiyouga 1af031c02b add baichuan template 2023-07-11 18:57:50 +08:00
hiyouga f936a7af0b support Baichuan-13B 2023-07-11 16:16:14 +08:00
hiyouga 8447206bbc Update README.md 2023-07-10 23:09:11 +08:00
hiyouga 061c324972 Update wechat.jpg 2023-07-10 18:41:53 +08:00
hiyouga 4182c7aa8b Update README.md 2023-07-09 14:57:13 +08:00
hiyouga 84a06318d4 update api to match langchain 2023-07-07 20:35:39 +08:00
hiyouga 233f20864b Update README.md 2023-07-07 12:06:28 +08:00
hiyouga a2f507c562 support InternLM 2023-07-07 11:02:28 +08:00
hiyouga caa00d3ac2 fix rouge score 2023-07-06 14:28:34 +08:00
hiyouga 89c623e4bf update readme 2023-07-05 23:03:58 +08:00
hiyouga 4abd2485e1 fix streaming response in API 2023-07-05 22:42:31 +08:00
hiyouga e6603977f6 fix freeze tuning 2023-07-05 21:18:28 +08:00
hiyouga a2ba69183b fix bug in PPO stage 2023-07-05 19:14:10 +08:00
hiyouga 8e3540c62d fix compute dtype 2023-07-05 15:13:00 +08:00
hiyouga c136f362c1 support falcon model #72 2023-07-05 15:00:06 +08:00
hiyouga 966b5c70fc Update wechat.jpg 2023-07-05 00:22:22 +08:00
hiyouga cac87fd553 fix bleu score 2023-07-05 00:11:21 +08:00
hiyouga 395ed1cf1b set use_cache before saving model 2023-07-04 23:18:20 +08:00
hiyouga 65e9ce2cdd fix seq2seq predictions 2023-07-04 22:56:51 +08:00
hoshi-hiyouga cb26f78923 Merge pull request #119 from codemayq/main: add the pre-built version of bitsandbytes library for windows user 2023-07-03 19:51:46 +08:00
codemayq d3b30ecde3 add the pre-built version of bitsandbytes library for windows user 2023-07-03 13:58:10 +08:00
hiyouga 0db9d29111 Update auto_gptq.py 2023-07-02 20:56:11 +08:00
hiyouga cf6d57fd3e add autogptq 2023-07-02 20:36:37 +08:00
hiyouga b8e1f09a2e Update wechat.jpg 2023-06-30 15:45:20 +08:00
hiyouga 92fa515e97 fix typo 2023-06-30 10:09:59 +08:00
hiyouga 021b035c1e Update README.md 2023-06-29 19:36:22 +08:00
hiyouga f14bd729a8 rename evaluate.py 2023-06-29 15:40:39 +08:00
hiyouga 23a7266272 Update evaluate.py 2023-06-29 15:40:03 +08:00
hiyouga 70592035b8 Update README.md 2023-06-29 15:37:19 +08:00
hiyouga 3154fec979 add open assistant dataset 2023-06-28 23:09:33 +08:00
hiyouga 4d0fddba21 update loading logic 2023-06-28 12:07:16 +08:00
hiyouga 0a46313cca fix loading best model 2023-06-28 01:55:12 +08:00
hiyouga 7826a8ca77 fix RM accuracy 2023-06-28 01:40:13 +08:00
hiyouga 9cb1af71f3 add star history 2023-06-27 23:56:29 +08:00
hiyouga 450910c1db tiny fix 2023-06-27 23:54:24 +08:00
hiyouga 18f87c1b25 fix initializing data arguments 2023-06-27 22:50:23 +08:00
hiyouga 2e01abfda5 support save full model, replace BOS token 2023-06-27 21:40:11 +08:00