Commit Graph

196 Commits

Author SHA1 Message Date
hiyouga 021b035c1e Update README.md 2023-06-29 19:36:22 +08:00
hiyouga f14bd729a8 rename evaluate.py 2023-06-29 15:40:39 +08:00
hiyouga 23a7266272 Update evaluate.py 2023-06-29 15:40:03 +08:00
hiyouga 70592035b8 Update README.md 2023-06-29 15:37:19 +08:00
hiyouga 3154fec979 add open assistant dataset 2023-06-28 23:09:33 +08:00
hiyouga 4d0fddba21 update loading logic 2023-06-28 12:07:16 +08:00
hiyouga 0a46313cca fix loading best model 2023-06-28 01:55:12 +08:00
hiyouga 7826a8ca77 fix RM accuracy 2023-06-28 01:40:13 +08:00
hiyouga 9cb1af71f3 add star history 2023-06-27 23:56:29 +08:00
hiyouga 450910c1db tiny fix 2023-06-27 23:54:24 +08:00
hiyouga 18f87c1b25 fix initializing data arguments 2023-06-27 22:50:23 +08:00
hiyouga 2e01abfda5 support save full model, replace BOS token 2023-06-27 21:40:11 +08:00
hiyouga 1c732e2537 fix decoding in seq2seq 2023-06-27 19:33:08 +08:00
hiyouga 33f2141507 Update wechat.jpg 2023-06-27 16:41:09 +08:00
hiyouga 4f3772b342 Update evaluate.py 2023-06-26 23:41:33 +08:00
hiyouga 5a0a9daf74 Create evaluate.py 2023-06-26 23:30:18 +08:00
hoshi-hiyouga 907e065454 Merge pull request #86 from Jingsong-Yan/main: Update README.md with baichuan-7b-rtx3090 2023-06-26 20:14:40 +08:00
Jingsong-Yan 90bb5b6f37 Update README.md with baichuan-7b-rtx3090: add a description of the baichuan-7b-rtx3090 branch to the Changelog 2023-06-26 19:45:41 +08:00
hiyouga 993cabdd4c Merge branch 'main' of https://github.com/hiyouga/LLaMA-Efficient-Tuning 2023-06-26 18:07:09 +08:00
hiyouga 1175948029 fix generation in seq2seq.py 2023-06-26 18:07:06 +08:00
hoshi-hiyouga 95b057f5af Merge pull request #84 from wu-yy/patch-1: Update requirements.txt 2023-06-26 15:39:08 +08:00
蓝鲸123 e2a16d549e Update requirements.txt: LlamaTokenizer requires the SentencePiece library but it was not found in your environment. 2023-06-26 15:36:19 +08:00
hiyouga cec9760eb8 support prefixes, loading multiple local files 2023-06-26 15:32:40 +08:00
hiyouga f030b09924 update api 2023-06-26 13:39:57 +08:00
hiyouga d21cc71750 Update wechat.jpg 2023-06-25 23:41:11 +08:00
hiyouga 0697643358 update readme 2023-06-23 00:17:05 +08:00
hiyouga 614d3a996c update API 2023-06-22 20:46:24 +08:00
hiyouga 76ecb8c222 match api with OpenAI format 2023-06-22 20:27:00 +08:00
hoshi-hiyouga 9324940b76 Merge pull request #68 from mMrBun/main: Compatible with OpenAI API. 2023-06-22 15:52:34 +08:00
Bun 6e4db0903f Compatible with OpenAI API. 2023-06-21 14:45:04 +08:00
hiyouga ded5aa3c3d Update wechat.jpg 2023-06-19 19:46:04 +08:00
hiyouga f621f7631a add default template 2023-06-16 21:12:17 +08:00
hiyouga 334d1a6d26 add belle multiturn dataset 2023-06-16 20:01:16 +08:00
hiyouga a6c4b141cd fix freeze layers 2023-06-16 17:38:21 +08:00
hiyouga fc4d8155b3 add source prefix 2023-06-16 16:32:17 +08:00
hiyouga 0574b590ef support loading lora from hub 2023-06-16 00:02:17 +08:00
hiyouga 0cee6ad67f support baichuan model 2023-06-15 16:02:01 +08:00
hiyouga c527399424 fix bug in template vanilla 2023-06-15 14:36:55 +08:00
hiyouga 0a36658bb6 Update wechat.jpg 2023-06-15 13:48:53 +08:00
hiyouga d668f8b501 add BOS token in pre-training 2023-06-15 01:46:17 +08:00
hiyouga b6faf0207d support multiturn training like FastChat 2023-06-14 22:27:39 +08:00
hiyouga 875e8e2349 fix loading valuehead 2023-06-13 11:13:06 +08:00
hiyouga 531a3764d9 fix generating args 2023-06-13 01:33:56 +08:00
hiyouga cec6524d6b support RM metrics, add generating Args 2023-06-12 15:48:48 +08:00
hoshi-hiyouga e3f380c1be Merge pull request #26 from BUAADreamer/main: add code for reading from multi files in one directory 2023-06-11 19:06:29 +08:00
BUAADreamer e3b53a67c7 update json line file to .jsonl 2023-06-11 18:59:19 +08:00
BUAADreamer 676d910260 add some 2023-06-11 18:55:53 +08:00
BUAADreamer a2af9df5a9 add code for reading from multi files in one directory 2023-06-10 16:27:30 +08:00
BUAADreamer 3dd5f9a874 add code for reading from multi files in one directory 2023-06-10 15:53:47 +08:00
hiyouga 2ba5d69c7f tiny fix 2023-06-07 16:42:31 +08:00