hiyouga | cb42676694 | update readme | 2023-10-13 13:53:43 +08:00
hiyouga | c4102f306a | update discord link | 2023-10-12 21:44:28 +08:00
hiyouga | 197c754d73 | rename repository | 2023-10-12 21:42:29 +08:00
hiyouga | 8e2ed6b8ce | update readme | 2023-10-09 20:02:50 +08:00
hiyouga | d11a545463 | fix #1068 #1074 | 2023-09-28 14:39:16 +08:00
hiyouga | 4eae061464 | update readme | 2023-09-27 21:57:47 +08:00
hiyouga | 90375f600d | support LongLoRA | 2023-09-27 21:55:50 +08:00
hiyouga | 4dd9b4d982 | add CMMLU, update eval script | 2023-09-23 21:10:17 +08:00
hiyouga | badd2735b5 | move file | 2023-09-23 11:52:12 +08:00
hiyouga | 465ee8119a | add MMLU and C-Eval script | 2023-09-23 00:34:17 +08:00
hiyouga | 5cc7a44784 | fix #1000 | 2023-09-22 15:00:48 +08:00
hiyouga | 044d4425b4 | update readme | 2023-09-22 14:34:13 +08:00
hiyouga | ace3f85a72 | tiny fix | 2023-09-21 15:25:29 +08:00
hiyouga | acda45e463 | update readme | 2023-09-16 17:33:01 +08:00
hiyouga | 026af87e7f | add MathInstruct dataset | 2023-09-13 22:30:14 +08:00
hiyouga | d4be857e23 | fix #762 #814 | 2023-09-12 16:10:10 +08:00
hiyouga | ccb3553576 | Release v0.1.8 | 2023-09-11 17:31:34 +08:00
hiyouga | baac22f4f4 | truncate readme | 2023-09-10 21:04:20 +08:00
hiyouga | 63611de7ae | update readme | 2023-09-10 21:01:20 +08:00
hiyouga | 34005252df | update readme | 2023-09-10 20:52:21 +08:00
hiyouga | d8aa1404be | support FlashAttention2 | 2023-09-10 20:43:56 +08:00
hiyouga | bca1a247bc | support lora target auto find | 2023-09-09 15:38:37 +08:00
hiyouga | d8d82ca281 | fix chatglm2 tokenizer | 2023-09-09 13:50:29 +08:00
hiyouga | 85b1f6632a | fix baichuan templates | 2023-09-07 18:54:14 +08:00
hiyouga | 0531886e1f | update baichuan2 template | 2023-09-06 21:43:06 +08:00
hiyouga | 60603a94c6 | add Baichuan2 models | 2023-09-06 18:40:11 +08:00
hiyouga | a9d1fb72f7 | refactor dataset_attr, add eos in pt, fix #757 | 2023-09-01 19:00:45 +08:00
codemayq | 604f85487b | add ad gen dataset | 2023-08-27 20:35:32 +08:00
hiyouga | 4318347d3f | update template | 2023-08-22 19:46:09 +08:00
hiyouga | 9020524418 | fix PPO trainer #551 , update readme | 2023-08-18 11:43:10 +08:00
hiyouga | e4eec9ddfd | update readme | 2023-08-18 01:51:55 +08:00
hiyouga | 58f13e22da | update training resuming | 2023-08-18 01:41:17 +08:00
hiyouga | ff0aa793b6 | update readme | 2023-08-17 11:00:22 +08:00
hiyouga | 2391a84e26 | update readme_zh | 2023-08-14 11:13:25 +08:00
hiyouga | 8a79ded55d | update readme | 2023-08-12 21:29:06 +08:00
hiyouga | 3ea1fa35d1 | update readme | 2023-08-12 21:25:19 +08:00
hiyouga | 2618e0b5a7 | update readme | 2023-08-12 21:23:05 +08:00
hiyouga | 1836c020c5 | update readme | 2023-08-12 21:00:11 +08:00
hiyouga | 156710a995 | Update README_zh.md | 2023-08-11 14:06:02 +08:00
hiyouga | 3ec4351cfd | support DPO training (2305.18290) | 2023-08-11 03:02:53 +08:00
hiyouga | 20cf27976f | update readme | 2023-08-07 15:02:02 +08:00
codemayq | 293bd95712 | add detailed model configs | 2023-08-07 09:30:23 +08:00
hiyouga | 87f8f830e2 | support Qwen-7B, fix InternLM-7B inference | 2023-08-03 15:53:32 +08:00
hiyouga | c689857bbb | release v0.1.5 | 2023-08-02 16:10:31 +08:00
hiyouga | ccde51c5ea | update readme | 2023-08-01 18:48:27 +08:00
hiyouga | ac88ce5233 | fix RM save model | 2023-08-01 11:56:17 +08:00
hiyouga | 973a638665 | release v0.1.4 | 2023-08-01 10:08:47 +08:00
hiyouga | 62dca5bb82 | update readme | 2023-07-31 23:42:32 +08:00
hiyouga | 0411a4b3e1 | support streaming data, fix #284 #274 #268 | 2023-07-31 23:33:00 +08:00
hiyouga | 5ee87138e4 | update readme | 2023-07-28 17:36:00 +08:00