hiyouga
|
d8aa1404be
|
support FlashAttention2
|
2023-09-10 20:43:56 +08:00 |
hiyouga
|
bca1a247bc
|
support lora target auto find
|
2023-09-09 15:38:37 +08:00 |
hiyouga
|
d8d82ca281
|
fix chatglm2 tokenizer
|
2023-09-09 13:50:29 +08:00 |
hiyouga
|
85b1f6632a
|
fix baichuan templates
|
2023-09-07 18:54:14 +08:00 |
hiyouga
|
0531886e1f
|
update baichuan2 template
|
2023-09-06 21:43:06 +08:00 |
hiyouga
|
60603a94c6
|
add Baichuan2 models
|
2023-09-06 18:40:11 +08:00 |
hiyouga
|
a9d1fb72f7
|
refactor dataset_attr, add eos in pt, fix #757
|
2023-09-01 19:00:45 +08:00 |
codemayq
|
604f85487b
|
add ad gen dataset
|
2023-08-27 20:35:32 +08:00 |
hiyouga
|
4318347d3f
|
update template
|
2023-08-22 19:46:09 +08:00 |
hiyouga
|
9020524418
|
fix PPO trainer #551 , update readme
|
2023-08-18 11:43:10 +08:00 |
hiyouga
|
e4eec9ddfd
|
update readme
|
2023-08-18 01:51:55 +08:00 |
hiyouga
|
58f13e22da
|
update training resuming
|
2023-08-18 01:41:17 +08:00 |
hiyouga
|
ff0aa793b6
|
update readme
|
2023-08-17 11:00:22 +08:00 |
hiyouga
|
ec94274ca1
|
web UI integrating RLHF
|
2023-08-14 10:48:47 +08:00 |
hiyouga
|
8a79ded55d
|
update readme
|
2023-08-12 21:29:06 +08:00 |
hiyouga
|
2618e0b5a7
|
update readme
|
2023-08-12 21:23:05 +08:00 |
hiyouga
|
1836c020c5
|
update readme
|
2023-08-12 21:00:11 +08:00 |
hiyouga
|
a48cb0d474
|
Release v0.1.6
|
2023-08-11 23:25:57 +08:00 |
hiyouga
|
3ec4351cfd
|
support DPO training (2305.18290)
|
2023-08-11 03:02:53 +08:00 |
hiyouga
|
20cf27976f
|
update readme
|
2023-08-07 15:02:02 +08:00 |
codemayq
|
293bd95712
|
add detailed model configs
|
2023-08-07 09:30:23 +08:00 |
hiyouga
|
87f8f830e2
|
support Qwen-7B, fix InternLM-7B inference
|
2023-08-03 15:53:32 +08:00 |
hiyouga
|
c689857bbb
|
release v0.1.5
|
2023-08-02 16:10:31 +08:00 |
hiyouga
|
ccde51c5ea
|
update readme
|
2023-08-01 18:48:27 +08:00 |
hiyouga
|
ac88ce5233
|
fix RM save model
|
2023-08-01 11:56:17 +08:00 |
hiyouga
|
973a638665
|
release v0.1.4
|
2023-08-01 10:08:47 +08:00 |
hiyouga
|
62dca5bb82
|
update readme
|
2023-07-31 23:42:32 +08:00 |
hiyouga
|
0411a4b3e1
|
support streaming data, fix #284 #274 #268
|
2023-07-31 23:33:00 +08:00 |
hiyouga
|
5ee87138e4
|
update readme
|
2023-07-28 17:36:00 +08:00 |
hiyouga
|
f5c2ccdde4
|
update dataset
|
2023-07-26 17:05:12 +08:00 |
hiyouga
|
00efa8a07f
|
fix #242
|
2023-07-25 17:04:02 +08:00 |
hiyouga
|
182b425043
|
update dataset
|
2023-07-23 20:01:43 +08:00 |
hiyouga
|
035c966d5c
|
update readme, fix web ui postprocess
|
2023-07-22 14:29:22 +08:00 |
mrhan1993
|
9f0b57b370
|
add Chinese README based on GLM Efficient Tuning, add server_port to web
|
2023-07-21 16:57:58 +08:00 |
hiyouga
|
c3fcb67486
|
Update README.md
|
2023-07-20 17:23:16 +08:00 |
hiyouga
|
7159bc54ed
|
add datasets
|
2023-07-19 20:59:15 +08:00 |
hiyouga
|
7a3ade8c69
|
support LLaMA-2
|
2023-07-19 16:42:14 +08:00 |
hiyouga
|
b447fa85aa
|
add web demo
|
2023-07-18 17:21:16 +08:00 |
hiyouga
|
f8193e8009
|
release v0.1.0
|
2023-07-18 00:18:25 +08:00 |
hiyouga
|
1e2b7e0c4b
|
Update README.md
|
2023-07-15 17:20:39 +08:00 |
hiyouga
|
f751376613
|
modify code structure
|
2023-07-15 16:54:28 +08:00 |
hiyouga
|
08439d29b2
|
fix Baichuan-13B
|
2023-07-13 23:08:45 +08:00 |
zxbsmk
|
4955dc9eed
|
Support for WebNovel dataset
|
2023-07-12 17:29:47 +08:00 |
hiyouga
|
1af031c02b
|
add baichuan template
|
2023-07-11 18:57:50 +08:00 |
hiyouga
|
f936a7af0b
|
support Baichuan-13B
|
2023-07-11 16:16:14 +08:00 |
hiyouga
|
8447206bbc
|
Update README.md
|
2023-07-10 23:09:11 +08:00 |
hiyouga
|
4182c7aa8b
|
Update README.md
|
2023-07-09 14:57:13 +08:00 |
hiyouga
|
233f20864b
|
Update README.md
|
2023-07-07 12:06:28 +08:00 |
hiyouga
|
a2f507c562
|
support InternLM
|
2023-07-07 11:02:28 +08:00 |
hiyouga
|
89c623e4bf
|
update readme
|
2023-07-05 23:03:58 +08:00 |