hoshi-hiyouga
|
7ca32d8e69
|
Merge pull request #1436 from lvzii/main
fix tokenizer config changed after pretrain
|
2023-11-09 14:30:50 +08:00 |
hiyouga
|
3df90b988b
|
support parquet format #1446
|
2023-11-09 14:17:40 +08:00 |
hiyouga
|
33422e1fef
|
fix #1438 #1439
|
2023-11-09 13:45:10 +08:00 |
lvzi
|
043c316ac8
|
fix tokenizer config changed after pretrain
Changing tokenizer's attribute at preprocessing stage will result in saving a wrong tokenizer.
for example, baichuan2
|
2023-11-08 15:50:46 +08:00 |
hiyouga
|
01260d9754
|
fix ppo train and dpo eval
|
2023-11-07 22:48:51 +08:00 |
hiyouga
|
11c1e1e157
|
fix #1422
|
2023-11-07 19:42:01 +08:00 |
hiyouga
|
c52336d144
|
fix reward model loading
|
2023-11-07 17:20:51 +08:00 |
hiyouga
|
d92f112951
|
fix args
|
2023-11-07 16:36:06 +08:00 |
hiyouga
|
17c64a0579
|
update info
|
2023-11-07 16:28:21 +08:00 |
hiyouga
|
479d0af2dc
|
delete file
|
2023-11-07 16:20:12 +08:00 |
hiyouga
|
7ebd63a609
|
fix #1418
|
2023-11-07 16:17:22 +08:00 |
hiyouga
|
b2a60905f3
|
upgrade peft, fix #1088 #1411
|
2023-11-07 16:13:36 +08:00 |
hiyouga
|
66a91e1fe3
|
update requirements
|
2023-11-06 19:01:21 +08:00 |
hiyouga
|
de95b69282
|
use seed in evaluate.py
|
2023-11-06 18:17:51 +08:00 |
hiyouga
|
e1e04cb1f1
|
update readme (list in alphabetical order)
|
2023-11-06 17:18:12 +08:00 |
hiyouga
|
a7eeb8e17c
|
update templates
|
2023-11-06 12:25:47 +08:00 |
hiyouga
|
2e77a5718a
|
fix #1383
|
2023-11-06 11:42:23 +08:00 |
hiyouga
|
d08f5e8a14
|
fix deepseek template
|
2023-11-05 13:08:46 +08:00 |
hiyouga
|
2a8a258195
|
support deepseek coder #1378
|
2023-11-05 12:51:03 +08:00 |
hiyouga
|
63ff909310
|
fix #1365
|
2023-11-05 12:21:07 +08:00 |
hiyouga
|
5227e18c44
|
Update wechat.jpg
|
2023-11-05 10:25:59 +08:00 |
hiyouga
|
05d9fc7eff
|
tiny fix
|
2023-11-03 01:26:06 +08:00 |
hiyouga
|
eb9d9e104a
|
fix #1290
|
2023-11-03 00:44:53 +08:00 |
hiyouga
|
b355f6cac9
|
fix bug in data loader, support dpo eval
|
2023-11-03 00:34:26 +08:00 |
hiyouga
|
2b5e33c338
|
update data readme
|
2023-11-03 00:15:23 +08:00 |
hiyouga
|
cc8ffa10d8
|
update data readme (zh)
|
2023-11-02 23:42:49 +08:00 |
hiyouga
|
a837172413
|
support sharegpt format, add datasets
|
2023-11-02 23:10:04 +08:00 |
hiyouga
|
c1edb0cf1b
|
support pagination in webui preview
|
2023-11-02 21:21:45 +08:00 |
hiyouga
|
34d8b2e56c
|
fix webui
|
2023-11-02 18:03:14 +08:00 |
hiyouga
|
9cde5e8af6
|
support warning in webui
|
2023-11-02 17:57:04 +08:00 |
hiyouga
|
f8703aac08
|
fix #1349
|
2023-11-02 17:02:44 +08:00 |
hiyouga
|
dff128c7e3
|
fix #1356
|
2023-11-02 16:51:52 +08:00 |
hiyouga
|
083787dbfe
|
fix #1325
|
2023-11-01 23:38:49 +08:00 |
hiyouga
|
8b912690e3
|
fix chat
|
2023-11-01 23:07:58 +08:00 |
hiyouga
|
84af10cec9
|
update gradio, support multiple resp in api
|
2023-11-01 23:02:16 +08:00 |
hiyouga
|
d8cf8cfdeb
|
fix SFT trainer
|
2023-10-31 21:52:52 +08:00 |
hiyouga
|
f4e4a04529
|
fix #1316
|
2023-10-31 11:32:08 +08:00 |
hiyouga
|
9093cb1a2e
|
Update wechat.jpg
|
2023-10-30 14:01:08 +08:00 |
hiyouga
|
640a520108
|
update projects
|
2023-10-29 22:53:47 +08:00 |
hiyouga
|
59f342e76f
|
add projects
|
2023-10-29 22:07:13 +08:00 |
hiyouga
|
f28a034a9b
|
update constants
|
2023-10-29 13:30:20 +08:00 |
hiyouga
|
52fc24d166
|
fix vicuna template
|
2023-10-27 22:15:25 +08:00 |
hiyouga
|
4117f38827
|
fix chatglm3 template
|
2023-10-27 21:12:06 +08:00 |
hiyouga
|
4600c29e93
|
update readme
|
2023-10-27 19:19:03 +08:00 |
hiyouga
|
1c0ab9a908
|
support chatglm3
|
2023-10-27 19:16:28 +08:00 |
hiyouga
|
3fe7df628d
|
support dataset cache
|
2023-10-26 21:48:45 +08:00 |
hiyouga
|
838ed9aa87
|
fix #1287
|
2023-10-26 17:49:41 +08:00 |
hiyouga
|
aff9363ce3
|
fix #1285
|
2023-10-26 16:34:52 +08:00 |
hiyouga
|
d357e08b58
|
Update wechat.jpg
|
2023-10-24 16:02:12 +08:00 |
hiyouga
|
2caf91f824
|
remove filter in preprocess
|
2023-10-23 23:46:02 +08:00 |