hiyouga
|
c4facc03af
|
release v0.3.0
|
2023-11-16 16:00:11 +08:00 |
hiyouga
|
72e6699547
|
update readme
|
2023-11-16 15:58:37 +08:00 |
hoshi-hiyouga
|
f04bc2a428
|
Merge #1525 from hiyouga/dev, fix #224 #336 #931 #936 #1011
Refactor llmtuner, support full-parameter RLHF
|
2023-11-16 15:47:13 +08:00 |
hiyouga
|
08f3c11429
|
fix css
|
2023-11-16 15:45:38 +08:00 |
hiyouga
|
6efa38be46
|
fix bug in web ui
|
2023-11-16 15:21:24 +08:00 |
hiyouga
|
7537dd434f
|
update ppo and demo in webui
|
2023-11-16 14:55:26 +08:00 |
hiyouga
|
ff52b1779c
|
fix bug in freeze tuning
|
2023-11-16 14:25:11 +08:00 |
hiyouga
|
83cee2a604
|
tiny fix
|
2023-11-16 03:27:19 +08:00 |
hiyouga
|
1817ffc86f
|
fix rlhf callback
|
2023-11-16 03:26:19 +08:00 |
hiyouga
|
856522a3df
|
fix bug in PPO training
|
2023-11-16 02:32:54 +08:00 |
hiyouga
|
35b91ea34c
|
fix import bug
|
2023-11-16 02:27:03 +08:00 |
hiyouga
|
ce78303600
|
support full-parameter PPO
|
2023-11-16 02:08:04 +08:00 |
hiyouga
|
8350bcf85d
|
add demo mode for web UI
|
2023-11-15 23:51:26 +08:00 |
hoshi-hiyouga
|
01b9f63465
|
Create CODE_OF_CONDUCT.md
|
2023-11-15 20:42:15 +08:00 |
hiyouga
|
1e19cf242a
|
update readme and constants
|
2023-11-15 18:04:37 +08:00 |
hiyouga
|
4907452d95
|
support multiple modules in freeze training #1514
|
2023-11-15 17:08:18 +08:00 |
hiyouga
|
bbbce1f516
|
fix imports
|
2023-11-15 16:47:45 +08:00 |
hiyouga
|
4736344eb1
|
disentangle model from tuner and rename modules
|
2023-11-15 16:29:09 +08:00 |
hiyouga
|
2f02f688e1
|
fix #1507
|
2023-11-15 16:22:32 +08:00 |
hiyouga
|
829e879e04
|
Update cal_lr.py
|
2023-11-14 21:14:42 +08:00 |
hiyouga
|
5619e76dc5
|
Update cal_lr.py
|
2023-11-14 21:13:01 +08:00 |
hiyouga
|
fcb2daf7f3
|
Update cal_lr.py
|
2023-11-14 21:09:30 +08:00 |
hiyouga
|
42c8fc4fb9
|
add cal_lr.py
|
2023-11-14 20:58:37 +08:00 |
hiyouga
|
d125ef5535
|
fix #1494
|
2023-11-14 18:07:20 +08:00 |
hiyouga
|
3743b7420b
|
fix #1489
|
2023-11-14 15:27:05 +08:00 |
hiyouga
|
2d42be32c1
|
support eval remote dataset
|
2023-11-14 02:42:30 +08:00 |
hiyouga
|
88ab33254e
|
fix dc link
|
2023-11-13 23:22:56 +08:00 |
hiyouga
|
35cc1e28f6
|
release v0.2.2, fix #1478 #1466
|
2023-11-13 23:09:05 +08:00 |
hiyouga
|
87390ae3b7
|
fix #424
|
2023-11-13 22:42:23 +08:00 |
hiyouga
|
442aefb925
|
refactor evaluation, upgrade trl to 074
|
2023-11-13 22:20:35 +08:00 |
hiyouga
|
528d91192a
|
Update wechat.jpg
|
2023-11-12 22:34:19 +08:00 |
hiyouga
|
4bd8e3906d
|
fix flashattn warning
|
2023-11-10 18:34:54 +08:00 |
hiyouga
|
a0c31c68c4
|
add todo
|
2023-11-10 14:38:18 +08:00 |
hiyouga
|
3697a3dc9a
|
refactor constants
|
2023-11-10 14:16:10 +08:00 |
hiyouga
|
415bca900e
|
tiny fix
|
2023-11-09 17:20:49 +08:00 |
hoshi-hiyouga
|
462730cbd7
|
Merge pull request #1454 from yyq/main
Update finetuning_args.py
|
2023-11-09 17:12:18 +08:00 |
Yanqing
|
3684dffa14
|
Update finetuning_args.py
更新 chatglm/falcon/bloom 的 lora_target 的名称
|
2023-11-09 17:04:40 +08:00 |
hiyouga
|
0e86527d7f
|
fix #1452
|
2023-11-09 16:41:32 +08:00 |
hiyouga
|
b3572659f5
|
update readme
|
2023-11-09 16:00:24 +08:00 |
hiyouga
|
1db59832fd
|
release v0.2.1
|
2023-11-09 15:54:16 +08:00 |
hiyouga
|
386f590209
|
add template, modify datasets
|
2023-11-09 15:53:23 +08:00 |
hoshi-hiyouga
|
7ca32d8e69
|
Merge pull request #1436 from lvzii/main
fix tokenizer config changed after pretrain
|
2023-11-09 14:30:50 +08:00 |
hiyouga
|
3df90b988b
|
support parquet format #1446
|
2023-11-09 14:17:40 +08:00 |
hiyouga
|
33422e1fef
|
fix #1438 #1439
|
2023-11-09 13:45:10 +08:00 |
lvzi
|
043c316ac8
|
fix tokenizer config changed after pretrain
Changing tokenizer's attribute at preprocessing stage will result in saving a wrong tokenizer.
for example, baichuan2
|
2023-11-08 15:50:46 +08:00 |
hiyouga
|
01260d9754
|
fix ppo train and dpo eval
|
2023-11-07 22:48:51 +08:00 |
hiyouga
|
11c1e1e157
|
fix #1422
|
2023-11-07 19:42:01 +08:00 |
hiyouga
|
c52336d144
|
fix reward model loading
|
2023-11-07 17:20:51 +08:00 |
hiyouga
|
d92f112951
|
fix args
|
2023-11-07 16:36:06 +08:00 |
hiyouga
|
17c64a0579
|
update info
|
2023-11-07 16:28:21 +08:00 |