Peter Pan
|
b0ca8fe634
|
add rm dataset explanation
Signed-off-by: Peter Pan <Peter.Pan@daocloud.io>
|
2023-08-22 01:33:59 -04:00 |
hoshi-hiyouga
|
bc7795655f
|
Merge pull request #619 from hiyouga/feature-templateTest
add template encode test
|
2023-08-21 20:56:34 +08:00 |
codemayq
|
cbbee7933e
|
add template encode test
|
2023-08-21 20:51:24 +08:00 |
hiyouga
|
5235b15c91
|
fix #617
|
2023-08-21 18:16:11 +08:00 |
hiyouga
|
02d69b6fde
|
fix #608
|
2023-08-21 17:49:36 +08:00 |
hiyouga
|
0a3f698425
|
fix baichuan template for training #597 #616
|
2023-08-21 17:41:51 +08:00 |
hiyouga
|
5c052836a0
|
fix #595
|
2023-08-20 16:40:00 +08:00 |
hoshi-hiyouga
|
1968d9d1d0
|
Merge pull request #596 from beat4ocean/beat
fix KeyError: 'lang' bug
|
2023-08-20 16:37:40 +08:00 |
beat4ocean
|
7b45de6b9f
|
fix KeyError: 'lang' bug
|
2023-08-20 15:32:36 +08:00 |
hiyouga
|
0676497104
|
fix ppo trainer #551
|
2023-08-20 14:07:11 +08:00 |
hiyouga
|
290be836b7
|
Update wechat.jpg
|
2023-08-19 18:03:36 +08:00 |
hiyouga
|
9c9009f49f
|
Release v0.1.7
|
2023-08-18 17:21:27 +08:00 |
hiyouga
|
d75e377b0f
|
tiny fix
|
2023-08-18 13:07:35 +08:00 |
hiyouga
|
53e33418d0
|
support ppo score norm (trl 0.5.1.dev required)
|
2023-08-18 12:02:42 +08:00 |
hiyouga
|
9020524418
|
fix PPO trainer #551 , update readme
|
2023-08-18 11:43:10 +08:00 |
hiyouga
|
e4eec9ddfd
|
update readme
|
2023-08-18 01:51:55 +08:00 |
hiyouga
|
10cd6c9171
|
Update .gitignore
|
2023-08-18 01:43:42 +08:00 |
hiyouga
|
58f13e22da
|
update training resuming
|
2023-08-18 01:41:17 +08:00 |
hoshi-hiyouga
|
7926432d27
|
Merge pull request #434 from niuba/main
add last_checkpoint support
|
2023-08-18 01:38:31 +08:00 |
hoshi-hiyouga
|
7252903245
|
Merge branch 'main' into main
|
2023-08-18 01:37:23 +08:00 |
hiyouga
|
d125218cde
|
support bf16 ppo #551
|
2023-08-18 00:40:32 +08:00 |
hiyouga
|
9f4c2adc9a
|
fix ChatGLM2 ppo #527 #528
|
2023-08-18 00:34:59 +08:00 |
hiyouga
|
be21fc83f9
|
fix generation bug #532
|
2023-08-17 22:21:34 +08:00 |
hiyouga
|
b0ed0dec5e
|
fix streaming in pt stage #548 #549
|
2023-08-17 17:59:26 +08:00 |
hiyouga
|
ff0aa793b6
|
update readme
|
2023-08-17 11:00:22 +08:00 |
hiyouga
|
892fd39373
|
fix baichuan and intern template
|
2023-08-17 01:27:20 +08:00 |
hiyouga
|
d9e62711a3
|
fix generation
|
2023-08-16 22:39:54 +08:00 |
hiyouga
|
7407d9daa1
|
fix system prompt
|
2023-08-16 01:35:52 +08:00 |
hiyouga
|
273135f595
|
fix baichuan template #481
|
2023-08-15 11:38:21 +08:00 |
hoshi-hiyouga
|
7f35487c4a
|
Merge pull request #516 from liuyanyi/add_gitignore
[Enhance] Add .gitignore file
|
2023-08-15 11:25:40 +08:00 |
hiyouga
|
af6c011fcb
|
fix ChatGLM RLHF
|
2023-08-15 11:19:20 +08:00 |
hiyouga
|
a7dd9611db
|
Update wechat.jpg
|
2023-08-15 11:13:46 +08:00 |
Yanyi Liu
|
448478f938
|
Add .gitignore
|
2023-08-15 11:13:45 +08:00 |
hiyouga
|
80b4053602
|
alert pad_token source
|
2023-08-15 00:07:56 +08:00 |
hiyouga
|
9d0f6214b6
|
update webui
|
2023-08-14 22:45:26 +08:00 |
hoshi-hiyouga
|
adb0f186e9
|
Merge pull request #511 from hiyouga/feature-autoTemplate
add template match and stage in webui
|
2023-08-14 22:44:04 +08:00 |
codemayq
|
0bf892ff1a
|
auto match template when change model_name
|
2023-08-14 20:56:05 +08:00 |
codemayq
|
79c68e5527
|
add template match and stage in webui
|
2023-08-14 20:42:59 +08:00 |
hiyouga
|
d019956808
|
fix ChatGLM lm_head #494
|
2023-08-14 14:14:48 +08:00 |
hiyouga
|
20a29297b1
|
fix bug in webui
|
2023-08-14 11:38:42 +08:00 |
hiyouga
|
ca08e5efd3
|
fix webui cache
|
2023-08-14 11:37:01 +08:00 |
hiyouga
|
2391a84e26
|
update readme_zh
|
2023-08-14 11:13:25 +08:00 |
hiyouga
|
ec94274ca1
|
web UI integrating RLHF
|
2023-08-14 10:48:47 +08:00 |
hiyouga
|
2f2fd55d81
|
fix #480
|
2023-08-14 00:23:56 +08:00 |
hiyouga
|
d69b1388e6
|
fix webui
|
2023-08-12 23:52:07 +08:00 |
hiyouga
|
9dc6a296e3
|
tiny fix
|
2023-08-12 22:02:43 +08:00 |
hiyouga
|
8545c11c45
|
fix rope scaling
|
2023-08-12 22:00:01 +08:00 |
hiyouga
|
8a79ded55d
|
update readme
|
2023-08-12 21:29:06 +08:00 |
hiyouga
|
3ea1fa35d1
|
update readme
|
2023-08-12 21:25:19 +08:00 |
hiyouga
|
2618e0b5a7
|
update readme
|
2023-08-12 21:23:05 +08:00 |