Commit Graph

562 Commits

Author SHA1 Message Date
Peter Pan b0ca8fe634 add rm dataset explanation
Signed-off-by: Peter Pan <Peter.Pan@daocloud.io>
2023-08-22 01:33:59 -04:00
hoshi-hiyouga bc7795655f
Merge pull request #619 from hiyouga/feature-templateTest
add template encode test
2023-08-21 20:56:34 +08:00
codemayq cbbee7933e add template encode test 2023-08-21 20:51:24 +08:00
hiyouga 5235b15c91 fix #617 2023-08-21 18:16:11 +08:00
hiyouga 02d69b6fde fix #608 2023-08-21 17:49:36 +08:00
hiyouga 0a3f698425 fix baichuan template for training #597 #616 2023-08-21 17:41:51 +08:00
hiyouga 5c052836a0 fix #595 2023-08-20 16:40:00 +08:00
hoshi-hiyouga 1968d9d1d0
Merge pull request #596 from beat4ocean/beat
fix KeyError: 'lang' bug
2023-08-20 16:37:40 +08:00
beat4ocean 7b45de6b9f fix KeyError: 'lang' bug 2023-08-20 15:32:36 +08:00
hiyouga 0676497104 fix ppo trainer #551 2023-08-20 14:07:11 +08:00
hiyouga 290be836b7 Update wechat.jpg 2023-08-19 18:03:36 +08:00
hiyouga 9c9009f49f Release v0.1.7 2023-08-18 17:21:27 +08:00
hiyouga d75e377b0f tiny fix 2023-08-18 13:07:35 +08:00
hiyouga 53e33418d0 support ppo score norm (trl 0.5.1.dev required) 2023-08-18 12:02:42 +08:00
hiyouga 9020524418 fix PPO trainer #551 , update readme 2023-08-18 11:43:10 +08:00
hiyouga e4eec9ddfd update readme 2023-08-18 01:51:55 +08:00
hiyouga 10cd6c9171 Update .gitignore 2023-08-18 01:43:42 +08:00
hiyouga 58f13e22da update training resuming 2023-08-18 01:41:17 +08:00
hoshi-hiyouga 7926432d27
Merge pull request #434 from niuba/main
add last_checkpoint support
2023-08-18 01:38:31 +08:00
hoshi-hiyouga 7252903245
Merge branch 'main' into main 2023-08-18 01:37:23 +08:00
hiyouga d125218cde support bf16 ppo #551 2023-08-18 00:40:32 +08:00
hiyouga 9f4c2adc9a fix ChatGLM2 ppo #527 #528 2023-08-18 00:34:59 +08:00
hiyouga be21fc83f9 fix generation bug #532 2023-08-17 22:21:34 +08:00
hiyouga b0ed0dec5e fix streaming in pt stage #548 #549 2023-08-17 17:59:26 +08:00
hiyouga ff0aa793b6 update readme 2023-08-17 11:00:22 +08:00
hiyouga 892fd39373 fix baichuan and intern template 2023-08-17 01:27:20 +08:00
hiyouga d9e62711a3 fix generation 2023-08-16 22:39:54 +08:00
hiyouga 7407d9daa1 fix system prompt 2023-08-16 01:35:52 +08:00
hiyouga 273135f595 fix baichuan template #481 2023-08-15 11:38:21 +08:00
hoshi-hiyouga 7f35487c4a
Merge pull request #516 from liuyanyi/add_gitignore
[Enhance] Add .gitignore file
2023-08-15 11:25:40 +08:00
hiyouga af6c011fcb fix ChatGLM RLHF 2023-08-15 11:19:20 +08:00
hiyouga a7dd9611db Update wechat.jpg 2023-08-15 11:13:46 +08:00
Yanyi Liu 448478f938
Add .gitignore 2023-08-15 11:13:45 +08:00
hiyouga 80b4053602 alert pad_token source 2023-08-15 00:07:56 +08:00
hiyouga 9d0f6214b6 update webui 2023-08-14 22:45:26 +08:00
hoshi-hiyouga adb0f186e9
Merge pull request #511 from hiyouga/feature-autoTemplate
add template match and stage in webui
2023-08-14 22:44:04 +08:00
codemayq 0bf892ff1a auto match template when change model_name 2023-08-14 20:56:05 +08:00
codemayq 79c68e5527 add template match and stage in webui 2023-08-14 20:42:59 +08:00
hiyouga d019956808 fix ChatGLM lm_head #494 2023-08-14 14:14:48 +08:00
hiyouga 20a29297b1 fix bug in webui 2023-08-14 11:38:42 +08:00
hiyouga ca08e5efd3 fix webui cache 2023-08-14 11:37:01 +08:00
hiyouga 2391a84e26 update readme_zh 2023-08-14 11:13:25 +08:00
hiyouga ec94274ca1 web UI integrating RLHF 2023-08-14 10:48:47 +08:00
hiyouga 2f2fd55d81 fix #480 2023-08-14 00:23:56 +08:00
hiyouga d69b1388e6 fix webui 2023-08-12 23:52:07 +08:00
hiyouga 9dc6a296e3 tiny fix 2023-08-12 22:02:43 +08:00
hiyouga 8545c11c45 fix rope scaling 2023-08-12 22:00:01 +08:00
hiyouga 8a79ded55d update readme 2023-08-12 21:29:06 +08:00
hiyouga 3ea1fa35d1 update readme 2023-08-12 21:25:19 +08:00
hiyouga 2618e0b5a7 update readme 2023-08-12 21:23:05 +08:00