Commit Graph

830 Commits

Author SHA1 Message Date
hiyouga eac9921e5c tiny fix 2023-06-04 12:55:40 +08:00
hiyouga 3b9eee8cd2 support QLoRA 2023-06-04 00:08:56 +08:00
hiyouga 1bd13d7ca1 fix int8 inference 2023-06-03 23:22:05 +08:00
hiyouga 926291940d reduce repetition penalty 2023-06-03 21:57:39 +08:00
hiyouga 0f69a0c19e fix int8 inference 2023-06-03 21:17:47 +08:00
hiyouga de09ee1315 add ziya prompt template 2023-06-03 19:05:51 +08:00
hiyouga 771f454ff1 use low_cpu_mem_usage to speed up loading 2023-06-03 18:19:01 +08:00
hiyouga dca27b4412 add logits processor 2023-06-03 16:34:54 +08:00
hiyouga ed6161fa6a remove unused code 2023-06-03 00:10:54 +08:00
hiyouga 72a85ccc39 add wechat 2023-06-02 21:47:10 +08:00
hiyouga b8a034807e tiny fix 2023-06-02 19:02:25 +08:00
hiyouga e3aaef7d4a fix layer norm name in PPO 2023-06-02 17:30:01 +08:00
hiyouga bd565af370 fix #1 2023-06-02 14:25:00 +08:00
hiyouga 50d9a20f81 alter rewards data type 2023-06-02 14:19:51 +08:00
hiyouga e6126244c1 fix possibly OOM error 2023-06-01 23:54:44 +08:00
hiyouga fd709eacff fix bug at inference 2023-05-31 18:11:53 +08:00
hiyouga 38ca429228 update readme 2023-05-31 16:57:43 +08:00
hiyouga 740a5daf56 support BLOOM models 2023-05-31 16:54:06 +08:00
hoshi-hiyouga c36620ece4
Merge pull request #1 from mMrBun/main
Support conversation via API.
2023-05-30 16:34:00 +08:00
hiyouga a72492e649 remove dummy code 2023-05-30 16:28:00 +08:00
mMrBun 748b804bac Support conversation via API. 2023-05-30 15:00:28 +08:00
mMrBun e821682430 Support conversation via API. 2023-05-30 14:46:22 +08:00
hiyouga 6ccdfb4001 update readme 2023-05-29 21:54:01 +08:00
hiyouga 7698f9aa9a update readme 2023-05-29 21:53:02 +08:00
hiyouga 8ff96509fa add pre-training script 2023-05-29 21:37:22 +08:00
hiyouga c0e5df92d6 fix checkpoint loading 2023-05-29 17:43:16 +08:00
hiyouga ce71cc8b6d tiny fix 2023-05-29 09:42:29 +08:00
hiyouga 166c837b95 tiny fix 2023-05-28 21:48:33 +08:00
hiyouga 0c9fda01e3 use fp16 model, add logcallback 2023-05-28 21:30:28 +08:00
hiyouga 769c6ab56b Initial commit 2023-05-28 18:09:04 +08:00