hiyouga
|
771f454ff1
|
use low_cpu_mem_usage to speed up loading
|
2023-06-03 18:19:01 +08:00 |
hiyouga
|
dca27b4412
|
add logits processor
|
2023-06-03 16:34:54 +08:00 |
hiyouga
|
ed6161fa6a
|
remove unused code
|
2023-06-03 00:10:54 +08:00 |
hiyouga
|
72a85ccc39
|
add wechat
|
2023-06-02 21:47:10 +08:00 |
hiyouga
|
b8a034807e
|
tiny fix
|
2023-06-02 19:02:25 +08:00 |
hiyouga
|
e3aaef7d4a
|
fix layer norm name in PPO
|
2023-06-02 17:30:01 +08:00 |
hiyouga
|
bd565af370
|
fix #1
|
2023-06-02 14:25:00 +08:00 |
hiyouga
|
50d9a20f81
|
alter rewards data type
|
2023-06-02 14:19:51 +08:00 |
hiyouga
|
e6126244c1
|
fix possibly OOM error
|
2023-06-01 23:54:44 +08:00 |
hiyouga
|
fd709eacff
|
fix bug at inference
|
2023-05-31 18:11:53 +08:00 |
hiyouga
|
38ca429228
|
update readme
|
2023-05-31 16:57:43 +08:00 |
hiyouga
|
740a5daf56
|
support BLOOM models
|
2023-05-31 16:54:06 +08:00 |
hoshi-hiyouga
|
c36620ece4
|
Merge pull request #1 from mMrBun/main
Support conversation via API.
|
2023-05-30 16:34:00 +08:00 |
hiyouga
|
a72492e649
|
remove dummy code
|
2023-05-30 16:28:00 +08:00 |
mMrBun
|
748b804bac
|
Support conversation via API.
|
2023-05-30 15:00:28 +08:00 |
mMrBun
|
e821682430
|
Support conversation via API.
|
2023-05-30 14:46:22 +08:00 |
hiyouga
|
6ccdfb4001
|
update readme
|
2023-05-29 21:54:01 +08:00 |
hiyouga
|
7698f9aa9a
|
update readme
|
2023-05-29 21:53:02 +08:00 |
hiyouga
|
8ff96509fa
|
add pre-training script
|
2023-05-29 21:37:22 +08:00 |
hiyouga
|
c0e5df92d6
|
fix checkpoint loading
|
2023-05-29 17:43:16 +08:00 |
hiyouga
|
ce71cc8b6d
|
tiny fix
|
2023-05-29 09:42:29 +08:00 |
hiyouga
|
166c837b95
|
tiny fix
|
2023-05-28 21:48:33 +08:00 |
hiyouga
|
0c9fda01e3
|
use fp16 model, add logcallback
|
2023-05-28 21:30:28 +08:00 |
hiyouga
|
769c6ab56b
|
Initial commit
|
2023-05-28 18:09:04 +08:00 |