hiyouga
|
2c867b9bb1
|
add custom baichuan-13B code supports left-padding
|
2023-07-15 22:37:17 +08:00 |
hiyouga
|
552d773dad
|
fix callback
|
2023-07-15 22:01:43 +08:00 |
hiyouga
|
8528a84e74
|
update stream_chat
|
2023-07-15 19:51:02 +08:00 |
hiyouga
|
657cf0f55a
|
create chat model
|
2023-07-15 19:26:20 +08:00 |
hiyouga
|
d640c5545f
|
Update callbacks.py
|
2023-07-15 17:39:16 +08:00 |
hiyouga
|
1e2b7e0c4b
|
Update README.md
|
2023-07-15 17:20:39 +08:00 |
hiyouga
|
22d9a9c2af
|
fix callback
|
2023-07-15 17:18:16 +08:00 |
hiyouga
|
f751376613
|
modity code structure
|
2023-07-15 16:54:28 +08:00 |
hiyouga
|
2a0f1f8398
|
Update wechat.jpg
|
2023-07-14 17:29:43 +08:00 |
hiyouga
|
c30db9f1f0
|
fix eval and pred loss
|
2023-07-14 13:11:57 +08:00 |
hiyouga
|
a04115ec27
|
fix pretrain
|
2023-07-13 23:41:54 +08:00 |
hiyouga
|
08439d29b2
|
fix Baichuan-13B
|
2023-07-13 23:08:45 +08:00 |
hoshi-hiyouga
|
8cd76ef3c3
|
Merge pull request #156 from ZhengJun-AI/main
Support for WebNovel dataset
|
2023-07-12 20:11:19 +08:00 |
zxbsmk
|
4955dc9eed
|
Support for WebNovel dataset
|
2023-07-12 17:29:47 +08:00 |
hoshi-hiyouga
|
894f13e41f
|
Merge pull request #145 from elicassion/patch-1
Fix typo in common.py
|
2023-07-12 13:50:39 +08:00 |
Jinghuan Shang
|
dc1e8b7181
|
Fix typo in common.py
lastest -> latest
|
2023-07-11 18:03:53 -04:00 |
hiyouga
|
b2f7cb4465
|
fix sft encode
|
2023-07-11 19:50:33 +08:00 |
hiyouga
|
1af031c02b
|
add baichuan template
|
2023-07-11 18:57:50 +08:00 |
hiyouga
|
f936a7af0b
|
support Baichuan-13B
|
2023-07-11 16:16:14 +08:00 |
hiyouga
|
8447206bbc
|
Update README.md
|
2023-07-10 23:09:11 +08:00 |
hiyouga
|
061c324972
|
Update wechat.jpg
|
2023-07-10 18:41:53 +08:00 |
hiyouga
|
4182c7aa8b
|
Update README.md
|
2023-07-09 14:57:13 +08:00 |
hiyouga
|
84a06318d4
|
update api to match langchain
|
2023-07-07 20:35:39 +08:00 |
hiyouga
|
233f20864b
|
Update README.md
|
2023-07-07 12:06:28 +08:00 |
hiyouga
|
a2f507c562
|
support InternLM
|
2023-07-07 11:02:28 +08:00 |
hiyouga
|
caa00d3ac2
|
fix rouge score
|
2023-07-06 14:28:34 +08:00 |
hiyouga
|
89c623e4bf
|
update readme
|
2023-07-05 23:03:58 +08:00 |
hiyouga
|
4abd2485e1
|
fix streaming response in API
|
2023-07-05 22:42:31 +08:00 |
hiyouga
|
e6603977f6
|
fix freeze tuning
|
2023-07-05 21:18:28 +08:00 |
hiyouga
|
a2ba69183b
|
fix bug in PPO stage
|
2023-07-05 19:14:10 +08:00 |
hiyouga
|
8e3540c62d
|
fix compute dtype
|
2023-07-05 15:13:00 +08:00 |
hiyouga
|
c136f362c1
|
support falcon model #72
|
2023-07-05 15:00:06 +08:00 |
hiyouga
|
966b5c70fc
|
Update wechat.jpg
|
2023-07-05 00:22:22 +08:00 |
hiyouga
|
cac87fd553
|
fix bleu score
|
2023-07-05 00:11:21 +08:00 |
hiyouga
|
395ed1cf1b
|
set use_cache before saving model
|
2023-07-04 23:18:20 +08:00 |
hiyouga
|
65e9ce2cdd
|
fix seq2seq predictions
|
2023-07-04 22:56:51 +08:00 |
hoshi-hiyouga
|
cb26f78923
|
Merge pull request #119 from codemayq/main
add the pre-built version of bitsandbytes library for windows user
|
2023-07-03 19:51:46 +08:00 |
codemayq
|
d3b30ecde3
|
add the pre-built version of bitsandbytes library for windows user
|
2023-07-03 13:58:10 +08:00 |
hiyouga
|
0db9d29111
|
Update auto_gptq.py
|
2023-07-02 20:56:11 +08:00 |
hiyouga
|
cf6d57fd3e
|
add autogptq
|
2023-07-02 20:36:37 +08:00 |
hiyouga
|
b8e1f09a2e
|
Update wechat.jpg
|
2023-06-30 15:45:20 +08:00 |
hiyouga
|
92fa515e97
|
fix typo
|
2023-06-30 10:09:59 +08:00 |
hiyouga
|
021b035c1e
|
Update README.md
|
2023-06-29 19:36:22 +08:00 |
hiyouga
|
f14bd729a8
|
rename evaluate.py
|
2023-06-29 15:40:39 +08:00 |
hiyouga
|
23a7266272
|
Update evaluate.py
|
2023-06-29 15:40:03 +08:00 |
hiyouga
|
70592035b8
|
Update README.md
|
2023-06-29 15:37:19 +08:00 |
hiyouga
|
3154fec979
|
add open assistant dataset
|
2023-06-28 23:09:33 +08:00 |
hiyouga
|
4d0fddba21
|
update loading logic
|
2023-06-28 12:07:16 +08:00 |
hiyouga
|
0a46313cca
|
fix loading best model
|
2023-06-28 01:55:12 +08:00 |
hiyouga
|
7826a8ca77
|
fix RM accuracy
|
2023-06-28 01:40:13 +08:00 |