xingjun.wang
fe4acc66b0
add new datasets
2023-12-12 12:44:15 +08:00
xingjun.wang
0ce18a3782
add open orca
2023-12-12 12:34:04 +08:00
xingjun.wang
cfba1009d0
update
2023-12-12 12:03:23 +08:00
xingjun.wang
5b979147f0
for test
2023-12-12 11:52:59 +08:00
xingjun.wang
8a908a8c64
for test
2023-12-12 11:47:59 +08:00
yuze.zyz
e4cf2a75ca
fix typo
2023-12-08 18:13:26 +08:00
yuze.zyz
9c2247d700
support ms dataset
2023-12-08 18:00:57 +08:00
hoshi-hiyouga
00f5c9ee16
Merge branch 'main' into feat/support_ms
2023-12-01 20:23:46 +08:00
yuze.zyz
5a2392f105
remove useless code
2023-12-01 17:28:23 +08:00
tastelikefeet
d9e52957e2
fix bug
2023-12-01 17:27:00 +08:00
hiyouga
a5a248d569
fix err hint
2023-12-01 17:13:22 +08:00
hiyouga
a51b8ec620
add err hint
2023-12-01 17:04:37 +08:00
hoshi-hiyouga
aec946b119
Merge pull request #1699 from Samge0/patch-1
...
Update .gitignore
2023-12-01 16:52:57 +08:00
SamgeShao
7cabb9903d
Update .gitignore
2023-12-01 16:37:41 +08:00
yuze.zyz
5aa6751e52
add readme
2023-12-01 16:11:30 +08:00
hiyouga
e597d3c084
tiny fix
2023-12-01 15:58:50 +08:00
hoshi-hiyouga
fbc6220692
Merge pull request #1695 from Samge0/dev
...
Improve:"CUDA_VISIBLE_DEVICES" read from the env
2023-12-01 15:56:18 +08:00
hoshi-hiyouga
d043a4e7ba
Merge pull request #1690 from billvsme/main
...
Improve get_current_device
2023-12-01 15:44:35 +08:00
hiyouga
bf6f6aeefe
fix #1696
2023-12-01 15:34:50 +08:00
tastelikefeet
8ce4d11e38
add model
2023-12-01 15:06:17 +08:00
hoshi-hiyouga
a0fde6e421
Merge pull request #1689 from mlinmg/patch-2
...
Update dataset_info.json - Added Nectar
2023-12-01 14:29:36 +08:00
samge
421d4de604
Improve:"CUDA_VISIBLE_DEVICES" read from the env
2023-12-01 11:35:02 +08:00
Marco
9468ee9012
Update dataset_info.json
...
Added the Nectar dataset already preprocessed and divided in sft and rl to which I added a preprompt to each instruction since it has been seen that this increase instruction following
2023-11-30 16:21:34 +01:00
billvsme
40dfcbc3d4
improve get_current_device
2023-11-30 22:40:35 +08:00
hiyouga
327d7f7efe
fix #1597
2023-11-30 21:47:06 +08:00
hiyouga
1585962eb7
fix #1668
2023-11-30 21:02:00 +08:00
hiyouga
a38dbf55e3
fix #1682
2023-11-30 20:03:32 +08:00
hiyouga
509abe8864
add models
2023-11-30 19:16:13 +08:00
yuze.zyz
fb2204c183
fix
2023-11-29 21:43:58 +08:00
yuze.zyz
d38a2e7341
support ms
2023-11-29 20:36:55 +08:00
hiyouga
9d38e5687d
add gpu requirement #1657
2023-11-29 12:05:03 +08:00
hiyouga
77d1b14fc2
fix #1658
2023-11-28 20:57:24 +08:00
hiyouga
475a3fa0f4
fix #1659
2023-11-28 20:52:28 +08:00
hiyouga
c2d4300ac4
Update wechat.jpg
2023-11-28 17:27:23 +08:00
hiyouga
859a6ea942
support export size setting
2023-11-26 18:34:09 +08:00
hiyouga
ff1c289229
support Yi-34B-Chat models
2023-11-23 19:31:49 +08:00
hiyouga
5085b00a1d
update readme
2023-11-21 13:15:46 +08:00
hiyouga
35c2da3eba
set version
2023-11-20 22:57:44 +08:00
hiyouga
9ea9380145
support GPTQ tuning #729 #1481 #1545 , fix chatglm template #1453 #1480 #1569
2023-11-20 22:52:11 +08:00
hiyouga
5021062493
update ppo trainer
2023-11-20 21:39:15 +08:00
hoshi-hiyouga
48211e3799
Merge pull request #1553 from hannlp/hans
...
Change the default argument settings for PPO training
2023-11-20 20:32:55 +08:00
hiyouga
2a36fd5064
fix value head model resuming
2023-11-20 19:01:37 +08:00
hiyouga
99a3f06377
fix #1567
2023-11-20 18:46:36 +08:00
hiyouga
00baaa990e
better data streaming
2023-11-19 23:32:47 +08:00
hiyouga
211b2db5a8
fix model card network issue
2023-11-19 23:03:19 +08:00
hiyouga
bfb9433165
fix Mistral template
...
https://github.com/lm-sys/FastChat/pull/2547
2023-11-19 16:29:30 +08:00
hiyouga
065bfaeed4
fix #1263
2023-11-19 16:05:18 +08:00
hiyouga
1740131d63
fix #1558
2023-11-19 14:15:47 +08:00
hiyouga
ff6056405d
fix evaluator and cached_file in 4.31.0
2023-11-18 19:39:23 +08:00
hiyouga
a2019c8b61
update benchmark
2023-11-18 11:30:01 +08:00