hiyouga | 509abe8864 | add models | 2023-11-30 19:16:13 +08:00 |
hiyouga | 9d38e5687d | add gpu requirement #1657 | 2023-11-29 12:05:03 +08:00 |
hiyouga | 5085b00a1d | update readme | 2023-11-21 13:15:46 +08:00 |
hiyouga | 9ea9380145 | support GPTQ tuning #729 #1481 #1545 , fix chatglm template #1453 #1480 #1569 | 2023-11-20 22:52:11 +08:00 |
hiyouga | 5021062493 | update ppo trainer | 2023-11-20 21:39:15 +08:00 |
hoshi-hiyouga | 48211e3799 | Merge pull request #1553 from hannlp/hans: Change the default argument settings for PPO training | 2023-11-20 20:32:55 +08:00 |
hiyouga | a2019c8b61 | update benchmark | 2023-11-18 11:30:01 +08:00 |
hiyouga | 90212280d6 | update readme | 2023-11-18 11:15:56 +08:00 |
hiyouga | 329134f58c | add benchmark | 2023-11-18 11:09:52 +08:00 |
Yuchen Han | c9b499fa7e | Update README.md | 2023-11-17 00:17:36 -08:00 |
hiyouga | 72e6699547 | update readme | 2023-11-16 15:58:37 +08:00 |
hiyouga | ce78303600 | support full-parameter PPO | 2023-11-16 02:08:04 +08:00 |
hiyouga | 8350bcf85d | add demo mode for web UI | 2023-11-15 23:51:26 +08:00 |
hiyouga | 1e19cf242a | update readme and constants | 2023-11-15 18:04:37 +08:00 |
hiyouga | 88ab33254e | fix dc link | 2023-11-13 23:22:56 +08:00 |
hiyouga | 442aefb925 | refactor evaluation, upgrade trl to 074 | 2023-11-13 22:20:35 +08:00 |
hiyouga | 3697a3dc9a | refactor constants | 2023-11-10 14:16:10 +08:00 |
hiyouga | b3572659f5 | update readme | 2023-11-09 16:00:24 +08:00 |
hiyouga | e1e04cb1f1 | update readme (list in alphabetical order) | 2023-11-06 17:18:12 +08:00 |
hiyouga | a7eeb8e17c | update templates | 2023-11-06 12:25:47 +08:00 |
hiyouga | cc8ffa10d8 | update data readme (zh) | 2023-11-02 23:42:49 +08:00 |
hiyouga | a837172413 | support sharegpt format, add datasets | 2023-11-02 23:10:04 +08:00 |
hiyouga | 640a520108 | update projects | 2023-10-29 22:53:47 +08:00 |
hiyouga | 59f342e76f | add projects | 2023-10-29 22:07:13 +08:00 |
hiyouga | 52fc24d166 | fix vicuna template | 2023-10-27 22:15:25 +08:00 |
hiyouga | 4600c29e93 | update readme | 2023-10-27 19:19:03 +08:00 |
hiyouga | 1c0ab9a908 | support chatglm3 | 2023-10-27 19:16:28 +08:00 |
hiyouga | 7b4acf7265 | reimplement neftune | 2023-10-22 16:15:08 +08:00 |
anvie | 57fb40aa04 | add NEFTune optimization | 2023-10-21 13:24:10 +07:00 |
hiyouga | b665e9e133 | fix #1232 | 2023-10-20 23:28:52 +08:00 |
hiyouga | 6496a99b7d | fix #1217 | 2023-10-19 15:52:24 +08:00 |
hoshi-hiyouga | beacb798ea | Update README.md | 2023-10-16 00:23:37 +08:00 |
hiyouga | f5d0da4d2a | update readme | 2023-10-15 20:28:14 +08:00 |
hoshi-hiyouga | 25d326e135 | Update README.md | 2023-10-15 20:23:22 +08:00 |
hiyouga | ea82f8a82a | refactor export, fix #1190 | 2023-10-15 16:01:48 +08:00 |
hiyouga | cb42676694 | update readme | 2023-10-13 13:53:43 +08:00 |
hiyouga | c4102f306a | update discord link | 2023-10-12 21:44:28 +08:00 |
hiyouga | 197c754d73 | rename repository | 2023-10-12 21:42:29 +08:00 |
hiyouga | 8e2ed6b8ce | update readme | 2023-10-09 20:02:50 +08:00 |
hiyouga | d11a545463 | fix #1068 #1074 | 2023-09-28 14:39:16 +08:00 |
hiyouga | 4eae061464 | update readme | 2023-09-27 21:57:47 +08:00 |
hiyouga | 90375f600d | support LongLoRA | 2023-09-27 21:55:50 +08:00 |
hiyouga | 4dd9b4d982 | add CMMLU, update eval script | 2023-09-23 21:10:17 +08:00 |
hiyouga | badd2735b5 | move file | 2023-09-23 11:52:12 +08:00 |
hiyouga | 465ee8119a | add MMLU and C-Eval script | 2023-09-23 00:34:17 +08:00 |
hiyouga | 5cc7a44784 | fix #1000 | 2023-09-22 15:00:48 +08:00 |
hiyouga | 044d4425b4 | update readme | 2023-09-22 14:34:13 +08:00 |
hiyouga | ace3f85a72 | tiny fix | 2023-09-21 15:25:29 +08:00 |
hiyouga | acda45e463 | update readme | 2023-09-16 17:33:01 +08:00 |
hiyouga | 026af87e7f | add MathInstruct dataset | 2023-09-13 22:30:14 +08:00 |