hiyouga
|
0a690ada6f
|
update examples
|
2024-08-09 20:13:46 +08:00 |
hiyouga
|
e2a28f51c6
|
add adam_mini to readme
|
2024-08-09 20:02:03 +08:00 |
hiyouga
|
86f7099fa3
|
update scripts
|
2024-08-09 19:16:23 +08:00 |
hiyouga
|
1bbd49faae
|
fix #4944
|
2024-07-24 16:42:51 +08:00 |
hiyouga
|
c9bb0757ec
|
update pissa example
|
2024-07-06 15:47:32 +08:00 |
hiyouga
|
2f78b5d62a
|
update examples
|
2024-06-28 01:17:07 +08:00 |
hiyouga
|
095fab58d3
|
tiny fix about badam
|
2024-06-25 01:54:53 +08:00 |
Jonery
|
97c5235160
|
add example
|
2024-06-18 13:50:26 +08:00 |
hiyouga
|
2bf2863a58
|
tiny fix
|
2024-06-17 17:47:25 +08:00 |
hiyouga
|
8c1046d78a
|
support pissa
|
2024-06-16 01:08:12 +08:00 |
hiyouga
|
b6e008c152
|
update examples
|
2024-06-13 03:15:06 +08:00 |
hiyouga
|
cae4737907
|
lora modules: all by default
|
2024-06-06 03:53:28 +08:00 |
hiyouga
|
dc4a00dd63
|
update train hparams
|
2024-06-06 01:49:20 +08:00 |
hiyouga
|
5a13b3baa6
|
tiny fix
|
2024-06-04 00:31:10 +08:00 |
hiyouga
|
eed33862bc
|
fix #4005 #4013
|
2024-06-03 19:12:29 +08:00 |
hiyouga
|
c450ee87a3
|
improve KTO impl., replace datasets
|
2024-05-18 03:44:56 +08:00 |
hiyouga
|
e5bba7cf1b
|
update badam example #3764
|
2024-05-17 02:21:10 +08:00 |
hiyouga
|
ddec9e1b84
|
update examples
|
2024-05-17 01:02:00 +08:00 |
hiyouga
|
2a67ab3925
|
fix #3694
|
2024-05-16 00:35:28 +08:00 |
hiyouga
|
dae83f4199
|
update examples
|
2024-05-13 20:39:36 +08:00 |
hiyouga
|
f02f87c6fb
|
update example docs
|
2024-05-06 22:51:02 +08:00 |
hiyouga
|
34d33e2257
|
update docs
|
2024-05-06 21:47:00 +08:00 |
Oscar
|
eeb415f6fa
|
Fix badam example outdated argument
|
2024-05-05 23:35:19 +08:00 |
hiyouga
|
245fe47ece
|
update webui and add CLIs
|
2024-05-03 02:58:23 +08:00 |
hiyouga
|
a1f1fac33b
|
update readme and examples
|
2024-04-22 00:37:32 +08:00 |
hiyouga
|
ddbd29d777
|
remove extras
|
2024-04-22 00:35:41 +08:00 |
hiyouga
|
5c62881c5a
|
fix bug in galore optimizer
|
2024-04-21 18:53:22 +08:00 |
hiyouga
|
f58425ab45
|
fix mod stuff
|
2024-04-21 18:11:10 +08:00 |
Marco
|
620add7b9f
|
Added Mixture of Depths
|
2024-04-18 20:31:24 +02:00 |
hoshi-hiyouga
|
57dcd91e17
|
Update sft.sh
|
2024-04-16 17:25:40 +08:00 |
Jonery
|
7ecb61822b
|
resolve gradient checkpointing issue.
|
2024-04-16 12:05:27 +08:00 |
Jonery
|
06c8908d3f
|
Feature BAdam
|
2024-04-15 23:15:27 +08:00 |
hiyouga
|
cce52351b5
|
update examples
|
2024-04-15 22:14:34 +08:00 |
hiyouga
|
f22eaeb5bc
|
update examples
|
2024-04-02 20:51:21 +08:00 |
hiyouga
|
31ffbde24d
|
update examples
|
2024-04-02 20:41:49 +08:00 |
hiyouga
|
11a6c1bad6
|
update readme
|
2024-04-02 20:37:37 +08:00 |
hiyouga
|
92dab8a90b
|
simplify readme
|
2024-04-02 20:07:43 +08:00 |
hiyouga
|
8c77b10912
|
update trainers
|
2024-03-28 18:16:27 +08:00 |
hiyouga
|
72367307df
|
improve lora+ impl.
|
2024-03-13 23:32:51 +08:00 |
齐保元
|
a0965cd62c
|
[FEATURE]: ADD LORA+ ALGORITHM
|
2024-03-13 19:43:27 +08:00 |
hiyouga
|
8664262cde
|
support layerwise galore
|
2024-03-10 00:24:11 +08:00 |
hiyouga
|
4c00bcdcae
|
update examples
|
2024-03-09 02:30:37 +08:00 |
hiyouga
|
10be2f0ecc
|
fix aqlm version
|
2024-03-09 00:09:09 +08:00 |
hiyouga
|
8a45213440
|
fix example params
|
2024-03-08 20:41:43 +08:00 |
hiyouga
|
33a4c24a8a
|
fix galore
|
2024-03-08 00:44:51 +08:00 |
hiyouga
|
7230e1177d
|
add galore examples
|
2024-03-07 22:53:45 +08:00 |