hiyouga
|
328ad06bd4
|
set version
|
2023-12-16 20:17:51 +08:00 |
hiyouga
|
a66186b872
|
add noisy mean initialization #1815
|
2023-12-16 19:47:51 +08:00 |
hiyouga
|
b87c74289d
|
support dpo-ftx
|
2023-12-16 19:21:41 +08:00 |
hiyouga
|
71389be37c
|
support autogptq in llama board #246
|
2023-12-16 16:31:30 +08:00 |
hoshi-hiyouga
|
93f64ce9a8
|
Merge pull request #1868 from yhyu13/improve_hfargparser
Improve logging for unknown args
|
2023-12-16 16:06:09 +08:00 |
yhyu13
|
fc70a92cb6
|
Use llmtuner logger
|
2023-12-16 07:15:27 +00:00 |
yhyu13
|
26817143ff
|
Improve logging for unknown args
|
2023-12-16 05:16:29 +00:00 |
hiyouga
|
3551171d49
|
update tips
|
2023-12-15 23:52:50 +08:00 |
hiyouga
|
439a26c276
|
fix #1770
|
2023-12-15 23:50:15 +08:00 |
hiyouga
|
3524aa1e58
|
support quantization in export model
|
2023-12-15 23:44:50 +08:00 |
hiyouga
|
87ef3f47b5
|
update dc link
|
2023-12-15 22:11:31 +08:00 |
hoshi-hiyouga
|
e2bd597b3c
|
Merge pull request #1864 from hiyouga/dev
Refactor hyper-parameters of adapters and model loader
|
2023-12-15 22:06:56 +08:00 |
hiyouga
|
00c77104f8
|
fix bug
|
2023-12-15 21:54:02 +08:00 |
hiyouga
|
9e509b99af
|
fix bug
|
2023-12-15 21:49:26 +08:00 |
hiyouga
|
2740aa9cbb
|
add configurer
|
2023-12-15 21:46:40 +08:00 |
hiyouga
|
0716f5e470
|
refactor adapter hparam
|
2023-12-15 20:53:11 +08:00 |
hiyouga
|
d4c351f1ec
|
add loftq
|
2023-12-14 21:53:56 +08:00 |
hiyouga
|
bfdee1608f
|
fix valuehead model
|
2023-12-14 20:15:20 +08:00 |
hoshi-hiyouga
|
bf2d9c8feb
|
Update wechat.jpg
|
2023-12-13 18:23:18 +08:00 |
hoshi-hiyouga
|
81167cd19d
|
tiny fix
|
2023-12-13 17:32:36 +08:00 |
hoshi-hiyouga
|
9b0630f84f
|
revert peft version
|
2023-12-13 10:49:45 +08:00 |
hoshi-hiyouga
|
573a12c86b
|
update peft version
|
2023-12-13 10:23:51 +08:00 |
hoshi-hiyouga
|
6953096c9d
|
tiny fix
|
2023-12-13 10:21:29 +08:00 |
hoshi-hiyouga
|
1fcd545c3d
|
fix #1819
|
2023-12-13 10:14:01 +08:00 |
hiyouga
|
3a8a50d4d4
|
remove loftq
|
2023-12-13 01:53:46 +08:00 |
hiyouga
|
2c8e88f9c1
|
fix sharegpt loading
|
2023-12-13 00:56:16 +08:00 |
hiyouga
|
3552035d7e
|
add model urls
|
2023-12-13 00:09:17 +08:00 |
hiyouga
|
28cc07868c
|
update readme
|
2023-12-12 23:30:29 +08:00 |
hiyouga
|
6219dfbd93
|
support loftq
|
2023-12-12 22:47:06 +08:00 |
hiyouga
|
ada0e536c9
|
fix #1795
|
2023-12-12 19:58:34 +08:00 |
hiyouga
|
0a9c6e0146
|
support system column #1765
|
2023-12-12 19:45:59 +08:00 |
hiyouga
|
d5b2c57a35
|
fix modelscope data hub
|
2023-12-12 18:33:06 +08:00 |
hoshi-hiyouga
|
382319915c
|
Merge pull request #1802 from tastelikefeet/feat/support_ms
Support ModelScope Datahub
|
2023-12-12 17:58:37 +08:00 |
hoshi-hiyouga
|
6382efec52
|
Merge branch 'main' into feat/support_ms
|
2023-12-12 17:55:32 +08:00 |
hiyouga
|
e6ddebd3ae
|
fix webui
|
2023-12-12 15:27:40 +08:00 |
xingjun.wang
|
e80a989d49
|
modify guanaco
|
2023-12-12 15:00:37 +08:00 |
xingjun.wang
|
73b50a26b9
|
update dataset info
|
2023-12-12 14:53:59 +08:00 |
xingjun.wang
|
adc98c86da
|
add use_streaming
|
2023-12-12 14:23:05 +08:00 |
xingjun.wang
|
1909f0d117
|
fix cache dir
|
2023-12-12 14:21:33 +08:00 |
xingjun.wang
|
168321a4da
|
add print info for test
|
2023-12-12 14:14:40 +08:00 |
xingjun.wang
|
edc82b923a
|
update cache dir
|
2023-12-12 13:08:18 +08:00 |
xingjun.wang
|
09533e95ed
|
update args for MsDataset.load
|
2023-12-12 13:02:54 +08:00 |
xingjun.wang
|
fe4acc66b0
|
add new datasets
|
2023-12-12 12:44:15 +08:00 |
xingjun.wang
|
0ce18a3782
|
add open orca
|
2023-12-12 12:34:04 +08:00 |
xingjun.wang
|
cfba1009d0
|
update
|
2023-12-12 12:03:23 +08:00 |
xingjun.wang
|
5b979147f0
|
for test
|
2023-12-12 11:52:59 +08:00 |
xingjun.wang
|
8a908a8c64
|
for test
|
2023-12-12 11:47:59 +08:00 |
hiyouga
|
8cace77808
|
update readme
|
2023-12-12 11:44:30 +08:00 |
hiyouga
|
96380f5e18
|
support mixtral
|
2023-12-12 11:39:04 +08:00 |
hiyouga
|
f4657de7d5
|
fix baichuan resize
|
2023-12-11 20:55:50 +08:00 |