hiyouga
|
818726e9bc
|
add GaLore results
|
2024-03-09 04:11:55 +08:00 |
hiyouga
|
393c2de27c
|
update hardware requirements
|
2024-03-09 03:58:18 +08:00 |
hiyouga
|
10be2f0ecc
|
fix aqlm version
|
2024-03-09 00:09:09 +08:00 |
hiyouga
|
4a2cc60b94
|
update readme
|
2024-03-08 03:06:21 +08:00 |
hiyouga
|
33a4c24a8a
|
fix galore
|
2024-03-08 00:44:51 +08:00 |
hiyouga
|
57452a4aa1
|
add Yi-9B model
|
2024-03-07 23:11:57 +08:00 |
hiyouga
|
7230e1177d
|
add galore examples
|
2024-03-07 22:53:45 +08:00 |
hiyouga
|
28f7862188
|
support galore
|
2024-03-07 22:41:36 +08:00 |
hiyouga
|
725f7cd70f
|
update readme
|
2024-03-07 20:34:49 +08:00 |
hiyouga
|
77211d9843
|
tiny fix
|
2024-03-07 20:29:34 +08:00 |
hiyouga
|
d07ad5cc1c
|
support vllm
|
2024-03-07 20:26:31 +08:00 |
hiyouga
|
0048a2021e
|
tiny fix
|
2024-03-06 17:25:08 +08:00 |
hiyouga
|
9658c63cd9
|
fix add tokens
|
2024-03-06 15:04:02 +08:00 |
hiyouga
|
3016e65657
|
fix version checking
|
2024-03-06 14:51:51 +08:00 |
hiyouga
|
df9e6bb063
|
update readme
|
2024-03-05 03:20:23 +08:00 |
hiyouga
|
24a79bd50f
|
update readme
|
2024-03-04 19:29:26 +08:00 |
hiyouga
|
7c227e07dd
|
update readme
|
2024-03-03 01:41:07 +08:00 |
hiyouga
|
894d183214
|
update readme, add starcoder2, cosmopedia
|
2024-03-03 01:01:46 +08:00 |
hoshi-hiyouga
|
4bf7eb72e0
|
Update README.md
|
2024-03-03 00:48:47 +08:00 |
hoshi-hiyouga
|
585c884ea9
|
Update README.md
|
2024-03-03 00:48:06 +08:00 |
hiyouga
|
318315c76d
|
add colab demo
|
2024-03-02 19:58:21 +08:00 |
hiyouga
|
bb16502c33
|
add twitter
|
2024-02-29 17:45:30 +08:00 |
hiyouga
|
fa5ab21ebc
|
release v0.5.3
|
2024-02-29 00:34:19 +08:00 |
hiyouga
|
804c1e7083
|
add examples
|
2024-02-28 23:19:25 +08:00 |
hiyouga
|
38d8b2cef8
|
update chatglm3 template
|
2024-02-28 21:11:23 +08:00 |
hiyouga
|
a2dccce06a
|
update readme
|
2024-02-28 20:50:01 +08:00 |
hiyouga
|
cfefacaa37
|
support DoRA, AWQ, AQLM #2512
|
2024-02-28 19:53:28 +08:00 |
hiyouga
|
3ba1054593
|
update readme
|
2024-02-26 17:25:47 +08:00 |
hiyouga
|
261f631a1c
|
update readme
|
2024-02-25 16:26:08 +08:00 |
hiyouga
|
aca948da8f
|
add papers
|
2024-02-25 15:34:47 +08:00 |
hiyouga
|
ad76482cf9
|
add papers
|
2024-02-25 15:18:58 +08:00 |
hiyouga
|
c99e19641a
|
support gemma
|
2024-02-21 23:27:36 +08:00 |
hiyouga
|
daa3185350
|
tiny fix
|
2024-02-21 18:30:29 +08:00 |
hoshi-hiyouga
|
869fd208a8
|
Update README.md
|
2024-02-20 16:07:55 +08:00 |
codemayq
|
d47e40633a
|
1. update the version of pre-built bitsandbytes library
2. add pre-built flash-attn library
|
2024-02-20 11:28:25 +08:00 |
codemayq
|
95f53a46bd
|
1. update the version of pre-built bitsandbytes library
2. add pre-built flash-attn library
|
2024-02-20 11:26:22 +08:00 |
hiyouga
|
7924ffc55d
|
support llama pro #2338 , add rslora
|
2024-02-15 02:27:36 +08:00 |
hiyouga
|
7d2dc83c5e
|
improve aligner
|
2024-02-10 16:39:19 +08:00 |
hiyouga
|
54ea9684ed
|
improve fix tokenizer
|
2024-02-09 14:53:14 +08:00 |
hoshi-hiyouga
|
d0daaa01f9
|
Merge pull request #2423 from mayflower/main
Support for german sft and dpo
|
2024-02-07 15:58:20 +08:00 |
hiyouga
|
ccabb5b04a
|
support qwen1.5
|
2024-02-06 00:10:51 +08:00 |
Johann-Peter Hartmann
|
d9a8301ed4
|
Add support for german datasets
|
2024-01-30 10:18:01 +01:00 |
hiyouga
|
a0d59aa4ec
|
release v0.5.0 (real)
|
2024-01-21 01:54:49 +08:00 |
hiyouga
|
5608a0da8e
|
update readme
|
2024-01-18 14:30:48 +08:00 |
hiyouga
|
5a207bb723
|
tiny fix
|
2024-01-15 23:34:23 +08:00 |
Junu Moon(Fran)
|
7a320de097
|
fix: typo on README.md
|
2024-01-15 19:50:35 +09:00 |
hiyouga
|
ca3933dc52
|
support deepseek moe
|
2024-01-14 00:14:49 +08:00 |
hiyouga
|
d1a73fe26c
|
fix phi modules
|
2024-01-13 23:12:47 +08:00 |
JessyTsu1
|
8c5e4a8896
|
Update README.md
|
2024-01-11 23:18:29 +08:00 |
JessyTsu1
|
d72aff5ae6
|
Update README.md
|
2024-01-11 23:17:00 +08:00 |