hiyouga
|
836ca05586
|
update readme
|
2024-04-22 00:51:35 +08:00 |
hiyouga
|
34d66a3a85
|
update readme
|
2024-04-22 00:42:25 +08:00 |
hiyouga
|
a1f1fac33b
|
update readme and examples
|
2024-04-22 00:37:32 +08:00 |
hiyouga
|
a83e7587a0
|
update readme
|
2024-04-22 00:21:01 +08:00 |
hiyouga
|
f58425ab45
|
fix mod stuff
|
2024-04-21 18:11:10 +08:00 |
Marco
|
620add7b9f
|
Added Mixture of Depths
|
2024-04-18 20:31:24 +02:00 |
hoshi-hiyouga
|
2aaaede247
|
support llama3
|
2024-04-19 01:13:50 +08:00 |
hiyouga
|
3b43a3b7c5
|
tiny fix
|
2024-04-18 00:22:17 +08:00 |
hiyouga
|
e2f1c6fc6a
|
update readme
|
2024-04-17 23:40:49 +08:00 |
hiyouga
|
cab0598fd0
|
add mixtral 8x22B models
|
2024-04-17 23:35:59 +08:00 |
hiyouga
|
5d62a51c12
|
update readme and gradio version
|
2024-04-16 18:09:16 +08:00 |
hiyouga
|
e3d8fc75eb
|
support badam for all stages
|
2024-04-16 17:44:48 +08:00 |
hiyouga
|
cf52911fed
|
update readme
|
2024-04-16 02:36:54 +08:00 |
hiyouga
|
6084eb7cf1
|
update readme
|
2024-04-16 02:35:36 +08:00 |
hiyouga
|
6543f3d449
|
add codegemma
|
2024-04-16 00:11:15 +08:00 |
hiyouga
|
e0dbac2845
|
support cohere commandR #3184
|
2024-04-15 23:26:42 +08:00 |
hiyouga
|
9d4c949461
|
release v0.6.2
|
2024-04-11 20:08:51 +08:00 |
hiyouga
|
a88fe8c1af
|
update readme
|
2024-04-07 00:48:24 +08:00 |
hiyouga
|
7f6e412604
|
fix requires for windows
|
2024-04-03 21:56:43 +08:00 |
hiyouga
|
49a2dfaf90
|
update vllm example
|
2024-04-02 22:45:20 +08:00 |
hiyouga
|
66b0fe4e96
|
update readme
|
2024-04-02 22:17:48 +08:00 |
hiyouga
|
7765f337c7
|
add zh readme
|
2024-04-02 20:58:45 +08:00 |
hiyouga
|
11a6c1bad6
|
update readme
|
2024-04-02 20:37:37 +08:00 |
hiyouga
|
949e5fe638
|
update readme
|
2024-04-02 20:22:11 +08:00 |
hiyouga
|
92dab8a90b
|
simplify readme
|
2024-04-02 20:07:43 +08:00 |
hiyouga
|
54b7d34908
|
add qwen1.5 moe
|
2024-04-01 21:49:40 +08:00 |
hiyouga
|
aee634cd20
|
fix #3077
|
2024-04-01 21:35:18 +08:00 |
hiyouga
|
099db6acc0
|
update readme
|
2024-03-31 18:46:34 +08:00 |
hiyouga
|
17bf8a2c3a
|
support ORPO
|
2024-03-31 18:29:50 +08:00 |
hiyouga
|
c1fe6ce782
|
update readme
|
2024-03-28 22:02:32 +08:00 |
hiyouga
|
1e43319f9c
|
add project
|
2024-03-28 20:24:27 +08:00 |
hiyouga
|
6c94305e47
|
update readme
|
2024-03-28 18:35:11 +08:00 |
hiyouga
|
8c77b10912
|
update trainers
|
2024-03-28 18:16:27 +08:00 |
hiyouga
|
7b3d8188f5
|
update readme
|
2024-03-25 23:06:13 +08:00 |
hoshi-hiyouga
|
f633ac6646
|
Merge pull request #2967 from Tsumugii24/main
Update README_zh.md
|
2024-03-25 23:02:22 +08:00 |
Tsumugii24
|
7aa77a3451
|
Update README_zh.md
|
2024-03-25 22:54:26 +08:00 |
hiyouga
|
6f2b563f12
|
release v0.6.0
|
2024-03-25 22:38:56 +08:00 |
Tsumugii24
|
bb4ca1691a
|
Update README_zh.md
|
2024-03-25 22:31:03 +08:00 |
hiyouga
|
a1c8c98c5f
|
fix #2941
|
2024-03-24 00:28:44 +08:00 |
0xez
|
be0360303d
|
Update README_zh.md, fix the release date of the paper
|
2024-03-22 10:41:17 +08:00 |
hiyouga
|
5eaa50fa01
|
add citation
|
2024-03-21 17:04:10 +08:00 |
hiyouga
|
0581bfdbc7
|
paper release
|
2024-03-21 13:49:17 +08:00 |
hiyouga
|
bfe7a91289
|
update readme
|
2024-03-21 00:48:42 +08:00 |
hiyouga
|
8408225162
|
support fsdp + qlora
|
2024-03-21 00:36:06 +08:00 |
hiyouga
|
9bec3c98a2
|
fix #2777 #2895
|
2024-03-20 17:59:45 +08:00 |
khazic
|
0531dac30d
|
Updated README with new information
|
2024-03-20 14:21:16 +08:00 |
刘一博
|
df9b4fb90a
|
Updated README with new information
|
2024-03-20 14:11:28 +08:00 |
hiyouga
|
72367307df
|
improve lora+ impl.
|
2024-03-13 23:32:51 +08:00 |
hiyouga
|
b3247d6a16
|
support olmo
|
2024-03-12 18:30:38 +08:00 |
hiyouga
|
8664262cde
|
support layerwise galore
|
2024-03-10 00:24:11 +08:00 |
hiyouga
|
818726e9bc
|
add GaLore results
|
2024-03-09 04:11:55 +08:00 |
hiyouga
|
393c2de27c
|
update hardware requirements
|
2024-03-09 03:58:18 +08:00 |
hiyouga
|
10be2f0ecc
|
fix aqlm version
|
2024-03-09 00:09:09 +08:00 |
hiyouga
|
4a2cc60b94
|
update readme
|
2024-03-08 03:06:21 +08:00 |
hiyouga
|
33a4c24a8a
|
fix galore
|
2024-03-08 00:44:51 +08:00 |
hiyouga
|
57452a4aa1
|
add Yi-9B model
|
2024-03-07 23:11:57 +08:00 |
hiyouga
|
7230e1177d
|
add galore examples
|
2024-03-07 22:53:45 +08:00 |
hiyouga
|
28f7862188
|
support galore
|
2024-03-07 22:41:36 +08:00 |
hiyouga
|
725f7cd70f
|
update readme
|
2024-03-07 20:34:49 +08:00 |
hiyouga
|
d07ad5cc1c
|
support vllm
|
2024-03-07 20:26:31 +08:00 |
hiyouga
|
0048a2021e
|
tiny fix
|
2024-03-06 17:25:08 +08:00 |
hiyouga
|
9658c63cd9
|
fix add tokens
|
2024-03-06 15:04:02 +08:00 |
hiyouga
|
3016e65657
|
fix version checking
|
2024-03-06 14:51:51 +08:00 |
hiyouga
|
df9e6bb063
|
update readme
|
2024-03-05 03:20:23 +08:00 |
hiyouga
|
24a79bd50f
|
update readme
|
2024-03-04 19:29:26 +08:00 |
hiyouga
|
7c227e07dd
|
update readme
|
2024-03-03 01:41:07 +08:00 |
hiyouga
|
894d183214
|
update readme, add starcoder2, cosmopedia
|
2024-03-03 01:01:46 +08:00 |
hoshi-hiyouga
|
1006f372ae
|
Update README_zh.md
|
2024-03-03 00:49:08 +08:00 |
hiyouga
|
318315c76d
|
add colab demo
|
2024-03-02 19:58:21 +08:00 |
hiyouga
|
bb16502c33
|
add twitter
|
2024-02-29 17:45:30 +08:00 |
hiyouga
|
fa5ab21ebc
|
release v0.5.3
|
2024-02-29 00:34:19 +08:00 |
hiyouga
|
804c1e7083
|
add examples
|
2024-02-28 23:19:25 +08:00 |
hiyouga
|
38d8b2cef8
|
update chatglm3 template
|
2024-02-28 21:11:23 +08:00 |
hiyouga
|
a2dccce06a
|
update readme
|
2024-02-28 20:50:01 +08:00 |
hiyouga
|
cfefacaa37
|
support DoRA, AWQ, AQLM #2512
|
2024-02-28 19:53:28 +08:00 |
hiyouga
|
3ba1054593
|
update readme
|
2024-02-26 17:25:47 +08:00 |
hiyouga
|
261f631a1c
|
update readme
|
2024-02-25 16:26:08 +08:00 |
hiyouga
|
aca948da8f
|
add papers
|
2024-02-25 15:34:47 +08:00 |
hiyouga
|
ad76482cf9
|
add papers
|
2024-02-25 15:18:58 +08:00 |
hiyouga
|
c99e19641a
|
support gemma
|
2024-02-21 23:27:36 +08:00 |
hiyouga
|
daa3185350
|
tiny fix
|
2024-02-21 18:30:29 +08:00 |
hoshi-hiyouga
|
175a48d79d
|
Update README_zh.md
|
2024-02-20 16:06:59 +08:00 |
codemayq
|
95f53a46bd
|
1. update the version of pre-built bitsandbytes library
2. add pre-built flash-attn library
|
2024-02-20 11:26:22 +08:00 |
hiyouga
|
7924ffc55d
|
support llama pro #2338 , add rslora
|
2024-02-15 02:27:36 +08:00 |
hiyouga
|
7d2dc83c5e
|
improve aligner
|
2024-02-10 16:39:19 +08:00 |
hiyouga
|
54ea9684ed
|
improve fix tokenizer
|
2024-02-09 14:53:14 +08:00 |
hiyouga
|
ccabb5b04a
|
support qwen1.5
|
2024-02-06 00:10:51 +08:00 |
hiyouga
|
a0d59aa4ec
|
release v0.5.0 (real)
|
2024-01-21 01:54:49 +08:00 |
hiyouga
|
5ff10fac4f
|
fix pretrain data loader
|
2024-01-18 14:42:52 +08:00 |
hiyouga
|
5608a0da8e
|
update readme
|
2024-01-18 14:30:48 +08:00 |
hiyouga
|
5a207bb723
|
tiny fix
|
2024-01-15 23:34:23 +08:00 |
hiyouga
|
3c8e72f585
|
Update README_zh.md
|
2024-01-14 00:17:28 +08:00 |
hiyouga
|
d1a73fe26c
|
fix phi modules
|
2024-01-13 23:12:47 +08:00 |
JessyTsu1
|
cdeca0cabc
|
Update README_zh.md
|
2024-01-11 23:17:48 +08:00 |
hiyouga
|
4571068e1e
|
fix #1789
|
2024-01-09 18:31:27 +08:00 |
hiyouga
|
c7ea17d616
|
add yuan model
|
2023-12-29 13:50:24 +08:00 |
hiyouga
|
65c5b0477c
|
fix args
|
2023-12-28 18:47:19 +08:00 |
hiyouga
|
5b93d545e2
|
tiny update
|
2023-12-25 18:29:34 +08:00 |
hiyouga
|
e44b82ee24
|
update patcher
|
2023-12-23 15:24:27 +08:00 |
hiyouga
|
0ad86a4f62
|
update readme
|
2023-12-23 02:17:41 +08:00 |