hiyouga
e1f4e53915
add NPU docker images
2024-05-15 19:20:11 +08:00
hiyouga
b96d84835f
update readme
2024-05-14 23:57:08 +08:00
hiyouga
fc547ee591
update readme
2024-05-14 23:55:49 +08:00
hiyouga
c27afa296b
fix #3702
2024-05-13 18:24:35 +08:00
hiyouga
d12b8f866a
support Yi 1.5
2024-05-13 16:51:20 +08:00
hiyouga
58c522cd5c
remove checksum and fix ui args
2024-05-12 01:10:30 +08:00
hiyouga
638043ced4
update readme
2024-05-12 00:33:49 +08:00
hoshi-hiyouga
b8d5d9c8ef
Update README.md
2024-05-11 22:43:04 +08:00
BUAADreamer
508d474754
Merge branch 'hiyouga:main' into main
2024-05-10 20:34:41 +08:00
hiyouga
75aec4cf8e
resolve python 3.8 package
2024-05-09 16:52:27 +08:00
BUAADreamer
fdb3955448
add mllm processor save and Chinese-LLaVA-Med show
2024-05-09 13:53:39 +08:00
hiyouga
10ab83f4c4
add deepseek moe 236B
2024-05-08 16:37:54 +08:00
hiyouga
b3a9ae4085
update readme
2024-05-07 22:17:04 +08:00
hiyouga
92e9195b3c
update readme
2024-05-07 21:17:31 +08:00
hiyouga
5177f3ba90
update readme
2024-05-07 19:03:47 +08:00
Katehuuh
984f7fbbf7
Update README.md
...
Add Projects Nekochu/Luminia-13B-v3
2024-05-07 06:23:36 +02:00
hiyouga
8e09e20ece
update readme
2024-05-07 06:19:29 +08:00
hiyouga
f50c365871
update readme
2024-05-06 23:34:59 +08:00
hiyouga
f02f87c6fb
update example docs
2024-05-06 22:51:02 +08:00
hiyouga
34d33e2257
update docs
2024-05-06 21:47:00 +08:00
hiyouga
57a39783d1
update readme
2024-05-04 17:01:21 +08:00
hiyouga
d4283bb6bf
update readme
2024-05-04 00:43:53 +08:00
hiyouga
9d2ce57345
update readme and webui launch
2024-05-04 00:43:02 +08:00
hiyouga
1409654cef
update readme
2024-05-04 00:31:02 +08:00
hiyouga
245fe47ece
update webui and add CLIs
2024-05-03 02:58:23 +08:00
hiyouga
32347901d4
fix setup
2024-04-28 03:49:13 +08:00
hiyouga
5ee04d418c
update readme
2024-04-26 23:39:19 +08:00
hiyouga
031775ade8
update readme
2024-04-26 20:09:14 +08:00
hiyouga
375b25131b
support Qwen1.5 110B
2024-04-26 19:59:22 +08:00
hiyouga
e83e2fa897
update readme
2024-04-26 05:49:26 +08:00
hiyouga
27ba1b63ce
update readme
2024-04-26 05:44:30 +08:00
hiyouga
44a43ee152
add olmo 1.7
2024-04-24 05:50:50 +08:00
hiyouga
07737a3d2d
reenable sdpa and fast tok by default
2024-04-24 02:18:44 +08:00
hiyouga
1a13f05555
support phi-3
2024-04-24 00:28:53 +08:00
hiyouga
db7f3b9784
update readme
2024-04-22 17:09:17 +08:00
hiyouga
836ca05586
update readme
2024-04-22 00:51:35 +08:00
hiyouga
34d66a3a85
update readme
2024-04-22 00:42:25 +08:00
hiyouga
a1f1fac33b
update readme and examples
2024-04-22 00:37:32 +08:00
hiyouga
a83e7587a0
update readme
2024-04-22 00:21:01 +08:00
hiyouga
f58425ab45
fix mod stuff
2024-04-21 18:11:10 +08:00
Marco
620add7b9f
Added Mixture of Depths
2024-04-18 20:31:24 +02:00
hoshi-hiyouga
2aaaede247
support llama3
2024-04-19 01:13:50 +08:00
hiyouga
942362d008
fix #3324
2024-04-18 15:34:45 +08:00
hiyouga
3b43a3b7c5
tiny fix
2024-04-18 00:22:17 +08:00
hiyouga
e2f1c6fc6a
update readme
2024-04-17 23:40:49 +08:00
hiyouga
cab0598fd0
add mixtral 8x22B models
2024-04-17 23:35:59 +08:00
hiyouga
5d62a51c12
update readme and gradio version
2024-04-16 18:09:16 +08:00
hiyouga
e3d8fc75eb
support badam for all stages
2024-04-16 17:44:48 +08:00
hiyouga
cf52911fed
update readme
2024-04-16 02:36:54 +08:00
hiyouga
6084eb7cf1
update readme
2024-04-16 02:35:36 +08:00
hiyouga
6543f3d449
add codegemma
2024-04-16 00:11:15 +08:00
hiyouga
e0dbac2845
support cohere commandR #3184
2024-04-15 23:26:42 +08:00
hiyouga
9d4c949461
release v0.6.2
2024-04-11 20:08:51 +08:00
hiyouga
a88fe8c1af
update readme
2024-04-07 00:48:24 +08:00
hiyouga
7f6e412604
fix requires for windows
2024-04-03 21:56:43 +08:00
hiyouga
49a2dfaf90
update vllm example
2024-04-02 22:45:20 +08:00
hiyouga
66b0fe4e96
update readme
2024-04-02 22:17:48 +08:00
hiyouga
7765f337c7
add zh readme
2024-04-02 20:58:45 +08:00
hiyouga
11a6c1bad6
update readme
2024-04-02 20:37:37 +08:00
hiyouga
949e5fe638
update readme
2024-04-02 20:22:11 +08:00
hiyouga
92dab8a90b
simplify readme
2024-04-02 20:07:43 +08:00
hiyouga
54b7d34908
add qwen1.5 moe
2024-04-01 21:49:40 +08:00
hiyouga
aee634cd20
fix #3077
2024-04-01 21:35:18 +08:00
hiyouga
099db6acc0
update readme
2024-03-31 18:46:34 +08:00
hiyouga
17bf8a2c3a
support ORPO
2024-03-31 18:29:50 +08:00
hiyouga
c1fe6ce782
update readme
2024-03-28 22:02:32 +08:00
hiyouga
1e43319f9c
add project
2024-03-28 20:24:27 +08:00
hiyouga
6c94305e47
update readme
2024-03-28 18:35:11 +08:00
hiyouga
8c77b10912
update trainers
2024-03-28 18:16:27 +08:00
hiyouga
7b3d8188f5
update readme
2024-03-25 23:06:13 +08:00
hoshi-hiyouga
f633ac6646
Merge pull request #2967 from Tsumugii24/main
...
Update README_zh.md
2024-03-25 23:02:22 +08:00
Tsumugii24
1704599503
Update README.md
2024-03-25 22:54:38 +08:00
hiyouga
6f2b563f12
release v0.6.0
2024-03-25 22:38:56 +08:00
hiyouga
a1c8c98c5f
fix #2941
2024-03-24 00:28:44 +08:00
0xez
675ba41562
Update README.md, fix the release date of the paper
2024-03-21 22:14:48 +08:00
hiyouga
5eaa50fa01
add citation
2024-03-21 17:04:10 +08:00
hiyouga
0581bfdbc7
paper release
2024-03-21 13:49:17 +08:00
hiyouga
bfe7a91289
update readme
2024-03-21 00:48:42 +08:00
hiyouga
8408225162
support fsdp + qlora
2024-03-21 00:36:06 +08:00
hiyouga
9bec3c98a2
fix #2777 #2895
2024-03-20 17:59:45 +08:00
khazic
0531dac30d
Updated README with new information
2024-03-20 14:21:16 +08:00
刘一博
df9b4fb90a
Updated README with new information
2024-03-20 14:11:28 +08:00
hiyouga
72367307df
improve lora+ impl.
2024-03-13 23:32:51 +08:00
hiyouga
b3247d6a16
support olmo
2024-03-12 18:30:38 +08:00
hoshi-hiyouga
c901aa63ff
Merge pull request #2743 from S3Studio/DockerizeSupport
...
Add dockerize support
2024-03-12 00:05:49 +08:00
hiyouga
8664262cde
support layerwise galore
2024-03-10 00:24:11 +08:00
hiyouga
818726e9bc
add GaLore results
2024-03-09 04:11:55 +08:00
hiyouga
393c2de27c
update hardware requirements
2024-03-09 03:58:18 +08:00
hiyouga
10be2f0ecc
fix aqlm version
2024-03-09 00:09:09 +08:00
S3Studio
3d911ae713
Add dockerize support
...
Already tested with the model of Qwen:1.8B and the dataset of alpaca_data_zh. Some python libraries are added to the Dockerfile as a result of the exception messages displayed throughout test procedure.
2024-03-08 10:47:28 +08:00
hiyouga
4a2cc60b94
update readme
2024-03-08 03:06:21 +08:00
hiyouga
33a4c24a8a
fix galore
2024-03-08 00:44:51 +08:00
hiyouga
57452a4aa1
add Yi-9B model
2024-03-07 23:11:57 +08:00
hiyouga
7230e1177d
add galore examples
2024-03-07 22:53:45 +08:00
hiyouga
28f7862188
support galore
2024-03-07 22:41:36 +08:00
hiyouga
725f7cd70f
update readme
2024-03-07 20:34:49 +08:00
hiyouga
77211d9843
tiny fix
2024-03-07 20:29:34 +08:00
hiyouga
d07ad5cc1c
support vllm
2024-03-07 20:26:31 +08:00
hiyouga
0048a2021e
tiny fix
2024-03-06 17:25:08 +08:00
hiyouga
9658c63cd9
fix add tokens
2024-03-06 15:04:02 +08:00